Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

On-the-Fly Adaptation of MacroMoE LLMs Using Task-Specific Low-Rank Adapters

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • المؤلفون: Start, Johannes; Lunney, John
  • المصدر:
    Defensive Publications Series
  • نوع التسجيلة:
    text
  • اللغة:
    unknown
  • معلومة اضافية
    • بيانات النشر:
      Technical Disclosure Commons
    • الموضوع:
      2025
    • Collection:
      Technical Disclosure Common
    • نبذة مختصرة :
      A mixture of experts (MoE) is a way of building machine learning models (including large language models) that improves efficiency and scalability by activating only a subset of the model parameters during each inference step. The activated model-parameter subset is an expert in the domain of the input query. This disclosure describes techniques for efficient, dynamic specialization of LLMs that are based on a macro-MoE architecture. A plurality of foundational, pre-trained LLMs (‘macro-experts’), a library of distinct low-rank adaptation (LoRA) modules, and an adaptive gating network is provided. Upon receiving an input prompt, the gating network analyzes the prompt and selects an optimal pair that comprises a foundational macro-expert and a task-specific LoRA module from the library. During inference, a dynamic application engine applies the selected LoRA module to the chosen expert, creating a temporary, highly specialized model to handle the request. By thus combining generalist models with specialist adapters, fine-grained customization becomes possible, such that a single system can exhibit expert-level performance across multiple domains without the computational and storage costs of maintaining numerous fine-tuned models.
    • File Description:
      application/pdf
    • Relation:
      https://www.tdcommons.org/dpubs_series/8469; https://www.tdcommons.org/context/dpubs_series/article/9683/viewcontent/On_the_Fly_Adaptation_of_MacroMoE_LLMs_Using_Task_Specific_Low_Rank_Adapters.pdf
    • الدخول الالكتروني :
      https://www.tdcommons.org/dpubs_series/8469
      https://www.tdcommons.org/context/dpubs_series/article/9683/viewcontent/On_the_Fly_Adaptation_of_MacroMoE_LLMs_Using_Task_Specific_Low_Rank_Adapters.pdf
    • Rights:
      http://creativecommons.org/licenses/by/4.0/
    • الرقم المعرف:
      edsbas.67EDB1F6