Daily Paper Machine

Tag: LLM

All the papers with the tag "LLM".

MF-LLM: Simulating Collective Decision Dynamics via a Mean-Field Large Language Model Framework
grok-3-latest
Score: 0.58
Published:2025年4月30日 at 12:41
#LLM, #Social Simulation, #Mean Field Theory, #Feedback Loop, #Fine-Tuning
本文提出MF-LLM框架，通过均场理论与大型语言模型的结合及基于信息瓶颈的IB-Tune微调方法，显著提升了集体决策动态模拟的保真度和可扩展性。
Phi-4-reasoning Technical Report
grok-3-latest
Score: 0.73
Published:2025年4月30日 at 05:05
#LLM, #Reasoning, #Supervised Fine-Tuning, #Reinforcement Learning, #Inference Scaling
本文通过监督微调和强化学习，基于 14B 参数的 Phi-4 模型开发出 Phi-4-reasoning 和 Phi-4-reasoning-plus，显著提升复杂推理任务性能并展现出与更大规模模型的竞争力。
Memorization and Knowledge Injection in Gated LLMs
grok-3-latest
Score: 0.71
Published:2025年4月30日 at 00:28
#LLM, #Continual Learning, #Memory Embedding, #Gating Mechanism, #Knowledge Integration
本文提出MEGa框架，通过门控LoRA模块将新记忆嵌入大型语言模型权重中，有效缓解持续学习中的灾难性遗忘，并在记忆回忆与知识整合任务上取得显著成果。