Tag: LLM

All the papers with the tag "LLM".

AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning
grok-3-latest
Score: 0.48
Published: May 6, 2025 at 09:06
#LLM, #Prompt Engineering, #Workflow Design, #Reasoning, #Bias Mitigation
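This paper proposes Persistent Workflow Prompting (PWP), which uses a structured prompt library and meta-prompting to guide large language models through complex scholarly peer-review tasks, and reports preliminary success in suppressing input bias.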
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation
grok-3-latest
Score: 0.64
Published: May 6, 2025 at 08:05
#LLM, #Tool Selection, #Retrieval-Augmented Generation, #Context Management, #Scalability
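This paper proposes the RAG-MCP framework, which uses retrieval-augmented generation to dynamically select relevant tool descriptions, significantly mitigating prompt bloat when large language models use tools at scale and substantially improving tool-selection accuracy.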
Patterns and Mechanisms of Contrastive Activation Engineering
grok-3-latest
Score: 0.58
Published: May 6, 2025 at 05:15
#LLM, #Activation Engineering, #Steering Vector, #Inference Time Control, #Generalization
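This paper systematically analyzes the patterns and mechanisms of Contrastive Activation Engineering (CAE), finding that it is effective in-distribution but generalizes poorly out of distribution and that it degrades model perplexity, providing an important reference for practical applications.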
Soft Best-of-n Sampling for Model Alignment
grok-3-latest
Score: 0.71
Published: May 6, 2025 at 04:03
#LLM, #Model Alignment, #Sampling, #Reward Optimization, #KL Divergence
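This paper proposes Soft Best-of-n sampling, which uses a temperature parameter λ to trade off smoothly between reward optimization and distributional similarity, and proves theoretically that it approaches the optimal tilted distribution at a rate of O(1/n), offering an efficient and flexible inference-time strategy for aligning large language models.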
AutoLibra: Agent Metric Induction from Open-Ended Feedback
grok-3-latest
Score: 0.46
Published: May 5, 2025 at 17:47
#Agent Evaluation, #Human Feedback, #Metric Induction, #Behavior Analysis, #LLM
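AutoLibra proposes a framework that automatically induces fine-grained, interpretable evaluation metrics for AI agents from open-ended human feedback, significantly improving agent evaluation and refinement.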
HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models
grok-3-latest
Score: 0.77
Published: May 5, 2025 at 17:09
#LLM, #Parameter-Efficient Fine-Tuning, #Split Learning, #Heterogeneous Computing, #Adaptation
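HSplitLoRA proposes a heterogeneous parameter-efficient fine-tuning framework that, through important-weight identification, adaptive rank and model-split configuration, and noise-free adapter aggregation, significantly improves the fine-tuning performance and efficiency of large language models in heterogeneous environments.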
