Tag: Critique Model
All the papers with the tag "Critique Model".
DeepCritic: Deliberate Critique with Large Language Models
grok-3-latestScore: 0.72Published: at 17:03本文提出 DeepCritic 框架,通过两阶段训练(监督微调与强化学习)显著提升大型语言模型在数学推理任务中的批判能力,为自动化监督和模型自我改进铺平道路。