Tag: Data Augmentation
All the papers with the tag "Data Augmentation".
$ extit{New News}$: System-2 Fine-tuning for Robust Integration of New Knowledge
grok-3-latestScore: 0.75Published: at 12:49本文提出 System-2 Fine-tuning(Sys2-FT)方法,通过自我生成数据显著提升大型语言模型对新知识的权重内学习能力,并揭示上下文遮蔽效应对微调的影响。
On the generalization of language models from in-context learning and finetuning: a controlled study
grok-3-latestScore: 0.84Published: at 17:02本文通过控制实验揭示上下文学习在系统性泛化任务上优于微调,并提出通过上下文推理增强微调数据的方法,显著提升了微调的泛化能力。