Daily Paper Machine

Tag: Training Strategy

All the papers with the tag "Training Strategy".

Bielik v3 Small: Technical Report
grok-3-latest
Score: 0.67
Published:2025年5月5日 at 10:39
#LLM, #Parameter Efficiency, #Tokenizer Optimization, #Data Quality, #Training Strategy
本文通过创新架构设计、高质量波兰语数据处理和高效训练策略，开发了参数高效的 Bielik v3 模型（1.5B 和 4.5B），为资源受限语言的语言模型开发树立了新标杆。
What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction
grok-3-latest
Score: 0.79
Published:2025年5月4日 at 11:46
#LLM, #Distribution Estimation, #Response Prediction, #Training Strategy, #Inference Method
本文提出一个理论框架，区分了大型语言模型输出概率的三种解释（源分布估计、目标分布估计、响应预测），并揭示了现有研究中因混淆这些分布导致的误解，为概率解释和模型设计提供了理论基础。