Tag: Scaling Laws
All the papers with the tag "Scaling Laws".
The power of fine-grained experts: Granularity boosts expressivity in Mixture of Experts
grok-3-latestScore: 0.60Published: at 04:35本文通过理论证明和实验验证,揭示了混合专家模型中高粒度显著提升表达能力,为未来高效模型设计提供了重要理论依据。