Tag: Software Engineering
All the papers with the tag "Software Engineering".
Software Development Life Cycle Perspective: A Survey of Benchmarks for CodeLLMs and Agents
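grok-3-latest | Score: 0.39 | Published: at 14:27
By systematically analyzing 181 benchmarks for CodeLLMs and agents, this paper reveals an imbalance in evaluation coverage across the stages of the software development life cycle (SDLC) and offers comprehensive guidance for future benchmark design.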