Tag: Hardware Awareness
All the papers with the tag "Hardware Awareness".
QuantX: A Framework for Hardware-Aware Quantization of Generative AI Workloads
grok-3-latestScore: 0.58Published: at 13:13QuantX 提出了一种硬件感知的量化框架,通过针对权重分布差异和硬件约束设计多种量化策略,将大型语言模型和视觉语言模型量化到3比特,同时保持性能损失在6%以内,显著优于现有方法。