Daily Paper Machine

Tag: Speech Synthesis

All the papers with the tag "Speech Synthesis".

Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications
grok-3-latest
Score: 0.56
Published:2025年5月12日 at 16:10
#TTS, #End-to-End, #Lightweight Model, #Speech Synthesis, #Low Resource
本文提出了一种轻量级端到端文本转语音模型（LE2E），通过联合训练声学模型和声码器，在低资源设备上实现了高质量实时语音合成，参数量减少90%且速度提升10倍。
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
grok-3-latest
Score: 0.73
Published:2025年5月5日 at 12:53
#LLM, #Speech Synthesis, #Streaming Generation, #Modular Design, #Real-time Interaction
LLaMA-Omni 2 通过模块化设计和自回归流式语音生成，以较低成本实现高质量端到端语音交互，显著超越依赖大规模数据的基线模型。
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
grok-3-latest
Score: 0.77
Published:2025年5月5日 at 12:53
#LLM, #Speech Synthesis, #Streaming Generation, #Modular Design, #Real-Time Interaction
LLaMA-Omni 2 通过模块化设计和自回归流式语音生成技术，显著提升了实时语音交互的智能性、自然性和低延迟表现，超越了现有 SpeechLM 模型。