Tag: Speech Synthesis
All the papers with the tag "Speech Synthesis".
Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications
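grok-3-latest | Score: 0.56 | Published: at 16:10
This paper proposes a lightweight end-to-end text-to-speech model (LE2E) that jointly trains the acoustic model and the vocoder, achieving high-quality real-time speech synthesis on low-resource devices with 90% fewer parameters and a 10x speedup.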
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
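grok-3-latest | Score: 0.73 | Published: at 12:53
LLaMA-Omni 2 uses a modular design and autoregressive streaming speech generation to deliver high-quality end-to-end spoken interaction at relatively low cost, significantly outperforming baseline models that rely on large-scale data.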
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
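grok-3-latest | Score: 0.77 | Published: at 12:53
LLaMA-Omni 2 uses a modular design and autoregressive streaming speech generation to significantly improve the intelligence, naturalness, and latency of real-time spoken interaction, surpassing existing SpeechLM models.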