2025 - MIKU-PAL: An Automated and Standardized Multimodal Method for
Speech Paralinguistic and Affect Labeling
Published in: Interspeech 2025 (Main Track) arXiv:2505.15772
2024 - Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Published in: arXiv:2411.01156