2025 - FCPE: A Fast Context-based Pitch Estimation Model
Under review: arXiv:2509.15140
2025 - MIKU-PAL: An Automated and Standardized Multimodal Method for
Speech Paralinguistic and Affect Labeling
Published in: Interspeech 2025 (Main Track) 10.21437/Interspeech.2025-648
2024 - Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Published in: arXiv:2411.01156