2026 - Fish Audio S2 Technical Report
Under review: arXiv:2603.08823
2026 - Physics Encoded Spatial and Temporal Generative Adversarial Network for Tropical Cyclone Image Super-resolution
Under review: arXiv:2602.17277
2025 - FCPE: A Fast Context-based Pitch Estimation Model
Under review: arXiv:2509.15140
2025 - MIKU-PAL: An Automated and Standardized Multimodal Method for Speech Paralinguistic and Affect Labeling
Published in: Interspeech 2025 (Main Track), DOI: 10.21437/Interspeech.2025-648
2024 - Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Published in: arXiv:2411.01156