Haohe Liu 刘濠赫 (Leo)


Email: haohe.liu AT surrey dot ac dot uk

At the Pont de Bir-Hakeim, Paris

I’m Haohe Liu, a final-year PhD student at the Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey. I’m the first author of papers such as AudioLDM, AudioLDM 2, NaturalSpeech, VoiceFixer, MusicLDM, and AudioSR, with 50+ research publications and over 1,800 citations. My open-source projects and checkpoints on GitHub have received over 8,300 stars and have been downloaded more than 150,000 times.

My research covers speech, music, and general audio. I am fortunate to be advised by Prof. Mark D. Plumbley and co-supervised by Prof. Wenwu Wang, and to be jointly funded by BBC R&D and the Doctoral College. I’m a team member of the EPSRC AI for Sound Project (EP/T019751/1). Most of my work is open-sourced.

Research highlights

My research spans audio generative models, source separation, quality enhancement, and recognition, and has appeared in journals and conferences such as TPAMI, TASLP, ICML, AAAI, NeurIPS, INTERSPEECH, and ICASSP.

Highlighted research performed as the first author:

Please refer to my Google Scholar page for the full publication list.

Recent News

Education Experience

Centre for Vision, Speech and Signal Processing @ University of Surrey, UK, 01/2022 - 01/2025
– PhD in Vision, Speech and Signal Processing; Main advisor: Prof. Mark D. Plumbley
– With a studentship from the CVSSP and the EPSRC Grant EP/T019751/1 AI for Sound

School of Computer Science @ Northwestern Polytechnical University, China, 09/2016 - 07/2020
– Bachelor of Engineering, Outstanding graduate, Computer Science and Technology; Advisor: Prof. Lei Xie
– GPA: 3.8/4.0 (Top 5%)

Competitions

Honors & Awards

Scholarships

Teaching
