Demo Of VoiceFixer
Voicefixer aims at the restoration of degraded human speech. It can handle noise, reveberation, low resolution (2kHz~44.1kHz) and clipping (0.1-1.0 threshold) effect within one model.

Built by Haohe Liu on Sep.14.2021 in Shanghai. With wild typhoon Chanthu outside.


Contents

1. About

models

2. Testset Data Demos

2.1 General Speech Restoration (ALL-GSR)

Speaker info
33_simulated
Spectrogram
127_simulated
Spectrogram

2.2 Speech Super Resolution (SR)

2.2.1 From 4kHz to 44.1kHz

Speaker info
p361_056
Spectrogram
p374_098
Spectrogram

2.2.2 From 8kHz to 44.1kHz

Speaker info
p360_002
Spectrogram
p361_002
Spectrogram

2.2.3 From 24kHz to 44.1kHz

Speaker info
p360_002
Spectrogram
p361_002
Spectrogram

2.3 Speech Enhancement (DENOISE)

Speaker info
p232_005
Spectrogram
p257_008
Spectrogram

2.4 Speech Dereverberation (DEREV)

Speaker info
p361_001
Spectrogram
p363_004
Spectrogram

2.5 Speech Declipping (DECLI)

Clipping threshold 0.1

Speaker info
p360_001
Spectrogram

2.5.1 Clipping threshold 0.25

Speaker info
p360_001
Spectrogram

3.1 Comparison between enhancement only SSR, Regression based GSR and VoiceFixer GSR

Speaker info
A recording of my voice.
Spectrogram
TV news interviews.
Spectrogram
Chinese Youtuber.
Spectrogram

3.2 More demos

Speaker info Before After Speaker info Before After
Speech by Bruce Lee (1940-1973) Speech by Amelia Earhart (1897-1937)
Spectrogram Spectrogram
Documentary film Speech by Hu Shi (1891-1962)
Spectrogram Spectrogram