Phase retrieval with Bregman divergences and application to audio signal recovery

Pierre-Hugo Vial, Paul Magron, Thomas Oberlin, Cédric Févotte
IEEE Journal of Selected Topics in Signal Processing, Vol. 15, no. 1, pp. 51-64, January 2021
[Paper] [Code]

On this page, we provide several audio excerpts that illustrate the audio signal recovery experiments presented in the paper. The speech excerpts are extracted from the TIMIT dataset, and the music snippets are extracted from the FMA dataset.


Speech recovery from exact spectrogram

Original:  Random phase: 

Gradient descents ADMM GLA-like
Quadratic Quadratic GLA
KL (left) KL (left) FGLA
KL (right) IS (left) GL-ADMM
β=0.5 (right)

Music recovery from exact spectrogram

Original:  Random phase: 

Gradient descents ADMM GLA-like
Quadratic Quadratic GLA
KL (left) KL (left) FGLA
KL (right) IS (left) GL-ADMM
β=0.5 (right)

Speech recovery from modified spectrogram

Original: 

Input SNR: 10 dB

Random phase: 

Gradient descents ADMM GLA-like
Quadratic Quadratic GLA
KL (left) KL (left) FGLA
KL (right) IS (left) GL-ADMM
β=0.5 (right)

Input SNR: 0 dB

Random phase: 

Gradient descents ADMM GLA-like
Quadratic Quadratic GLA
KL (left) KL (left) FGLA
KL (right) IS (left) GL-ADMM
β=0.5 (right)

Input SNR: -10 dB

Random phase: 

Gradient descents ADMM GLA-like
Quadratic Quadratic GLA
KL (left) KL (left) FGLA
KL (right) IS (left) GL-ADMM
β=0.5 (right)

Input SNR: -20 dB

Random phase: 

Gradient descents ADMM GLA-like
Quadratic Quadratic GLA
KL (left) KL (left) FGLA
KL (right) IS (left) GL-ADMM
β=0.5 (right)