Implementation discrepancies with paper #8

NeuronAppreciator · 2024-09-13T03:10:53Z

Hi,

I'm trying to manually implement this in another language. Can you confirm this codebase is the one that produced the results in "Speech Denoising without Clean Training Data: a Noise2Noise Approach"? There are a few discrepancies I've noticed so far:

A model complexity of (45//1.414) would seem to result in 31 encoder channels (and 62 for deeper ones), rather than 32 as described.
The complex batchnorm module seems to implement batchnorm separately on real and imaginary components of the complex number, rather than using the whitening approach described in “Deep complex networks"
Similarly the masking process seems to multiply real and imaginary components of the spectrogram separately rather than using complex multiplication.

I appreciate any insight you may have about these points.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation discrepancies with paper #8

Implementation discrepancies with paper #8

NeuronAppreciator commented Sep 13, 2024

Implementation discrepancies with paper #8

Implementation discrepancies with paper #8

Comments

NeuronAppreciator commented Sep 13, 2024