`nara_wpe` based WPE dereverberation as data augmentation #663

pzelasko · 2022-04-10T16:58:21Z

I added WPE using nara_wpe (optional dependency) as a torch.nn.Module DereverbWPE. I was able to verify by listening to the outputs that it appears to work correctly (although the audible differences are very slight, at least for single-channel data).

It also moves AudioTransform to a different file without any changes. Ultimately I didn't add a complementary cut.dereverb_wpe() API, but it should be straightforward to do by following other data aug examples (e.g. perturb_speed, reverb_rir) if somebody needs it.

boeddeker · 2023-03-16T16:12:13Z

In fgnt/nara_wpe#72 TeaPoly proposed to stabilize WPE by using the same idea as the numpy version to stabilize the code. This is not possible, because torch works differently (The async execution on the GPU doesn't allow that.)

In General, I recommend using the numpy version of WPE, since it is way better tested, and I found some situations, where numpy has a better numerical stability.

In lhotse, I found

    def __call__(self, samples: np.ndarray, *args, **kwargs) -> np.ndarray:
        if isinstance(samples, np.ndarray):
            samples = torch.from_numpy(samples)
        augmented = dereverb_wpe_torch(samples, **asdict(self))
        return augmented.numpy()

I personally would change it to the numpy implementation. Since the code doesn't use the GPU, it shouldn't have a relevant impact on the runtime.

@boeddeker

Thanks to @boeddeker for the [suggestion](#663 (comment)).

WPE dereverberation as data augmentation

994f719

pzelasko added this to the v1.1 milestone Apr 10, 2022

pzelasko mentioned this pull request Apr 10, 2022

PyTorch module emits UserWarning fgnt/nara_wpe#66

Open

pzelasko added 4 commits April 10, 2022 13:07

Add nara_wpe dependency for tests

136d9a8

black

9d5ef33

Add refs to nara_wpe and more docs

b495fa1

black

a54fca5

pzelasko merged commit 8968fbe into master Apr 10, 2022

pzelasko mentioned this pull request Mar 17, 2023

(cut|recording).dereverb_wpe() API + more stable numpy version #1000

Merged

pzelasko added a commit that referenced this pull request Mar 22, 2023

(cut|recording).dereverb_wpe() API + more stable numpy version (#1000)

2e8182f

Thanks to @boeddeker for the [suggestion](#663 (comment)).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`nara_wpe` based WPE dereverberation as data augmentation #663

`nara_wpe` based WPE dereverberation as data augmentation #663

pzelasko commented Apr 10, 2022

boeddeker commented Mar 16, 2023

nara_wpe based WPE dereverberation as data augmentation #663

nara_wpe based WPE dereverberation as data augmentation #663

Conversation

pzelasko commented Apr 10, 2022

boeddeker commented Mar 16, 2023

`nara_wpe` based WPE dereverberation as data augmentation #663

`nara_wpe` based WPE dereverberation as data augmentation #663