Wavenet #2

Open
pranavmalikk opened this issue Jun 7, 2023 · 1 comment

pranavmalikk commented Jun 7, 2023

The paper states: "Our model resembles WaveNet (Oord et al., 2016a) in the use of tree-structured dilated convolutions. However, our principle-guided design has distinct skip-connection structures and filter sharing patterns, resulting in significantly better parameter efficiency and performance... Additionally, the link we establish between wavelets and tree-structured dilated causal convolutions offers the first principled justification for the effectiveness of WaveNet in modeling raw audio waveforms, an exemplary case of lengthy sequences with multiscale structure."

Do you have any ablations comparing performance against WaveNet on specific tasks or tests? Are there any audio samples as well? Overall, a very interesting paper!

thjashin (Owner) commented

Hi @pranavmalikk,

We did not include WaveNet results in the paper because we found it very difficult to design a fair setup. There are many choices to make before we can set up such a comparison:

  1. First of all, WaveNet has no official open-source implementation, and many details are unclear from the paper. Existing re-implementations all differ to some extent (in initialization, bias, and latent dimension choices). Which one should we use?
  2. WaveNet was originally developed for generation. Should we add a mean-pooling layer to it and then compare on the classification tasks in our paper?
  3. WaveNet does not use mixing layers or normalization. Should we keep those components and only replace MultiresLayer with a WaveNet block, or do we want to compare the whole MultiresNet with a WaveNet? In either case, how should we choose the hyper-parameters of WaveNet?

If you can be specific about these questions, I am happy to run an ablation and post the results here.
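
To make the second question concrete, here is a minimal sketch of what "WaveNet with mean-pooling" could look like. It is not the setup used in the paper or in this repository. It is a single-channel, pure-Python illustration that assumes kernel size 2, doubling dilations, and the gated tanh/sigmoid activation described in the WaveNet paper, and it leaves out the 1x1 convolutions and skip connections. All function names are hypothetical.

```python
import math


def dilated_causal_conv(x, w, dilation):
    """Compute y[t] = sum_k w[k] * x[t - k * dilation].

    Inputs before the start of the sequence are treated as zero,
    so y[t] never depends on x[s] for s > t (causality).
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, wk in enumerate(w):
            s = t - k * dilation
            if s >= 0:
                acc += wk * x[s]
        out.append(acc)
    return out


def gated_block(x, w_filter, w_gate, dilation):
    """Simplified WaveNet-style block: tanh(filter) * sigmoid(gate) + residual."""
    f = dilated_causal_conv(x, w_filter, dilation)
    g = dilated_causal_conv(x, w_gate, dilation)
    z = [math.tanh(a) / (1.0 + math.exp(-b)) for a, b in zip(f, g)]
    return [xi + zi for xi, zi in zip(x, z)]


def receptive_field(kernel_size, dilations):
    """Number of input steps that can influence one output step."""
    return 1 + (kernel_size - 1) * sum(dilations)


def mean_pooled_feature(x, blocks):
    """Stack gated blocks, then mean-pool over time (assumed classification head input)."""
    h = x
    for w_filter, w_gate, dilation in blocks:
        h = gated_block(h, w_filter, w_gate, dilation)
    return sum(h) / len(h)


# Hypothetical weights; dilations double at each layer (1, 2, 4, 8).
blocks = [([0.5, -0.3], [0.2, 0.1], d) for d in (1, 2, 4, 8)]
signal = [math.sin(0.3 * t) for t in range(32)]
feature = mean_pooled_feature(signal, blocks)
```

With these assumptions, four layers cover `receptive_field(2, [1, 2, 4, 8])` = 16 input steps. A fair comparison would still need the remaining choices pinned down: channel width, initialization, and whether mixing layers and normalization are kept.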
