Verified mel spectrograms in winml sample against pytorch #400

janezdu · 2021-07-23T17:53:10Z

Main changes included in this PR involve:

Tuned the mel spectrogram settings to match more closely with PyTorch. Rendering the tensor data from a learning model in python/librosa shows that the learning model creates a mel spectrogram very similar to the ones made by torchaudio.
UI controls for the settings of the melspectrogram added
Resampling via learning model added
Some refactoring in PreprocessModel.cs; methods are now "less static" and include information about the audio file/mel spectrogram settings stored as instance properties.
Flip and transpose mel spectrogram in learning model so that time axis goes left to right and Mel matrix goes low to high (up to down)

…-Learning into audio-resampling

janezdu added 10 commits July 6, 2021 11:22

New HSV colouring, implemented in cs

805e3db

Compare hsv to rgb

77085b5

Refactor colorizing code

c45f2d6

Add toggles to UI

2b89b27

Hook up UI controls to melspectrogram settings

b9dec10

Merge branch 'master' of https://github.com/microsoft/Windows-Machine…

7656bb8

…-Learning into audio-resampling

code + ui cleanup

c2d7c3e

Refactor and resampling via convolution

8c55c97

Remove transpose and flip for verification with pytorch

14ec744

Code cleanup and updated preprocessing .onnx models

b87c90b

janezdu requested a review from a team as a code owner July 23, 2021 17:53

Provide feedback