Migrating kaldi tests #671

bhargavkathivarapu · 2020-05-31T09:11:52Z

Hi,
Regarding #597, this PR moves the tests from test/test_compliance_kaldi.py to test/kaldi_compatibility_impl.py.

Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

bhargavkathivarapu · 2020-05-31T11:33:49Z

The fbank, mfcc, and spectrogram outputs of kaldi and torchaudio's kaldi implementation do not match within the defined threshold in a few cases. In the other cases they match within the threshold.

For resample, what is the kaldi command to get the resampled waveform? For now I have set the kaldi result for resample to None.

mthrok · 2020-05-31T14:39:55Z

Hi @bhargavkathivarapu

Thanks for working on this.

First, let's put the migration of different types of tests into separate PRs and start from an existing one, fbank or mfcc.

Second, instead of a simple text file, can you use the JSON Lines format (with a .json extension) so that no string parsing is required in the test code? Then we do not have to add _get_func_args, _parse, and TEST_PREFIX.
JSON Lines format is one JSON object (a dict with proper argument names and values) per line, which can be read like

import json

def _load_jsonl(path):
    # Open the path passed in rather than a hard-coded file name.
    with open(path, 'r') as file:
        return [json.loads(line) for line in file]
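
A minimal round-trip sketch of the JSON Lines idea, using only the standard library. The parameter names (num_mel_bins, use_energy) and the file name fbank.json are illustrative assumptions, not the contents of the actual parameter file in this PR. The loader here also skips blank lines, which the three-line version above does not.

```python
import json
import os
import tempfile


def load_jsonl(path):
    """Read a JSON Lines file: one JSON object per non-empty line."""
    with open(path, "r") as file:
        return [json.loads(line) for line in file if line.strip()]


# Hypothetical parameter sets; the keys are illustrative only.
params = [
    {"num_mel_bins": 23, "use_energy": False},
    {"num_mel_bins": 40, "use_energy": True},
]

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "fbank.json")
    # Write one JSON object per line.
    with open(path, "w") as file:
        for kwargs in params:
            file.write(json.dumps(kwargs) + "\n")
    # Each line comes back as a dict, so no string parsing is needed.
    loaded = load_jsonl(path)
```

Because each entry is already a dict, a test can pass it straight through as keyword arguments.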

Third, when working on different types of tests, can you use different parameter files? One JSON Lines file for fbank, one for mfcc, etc.

Fourth, the current implementation stops at the first test error. Let's try using parameterized.expand to list all such failures.
@vincentqb This is my temporary suggestion for parameterization. PyTorch has some discussion going on in pytorch/pytorch#11578, but it has stalled. Once they implement their own parameterization, we can migrate to it.

Combining the second and fourth suggestions, the required change should be fairly simple.

From

def test_fbank(self):

to

@parameterized.expand(_load_jsonl(path_to_parameter_jsonl_file))
def test_fbank(self, kwargs):

And you will need to add dependencies to CI config here and here

mthrok · 2020-05-31T14:42:43Z

test/kaldi_compatibility_impl.py

@@ -48,6 +89,8 @@ def _run_kaldi(command, input_type, input_value):


 class Kaldi(common_utils.TestBaseMixin):
+    test_8000_filepath = common_utils.get_asset_path('kaldi_file_8000.wav')


Please keep this inside each test. The intended usage of the asset file is not immediately clear when someone else works on adding a new test, so it is not well suited to be a common property.

README.md

bhargavkathivarapu · 2020-06-01T06:32:45Z

@mthrok, I submitted another PR (#672) just for fbank. It includes the following:

Separate JSON file for the fbank args
Adding the parameterized dependency to the linux and windows yml files
Using the JSON loader and parameterized.expand in tests

But the CI tests are not getting triggered for that pull request.
I tried reverting the yaml to the previous configuration without parameterized, but the commit is not triggering CI.

bhargavkathivarapu added 3 commits May 31, 2020 14:24

fbank

579876c

Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

MFCC and spec

2abf7d4

Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

spec correction

4e96809

Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

bhargavkathivarapu marked this pull request as ready for review May 31, 2020 11:33

mthrok reviewed May 31, 2020

bhargavkathivarapu closed this Jun 2, 2020
