Add wsj0mix dataset #895

mthrok · 2020-08-28T16:35:17Z

Add wsj0mix dataset
Add test to test/torchaudio_unittest

examples/source_separation/utils/dataset/wsj0mix.py

vincentqb · 2020-09-10T16:24:19Z

as mentioned here, training on an open dataset like LibriMix would allow for more users to experiment with the dataset

vincentqb · 2020-10-12T18:04:48Z

examples/source_separation/utils/dataset/wsj0mix.py

+    def _load_audio(self, path) -> torch.Tensor:
+        waveform, sample_rate = torchaudio.load(path)
+        if sample_rate != self.sample_rate:
+            raise ValueError(
+                f"The dataset contains audio file of sample rate {sample_rate}. "
+                "Where the requested sample rate is {self.sample_rate}."
+            )
+        return waveform


What does this function serve beyond wrapping load? Ensures sample rate is the same?

vincentqb · 2020-10-12T18:11:27Z

examples/source_separation/utils/dataset/utils.py

+
+from . import wsj0mix
+
+Batch = namedtuple("Batch", ["mix", "src", "mask"])


name may be misleading?

vincentqb

Minor comments, but LGTM

cpuhrsch · 2020-10-12T18:17:28Z

examples/source_separation/utils/dataset/wsj0mix.py

+        self.root = Path(root)
+        self.sample_rate = sample_rate
+        self.mix_dir = (self.root / "mix").resolve()
+        self.src_dirs = [(self.root / f"s{i+1}").resolve() for i in range(num_speakers)]


nit: could use os path join

mthrok · 2020-10-12T20:27:04Z

thanks!

mthrok mentioned this pull request Aug 28, 2020

Adding Conv-TasNet #897

Closed

14 tasks

mthrok force-pushed the conv-tasnet-wsj0mix branch 2 times, most recently from f4186b6 to e368349 Compare September 1, 2020 17:09

mthrok force-pushed the conv-tasnet-wsj0mix branch from e368349 to a7f0e2a Compare September 8, 2020 22:23

vincentqb reviewed Sep 10, 2020

View reviewed changes

examples/source_separation/utils/dataset/wsj0mix.py Outdated Show resolved Hide resolved

mthrok force-pushed the conv-tasnet-wsj0mix branch from ace5b6b to c871382 Compare September 28, 2020 19:06

mthrok changed the base branch from conv-tasnet to master September 28, 2020 19:06

mthrok force-pushed the conv-tasnet-wsj0mix branch 2 times, most recently from 51f83e3 to 8763416 Compare October 6, 2020 20:39

Add wsj0mix dataset

89c1c67

mthrok force-pushed the conv-tasnet-wsj0mix branch from 8763416 to 89c1c67 Compare October 6, 2020 20:42

vincentqb reviewed Oct 12, 2020

View reviewed changes

cpuhrsch approved these changes Oct 12, 2020

View reviewed changes

vincentqb reviewed Oct 12, 2020

View reviewed changes

vincentqb approved these changes Oct 12, 2020

View reviewed changes

cpuhrsch reviewed Oct 12, 2020

View reviewed changes

mthrok merged commit 2d87913 into pytorch:master Oct 12, 2020

mthrok deleted the conv-tasnet-wsj0mix branch October 12, 2020 20:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add wsj0mix dataset #895

Add wsj0mix dataset #895

mthrok commented Aug 28, 2020 •

edited

vincentqb commented Sep 10, 2020

vincentqb Oct 12, 2020 •

edited

vincentqb Oct 12, 2020

vincentqb left a comment

cpuhrsch Oct 12, 2020

mthrok commented Oct 12, 2020


		from . import wsj0mix

		Batch = namedtuple("Batch", ["mix", "src", "mask"])

Add wsj0mix dataset #895

Add wsj0mix dataset #895

Conversation

mthrok commented Aug 28, 2020 • edited

vincentqb commented Sep 10, 2020

vincentqb Oct 12, 2020 • edited

Choose a reason for hiding this comment

vincentqb Oct 12, 2020

Choose a reason for hiding this comment

vincentqb left a comment

Choose a reason for hiding this comment

cpuhrsch Oct 12, 2020

Choose a reason for hiding this comment

mthrok commented Oct 12, 2020

mthrok commented Aug 28, 2020 •

edited

vincentqb Oct 12, 2020 •

edited