Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data collection for http://www.openslr.org/1/ #84

Open
emzee831 opened this issue Oct 14, 2019 · 3 comments
Open

Data collection for http://www.openslr.org/1/ #84

emzee831 opened this issue Oct 14, 2019 · 3 comments

Comments

@emzee831
Copy link

Hello, looking for some direction for my first dataset contribution. I've copied the repo to my local computer and have installed the environment.

@emzee831
Copy link
Author

I'm also trying to run test files and I'm getting this error.
Traceback (most recent call last):
File "test_fluent_speech.py", line 8, in
from . import reader_test as rt
ImportError: cannot import name 'reader_test' from 'main' (test_fluent_speech.py)

Sorry for all the questions, I'm new to this.

@ynop
Copy link
Owner

ynop commented Oct 14, 2019

No problem.

For running the tests you can use pytest.
When you execute pytest in the main folder all tests are executed.
Or you can specify specific test files.

Or you run it using a IDE. For example PyCharm:
https://github.com/ynop/audiomate#running-the-test-suite

@ynop
Copy link
Owner

ynop commented Oct 14, 2019

For the general workflow I normally do:

  1. Create a mock dataset. Equal to the original one but with only a few samples/utterances. Only empty audio files.
  2. Create a test (use some other tests as a reference (There is a base test, so you only have to specifiy what utterances/files/... that are expected)
  3. Implement the reader based on the mock dataset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants