Adding loader for jamendo moodtheme #505

PRamoneda · 2021-04-28T16:41:27Z

Title

Please use the following title: "Adding loader for MyDATASET". If your pull request is work in progress, change your title to "[WIP] Adding loader for MyDATASET" to avoid reviews while the loader is not ready.

Description

Please include the following information at the top level docstring for the dataset's module mydataset.py:

Describe annotations included in the dataset
Indicate the size of the dataset (e.g. number of files and total duration in hours)
Mention the origin of the dataset (e.g. creator, institution)
Describe the type of music included in the dataset
Indicate any relevant papers related to the dataset
Include a description about how the data can be accessed and the license it uses (if applicable)

Dataset loaders checklist:

Create a script in scripts/, e.g. make_my_dataset_index.py, which generates an index file.
Run the script on the canonical version of the dataset and save the index in mirdata/indexes/ e.g. my_dataset_index.json.
Create a module in mirdata, e.g. mirdata/my_dataset.py
Create tests for your loader in tests/datasets/, e.g. test_my_dataset.py
Add your module to docs/source/mirdata.rst and docs/source/quick_reference.rst
Run tests/test_full_dataset.py on your dataset.
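
As an illustration of the first two checklist items, here is a minimal sketch of an index-generating script. It is not the script used in this PR. The function names (`md5_of`, `make_index`), the `audio` folder, the `.mp3` extension, the use of the file stem as track id, and the `"version"` value are all assumptions made for the example; the exact index layout should be checked against an existing file in mirdata/indexes/.

```python
import hashlib
import json
from pathlib import Path


def md5_of(path: Path) -> str:
    """Return the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_index(data_home: str, audio_dir: str = "audio", ext: str = ".mp3") -> dict:
    """Map each track id to the relative path and checksum of its audio file."""
    root = Path(data_home)
    tracks = {}
    for audio_path in sorted((root / audio_dir).rglob(f"*{ext}")):
        track_id = audio_path.stem  # assumption: the file stem is the track id
        rel_path = audio_path.relative_to(root).as_posix()
        tracks[track_id] = {"audio": [rel_path, md5_of(audio_path)]}
    # "1.0" is a placeholder version string, not taken from the dataset.
    return {"version": "1.0", "tracks": tracks}


def write_index(index: dict, out_path: str) -> None:
    """Save the index as JSON, e.g. under mirdata/indexes/."""
    with open(out_path, "w") as fh:
        json.dump(index, fh, indent=2)
```

Storing paths relative to the dataset root keeps the index independent of where each user places the data.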

PRamoneda · 2021-04-28T16:41:52Z

hey!! ready to review!

codecov · 2021-04-28T17:26:45Z

Codecov Report

Merging #505 (5a69b50) into master (6e087cf) will increase coverage by 0.01%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #505      +/-   ##
==========================================
+ Coverage   99.00%   99.01%   +0.01%     
==========================================
  Files          45       46       +1     
  Lines        5415     5491      +76     
==========================================
+ Hits         5361     5437      +76     
  Misses         54       54

philtgun

The .download() description says to put the audio under mtg_jamendo_autotagging_moodtheme/audio, but .validate() checks under the audios directory, which is inconsistent.
The output of .validate() is huge when there are no tracks. I'm not sure whether this behavior is the same across mirdata, but for such a big dataset it is too much output. It also doesn't say that the tracks are missing.
I'm not sure when the metadata is downloaded: I put the audio files in place manually and successfully ran .validate(), but I still can't access the tags. I'm also not sure whether the metadata should be downloaded with .download(partial_download=['metadata']), which still just shows the instructions for downloading the audio and does nothing else.
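
To make the first two points concrete, here is a rough sketch of a validation pass that reports counts and a short sample instead of listing every file. It is a standalone illustration, not mirdata's actual .validate() implementation; the function name `validate_summary`, the `max_listed` parameter, and the index layout (`{"tracks": {track_id: {"audio": [relative_path, md5]}}}`) are assumptions for the example.

```python
import hashlib
from pathlib import Path


def validate_summary(index: dict, data_home: str, max_listed: int = 5) -> str:
    """Check every indexed file and return a short, human-readable summary."""
    root = Path(data_home)
    missing, invalid = [], []
    for files in index["tracks"].values():
        for rel_path, checksum in files.values():
            path = root / rel_path
            if not path.is_file():
                missing.append(rel_path)
            # Reading the whole file at once is fine for a sketch;
            # a real check would hash large files in chunks.
            elif hashlib.md5(path.read_bytes()).hexdigest() != checksum:
                invalid.append(rel_path)
    lines = [f"{len(missing)} missing files, {len(invalid)} files with bad checksums"]
    lines += [f"  missing: {p}" for p in missing[:max_listed]]
    if len(missing) > max_listed:
        lines.append(f"  ... and {len(missing) - max_listed} more missing")
    return "\n".join(lines)
```

With an index that points at audios/... while the files sit under audio/..., every track is counted as missing, which is the symptom the folder-name inconsistency would produce.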

mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

docs/source/table.rst

mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

magdalenafuentes · 2021-04-29T14:39:14Z

Wow thanks @philtgun for the detailed review!

PRamoneda · 2021-05-01T10:18:19Z

Thank you so much for your review, @philtgun! I am going to apply your suggestions and fix the errors now :)

PRamoneda · 2021-05-01T11:06:53Z

Hi Philip! @philtgun

The output of .validate() is huge when there are no tracks. I'm not sure whether this behavior is the same across mirdata, but for such a big dataset it is too much output. It also doesn't say that the tracks are missing.

This is the usual behaviour, but you are right: a short general summary would probably be better.

PRamoneda · 2021-05-01T11:59:52Z

hi @philtgun good catch!

I'm not sure when the metadata is downloaded: I put the audio files in place manually and successfully ran .validate(), but I still can't access the tags. I'm also not sure whether the metadata should be downloaded with .download(partial_download=['metadata']), which still just shows the instructions for downloading the audio and does nothing else.

PRamoneda · 2021-05-01T12:00:27Z

This happened because I hadn't run the full_dataset test. I am running it now!

PRamoneda · 2021-05-01T16:50:27Z

four hours later....

 % pytest -s tests/test_full_dataset.py --local --dataset mtg_jamendo_autotagging_moodtheme
/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages/pep8.py:110: FutureWarning: Possible nested set at position 1
  EXTRANEOUS_WHITESPACE_REGEX = re.compile(r'[[({] | []}),;:]')
============================================================================== test session starts ===============================================================================
platform darwin -- Python 3.7.9, pytest-6.2.2, py-1.9.0, pluggy-0.13.1
rootdir: /Users/pedroramonedafranco/PycharmProjects/mirdata3
plugins: localserver-0.5.0, mock-3.3.1, cov-2.10.1, pep8-1.0.6
collected 4 items                                                                                                                                                                

tests/test_full_dataset.py If this dataset does not have openly downloadable data, follow the instructions printed by the download message and rerun this test.
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 18486/18486 [11:33<00:00, 26.67it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 18486/18486 [4:27:18<00:00,  1.15it/s]
..

================================================================================ warnings summary ================================================================================
tests/test_full_dataset.py: 18486 warnings
  /Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/site-packages/librosa/core/audio.py:162: UserWarning: PySoundFile failed. Trying audioread instead.
    warnings.warn("PySoundFile failed. Trying audioread instead.")

-- Docs: https://docs.pytest.org/en/stable/warnings.html
================================================================ 4 passed, 18486 warnings in 16739.36s (4:38:59) =================================================================
(base)

PRamoneda · 2021-05-01T16:50:48Z

Ready to merge! @rabitt @magdalenafuentes

PRamoneda · 2021-07-06T10:07:56Z

hi! I would like to remove the dataset from my laptop. What is your opinion about this loader?

rabitt · 2021-08-05T21:55:46Z

Hey @PRamoneda I'll try to take a look tomorrow!

rabitt

New loader checklist:

Create a script in scripts/, e.g. make_my_dataset_index.py
Run the script on the canonical version of the dataset and save the index in mirdata/indexes/ e.g. my_dataset_index.json
Create a module in mirdata, e.g. mirdata/my_dataset.py
Create tests in tests/, e.g. test_my_dataset.py
Add the module to docs/source/mirdata.rst
Add the module to docs/source/table.rst
Run test_full_dataset.py
Make sure the docs build properly

mirdata/datasets/good_sounds.py

mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

docs/source/table.rst

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

rabitt · 2021-09-07T13:35:51Z

Looks great @PRamoneda ! Merging now

PRamoneda added 6 commits April 24, 2021 16:28

add files

b2cd1ab

first prototype

8ed3193

add tests

ad61653

fixed tests

8cb75d1

finished tests

6221369

update.rst

11de09f

PRamoneda added 4 commits April 28, 2021 18:45

fix dheader docstring

55e2f2e

fix table.rst

298c6b4

black

3ecf21d

change name split

18dd83c

PRamoneda added 7 commits April 28, 2021 19:46

fixed tests split

8fbaee7

reduce audio size

f969265

reduce audio size

4d8a438

black

98df3d3

black

2d28230

black

b18f6e9

line not covered codecov

d939f0f

PRamoneda requested a review from nkundiushuti April 28, 2021 18:29

philtgun reviewed Apr 29, 2021

PRamoneda added 3 commits May 1, 2021 12:18

Merge branch 'master' into Pedro/jamendo

c50f318

fix philip suggestions and errors

9ee4ddd

Merge remote-tracking branch 'origin/Pedro/jamendo' into Pedro/jamendo

5daac6d

PRamoneda added 3 commits May 1, 2021 13:15

nit

80db3b8

catch philip

026b644

catch philip

0004a8b

catch philip

400eade

black

e095b57

PRamoneda requested a review from rabitt May 1, 2021 12:18

nkundiushuti approved these changes May 3, 2021

rabitt requested changes Sep 3, 2021

PRamoneda and others added 14 commits September 3, 2021 19:04

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

466054f

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

7a94922

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

94da2f5

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

9cc8960

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

9aeafbe

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

b321f7d

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

4f25fb6

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

bb0b003

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

fdc2d76

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

fixes of automatic commits

dea1dd0

Update mirdata/datasets/mtg_jamendo_autotagging_moodtheme.py

e8691b2

Co-authored-by: Rachel Bittner <rmb456@nyu.edu>

fixes of automatic commits

f8d2d06

fix docs

ed428b3

Merge branch 'master' into Pedro/jamendo

5a69b50

rabitt approved these changes Sep 7, 2021

rabitt merged commit 1c0eeda into master Sep 7, 2021
