Adding loader for TAU Urban Acoustic Scenes 2022 #129

tanmayy24 · 2023-10-05T18:21:24Z

Description

Please include the following information at the top level docstring for the dataset's module mydataset.py:

Describe annotations included in the dataset
Indicate the total duration of the dataset in hours, and (optionally) also list the number of individual files
Mention the origin of the dataset (e.g. creator, institution)
Describe the type of audio included in the dataset
Indicate any relevant papers related to the dataset
Include a description about how the data can be accessed and the license it uses (if applicable)

Dataset loaders checklist:

Create a script in scripts/, e.g. make_my_dataset_index.py, which generates an index file.
Run the script on the canonical version of the dataset and save the index in soundata/indexes/ e.g. my_dataset_index.json.
Create a module in soundata, e.g. soundata/my_dataset.py.
Create tests for your loader in tests/, e.g. test_my_dataset.py.
Add your module to docs/source/soundata.rst and docs/source/quick_reference.rst.
Run black, flake8 and mypy (see Running your tests locally).
Run tests/test_full_dataset.py on your dataset.
Check that codecov coverage does not decrease.

codecov · 2023-10-06T18:48:12Z

Codecov Report

Merging #129 (04ff520) into main (6a5de09) will increase coverage by 0.04%.
The diff coverage is 100.00%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #129      +/-   ##
==========================================
+ Coverage   98.64%   98.69%   +0.04%     
==========================================
  Files          27       28       +1     
  Lines        2363     2447      +84     
==========================================
+ Hits         2331     2415      +84     
  Misses         32       32

genisplaja

The loaders looks really good to me, testing coverage is high and I couldn't spot any issues. Just a question... do we actually need to have this massive docstring for the loader? Have not checked the others yet but, maybe it's too much to have about 500 lines of code for all the tables and so on... if there was a link or something for more info we could redirect the user there, and keep the basic and most important details of the dataset in the docstring. What do you think about that @magdalenafuentes, @tanmayy24 and @guillemcortes? If it's OK to have it, then everything is rendering really nicely and the loader looks ready to me :)

genisplaja · 2023-11-24T16:37:34Z

soundata/datasets/tau2022uas_mobile.py

+        """The clip's split.
+
+        Returns:
+            * str - subset the clip belongs to (for experiments): development (fold1, fold2, fold3, fold4) or evaluation


This line is not properly rendered.. maybe becuase of the :? Also I think we are not very consistent with the use of * before listing the attributes, input variables and so on. Sometimes we use it sometimes no. Maybe is not the goal of this PR but would be nice being consistent (unless there is a rule I am missing here).

guillemcortes · 2023-12-01T09:05:03Z

The loaders looks really good to me, testing coverage is high and I couldn't spot any issues. Just a question... do we actually need to have this massive docstring for the loader? Have not checked the others yet but, maybe it's too much to have about 500 lines of code for all the tables and so on... if there was a link or something for more info we could redirect the user there, and keep the basic and most important details of the dataset in the docstring. What do you think about that @magdalenafuentes, @tanmayy24 and @guillemcortes? If it's OK to have it, then everything is rendering really nicely and the loader looks ready to me :)

I agree that in this case, the docstring seems too large. The link could be a nice solution if it was a static link

rythmm24 added 5 commits October 4, 2023 16:15

addition of tau2022

e73e068

fixed checksum

c15d857

test cases

676d9e8

black fixed

e6d178f

fixed docs

afcd3a2

tanmayy24 requested a review from magdalenafuentes October 5, 2023 18:21

tanmayy24 and others added 4 commits October 6, 2023 23:17

Merge branch 'main' into tanmay/tau2022

1fe570b

basic edit

228e1a8

Fixed test cases for tau2022_mobile

f4239ff

Fixed test usecase

ca2aa7e

tanmayy24 changed the title ~~[WIP] Adding loader for TAU Urban Acoustic Scenes 2022~~ Adding loader for TAU Urban Acoustic Scenes 2022 Oct 6, 2023

Fixed badge issue

25ca654

magdalenafuentes requested review from guillemcortes and genisplaja October 10, 2023 18:25

magdalenafuentes and others added 2 commits October 10, 2023 14:25

Merge branch 'main' into tanmay/tau2022

e59f2f0

Merge branch 'main' into tanmay/tau2022

dab00ab

genisplaja approved these changes Nov 24, 2023

View reviewed changes

tanmayy24 added 3 commits November 30, 2023 10:44

Merge branch 'main' into tanmay/tau2022

6e006d7

Merge branch 'main' into tanmay/tau2022

3b8127a

Merge branch 'main' into tanmay/tau2022

04ff520

tanmayy24 merged commit 5cb1204 into main Nov 30, 2023
11 checks passed

magdalenafuentes deleted the tanmay/tau2022 branch February 6, 2024 21:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding loader for TAU Urban Acoustic Scenes 2022 #129

Adding loader for TAU Urban Acoustic Scenes 2022 #129

tanmayy24 commented Oct 5, 2023 •

edited

codecov bot commented Oct 6, 2023 •

edited

genisplaja left a comment

genisplaja Nov 24, 2023

guillemcortes commented Dec 1, 2023

Adding loader for TAU Urban Acoustic Scenes 2022 #129

Adding loader for TAU Urban Acoustic Scenes 2022 #129

Conversation

tanmayy24 commented Oct 5, 2023 • edited

Description

Dataset loaders checklist:

codecov bot commented Oct 6, 2023 • edited

Codecov Report

genisplaja left a comment

Choose a reason for hiding this comment

genisplaja Nov 24, 2023

Choose a reason for hiding this comment

guillemcortes commented Dec 1, 2023

tanmayy24 commented Oct 5, 2023 •

edited

codecov bot commented Oct 6, 2023 •

edited