[FEATURE]: Integrate UrbanSoundDataset for Audio Trainspace #1156

codingwithsurya · 2024-04-06T05:26:15Z

Feature Name

As part of our initiative to extend the Deep Learning Playground's capabilities to include audio data processing, we need to integrate the UrbanSoundDataset into our system.

Your Name

Surya Subramanian

Description

This task involves enhancing the training/core/dataset.py module to support audio data, specifically focusing on the UrbanSound8K dataset. The goal is to create a seamless pipeline for data ingestion, preprocessing, and loading, tailored for audio files.

Objectives

Take inspiration from the other classes in dataset.py and develop dataCreator, train_loader, and test_loader methods to facilitate the loading of preprocessed audio data into the model for training and testing. Ensure that data loading is efficient and integrates well with PyTorch's DataLoader mechanism. Essentially, the code in this class will be used to create train and test datasets.

Implementation Details

Your best bet is to take inspiration from ImageDefaultDatasetCreator and "recreate" that but for the UrbanSoundDataset. It should be seamless for us to call the createTrainDataset() and createTestDataset() methods from other files (like how it is being used called in training/training/routes/image/image.py.

So in summary,

The UrbanSoundDataset class should be added to training/core/dataset.py, encapsulating all functionalities related to the UrbanSound8K dataset.
Ensure that the dataset loader is compatible with our existing training pipeline, allowing for smooth integration with models and training routines.
UPDATE: schemas.py will be handled here. Here's a high level overview of what we are thinking:

# no layer params

# hardcode non tunable params in

# class AudioParams(Schema):
#     name: str
#     problem_type: Literal["CLASSIFICATION"]
#     default: Urban Sound Dataset
#     criterion: negative log likelihood loss
#     optimizer_name: Adam 
#     shuffle: bool
#     epochs: int
#     test_size: float
#     batch_size: int
#     user_arch: arch we are using

Feel free to play around with this!

The text was updated successfully, but these errors were encountered:

github-actions · 2024-04-06T05:26:27Z

Hello @codingwithsurya! Thank you for submitting the Feature Request Form. We appreciate your contribution. 👋

We will look into it and provide a response as soon as possible.

To work on this feature request, you can follow these branch setup instructions:

Checkout the main branch:

```
 git checkout nextjs
```

Pull the latest changes from the remote main branch:

```
 git pull origin nextjs
```

Create a new branch specific to this feature request using the issue number:

```
 git checkout -b feature-1156
```

Feel free to make the necessary changes in this branch and submit a pull request when you're ready.

Best regards,
Deep Learning Playground (DLP) Team

codingwithsurya added the enhancement New feature or request label Apr 6, 2024

codingwithsurya mentioned this issue May 4, 2024

Adding Audio Routing #1172

Merged

codingwithsurya linked a pull request May 14, 2024 that will close this issue

Integrate UrbanSoundDataset for Audio Data Processing #1179

Open

codingwithsurya self-assigned this May 14, 2024

codingwithsurya linked a pull request May 14, 2024 that will close this issue

Integrate UrbanSoundDataset for Audio Data Processing #1179

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE]: Integrate UrbanSoundDataset for Audio Trainspace #1156

[FEATURE]: Integrate UrbanSoundDataset for Audio Trainspace #1156

codingwithsurya commented Apr 6, 2024 •

edited

Loading

github-actions bot commented Apr 6, 2024

[FEATURE]: Integrate UrbanSoundDataset for Audio Trainspace #1156

[FEATURE]: Integrate UrbanSoundDataset for Audio Trainspace #1156

Comments

codingwithsurya commented Apr 6, 2024 • edited Loading

Feature Name

Your Name

Description

Description

Objectives

Implementation Details

github-actions bot commented Apr 6, 2024

codingwithsurya commented Apr 6, 2024 •

edited

Loading