Commit

Update README.md
shvdiwnkozbw committed Apr 6, 2021
1 parent b40a6bd commit 8fa5856
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -17,7 +17,7 @@ We propose a two-stage learning framework, which establishes coarse-grained audi

## Prepare Dataset

-The detailed preprocessing code and classification pseudo label generation can be referred to ```./prepare_data``` and ```./mobile_crnn/generate_data.py```. The pseudo labels for images are similar to the audio, but replace the model with ImageNet pretrained backbones, and use ```./utils/cluster_v3.npy``` for aggregation. And it is also practical to use the same pseudo labels for both images and audios, since the ImageNet labels cannot be well aligned with 7 general categories in AudioSet, introducing extra noise. There is only slight difference in the choice of label assignment. For datasets like AudioSet, directly use videos tags as classification labels.
+The detailed preprocessing code and classification pseudo label generation can be referred to ```./prepare_data``` and ```./mobile_crnn/generate_label.py```. The pseudo labels for images are similar to the audio, but replace the model with ImageNet pretrained backbones, and use ```./utils/cluster_v3.npy``` for aggregation. And it is also practical to use the same pseudo labels for both images and audios, since the ImageNet labels cannot be well aligned with 7 general categories in AudioSet, introducing extra noise. There is only slight difference in the choice of label assignment. For datasets like AudioSet, directly use videos tags as classification labels.
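
The aggregation step described above (collapsing fine-grained classifier outputs into a few general categories) might look roughly like the sketch below. It is not taken from the repository: the function name `aggregate_to_coarse` is hypothetical, and the mapping is assumed to be a plain integer array with one coarse-category index per fine-grained class. The actual layout of ```./utils/cluster_v3.npy``` is not documented here and may differ.

```python
import numpy as np


def aggregate_to_coarse(fine_probs, fine_to_coarse, num_coarse):
    """Sum fine-grained class probabilities into coarse categories.

    fine_probs:     (num_samples, num_fine) scores from a pretrained classifier.
    fine_to_coarse: (num_fine,) coarse-category index for each fine class
                    (ASSUMED format, not the confirmed layout of cluster_v3.npy).
    num_coarse:     number of coarse categories (7 in the text above).
    Returns (pseudo_labels, coarse_probs).
    """
    fine_probs = np.asarray(fine_probs, dtype=np.float64)
    fine_to_coarse = np.asarray(fine_to_coarse, dtype=np.int64)
    num_fine = fine_probs.shape[1]

    # One-hot assignment matrix: row i has a 1 in the column of its coarse category.
    assign = np.zeros((num_fine, num_coarse), dtype=np.float64)
    assign[np.arange(num_fine), fine_to_coarse] = 1.0

    # Matrix product adds up the scores of all fine classes in each coarse category.
    coarse_probs = fine_probs @ assign

    # The pseudo label is the coarse category with the largest total score.
    return coarse_probs.argmax(axis=1), coarse_probs


if __name__ == "__main__":
    # Toy example: 4 fine classes grouped into 2 coarse categories.
    probs = [[0.1, 0.2, 0.3, 0.4],
             [0.7, 0.1, 0.1, 0.1]]
    mapping = [0, 0, 1, 1]
    labels, coarse = aggregate_to_coarse(probs, mapping, num_coarse=2)
    print(labels)
    print(coarse)
```

Using a matrix product keeps the aggregation vectorized and handles several fine classes sharing one coarse category without any special casing.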

#### SoundNet-Flickr Dataset
