Errors in the FMA_large.zip and FMA_full.zip #61

Closed
nicolaus625 opened this issue Dec 30, 2022 · 6 comments

Comments

@nicolaus625

There are some errors in FMA_large.zip and FMA_full.zip.
I tried several download methods (wget and curl) from several sources (the zstd files are unavailable, so I used the link in the GitHub repo and the Kaggle version mentioned in other issues) and several decompression tools (unzip, 7zip, tar, bzip, etc.) on multiple Linux machines. In every case, many files are distorted, such as:

/fma_large/000/000148.mp3
/fma_large/000/000149.mp3
/fma_large/000/000150.mp3
/fma_large/000/000151.mp3
/fma_large/000/000152.mp3
/fma_large/001/001000.mp3
/fma_large/001/001001.mp3
/fma_large/002/002076.mp3
/fma_large/002/002077.mp3
/fma_large/002/002078.mp3
/fma_large/002/002079.mp3
/fma_large/002/002080.mp3
/fma_large/002/002081.mp3
/fma_large/002/002082.mp3

I believe the Kaggle version, uploaded 9 months ago from GitHub, is a good demonstration of this noise. Could you please zip fma_large and fma_full again and release them?

@mdeff
Owner

mdeff commented Jan 3, 2023

What do you mean by errors?

Have you checked the wiki page that reports known issues?

You can check the integrity of the downloaded .mp3 files with sha1sum -c checksums.
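Where `sha1sum` is not available, the same check can be approximated in Python. This is a minimal sketch, assuming the `checksums` file uses the usual `sha1sum` output format (hex digest, a space, a space or `*`, then the file name); the helper names `sha1_of_file` and `find_mismatches` are hypothetical and not part of the FMA repository.

```python
import hashlib
from pathlib import Path


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-1 digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_mismatches(checksum_file, root="."):
    """Yield (name, reason) for each listed file that is missing or differs.

    Assumes sha1sum-style lines: "<hex digest>  <name>" or "<hex digest> *<name>".
    """
    for line in Path(checksum_file).read_text().splitlines():
        if not line.strip():
            continue
        expected, _, name = line.partition(" ")
        # Drop the mode marker: a second space (text) or '*' (binary).
        if name[:1] in (" ", "*"):
            name = name[1:]
        target = Path(root) / name
        if not target.is_file():
            yield name, "missing"
        elif sha1_of_file(target) != expected.lower():
            yield name, "checksum mismatch"
```

An empty result from `list(find_mismatches("checksums", root="fma_large"))` would indicate that every listed file is present and matches its recorded digest; the `root` value here is only an example path.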

Maybe what you think to be noise is what the artist intended to produce. For example, you can listen to the original of 000148.mp3 at https://freemusicarchive.org/music/Contradiction/Contradiction and understand why it sounds like it does:

If one ever played around with a microphone in front of a set of speakers, you know it can create feedback. If you are not afraid and keep on holding the microphone in front of the speakers, you know you can sing through it, or scream or shout.

Note also that this track's genre is "Experimental → Avant-Garde". You might want to filter by genre (or other tags) to exclude tracks.
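Filtering by genre could look roughly like the sketch below. It assumes each track's genres are available as a string such as "[1, 38]" (a list of integer genre IDs); the function names, the track keys, and the IDs in the usage example are illustrative assumptions, not values confirmed in this thread.

```python
import ast


def parse_genre_ids(cell):
    """Parse a cell such as "[1, 38]" into a set of integer genre IDs."""
    return set(ast.literal_eval(cell)) if cell.strip() else set()


def keep_track(genre_cell, excluded_ids):
    """Return True if none of the track's genres is in excluded_ids."""
    return parse_genre_ids(genre_cell).isdisjoint(excluded_ids)


# Illustrative data only: hypothetical track keys and genre IDs.
tracks = {"track_a": "[1, 38]", "track_b": "[21]", "track_c": ""}
excluded = {1, 38}
kept = [tid for tid, cell in tracks.items() if keep_track(cell, excluded)]
```

Tracks with no genre information are kept by this sketch; whether that is the right default depends on the use case.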

@nicolaus625
Author

Have you checked the wiki page that reports known issues?

Yes. I believe most of the audio files I found are not among the erroneous files listed on the wiki page.

You can check the integrity of the downloaded .mp3 files with sha1sum -c checksums.

Already done. It shows OK to me.

Note also that this track's genre is "Experimental → Avant-Garde". You might want to filter by genre (or other tags) to exclude tracks.

That makes sense. How do you know the recording belongs to "Avant-Garde"? There is not much information in tracks.csv.

Best regards

@mdeff
Owner

mdeff commented Jan 3, 2023

That makes sense. How do you know the recording belongs to "Avant-Garde"? There is not much information in tracks.csv.

Great! Genre information is found in tracks.csv and genres.csv. Please check out the usage.ipynb notebook.
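As a rough illustration of how a label like "Experimental → Avant-Garde" can be derived, the sketch below walks a genre's parent chain. It assumes genres.csv provides `genre_id`, `parent`, and `title` columns, with a parent that is absent from the table marking a top-level genre; the sample rows and function names are illustrative, and usage.ipynb remains the authoritative reference.

```python
import csv
import io


def load_genres(csv_text):
    """Map genre_id -> (parent_id, title) from CSV text."""
    genres = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        genres[int(row["genre_id"])] = (int(row["parent"]), row["title"])
    return genres


def genre_path(genre_id, genres):
    """Return the titles from the top-level genre down to genre_id."""
    titles = []
    seen = set()  # guards against accidental cycles in the parent links
    while genre_id in genres and genre_id not in seen:
        seen.add(genre_id)
        parent, title = genres[genre_id]
        titles.append(title)
        genre_id = parent
    return " → ".join(reversed(titles))


# Illustrative rows only; the real file may contain other columns and IDs.
sample = "genre_id,parent,title\n38,0,Experimental\n1,38,Avant-Garde\n"
genres = load_genres(sample)
path = genre_path(1, genres)
```

With the sample rows above, `path` is "Experimental → Avant-Garde".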

@nicolaus625
Author

Thank you. Another concern is that some audio, such as /fma_large/001/001000.mp3 (https://freemusicarchive.org/music/Kevin_Shields/The_Death_of_Patience/), contains too much noise. The audio on the website seems to be broken from the 13-second mark onward. Is this also a genre issue?

@mdeff
Owner

mdeff commented Jan 3, 2023

Well, the genre of these tracks is literally "Noise".

@nicolaus625
Author

lol, thank you very much.

mdeff closed this as completed Jan 3, 2023