Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wrong times.txt file in CV15 #4252

Closed
neouyghur opened this issue Nov 27, 2023 · 2 comments
Closed

Wrong times.txt file in CV15 #4252

neouyghur opened this issue Nov 27, 2023 · 2 comments
Labels

Comments

@neouyghur
Copy link
Contributor

Hi, I downloaded the Uyghur dataset of CV15 and found out the times.txt file is not correct. I think it is for CV14. Here are top lines from the file:

`cv-corpus-14.0-delta-2023-06-23/ug/clips/common_voice_ug_34452661.mp3` = 6192
`cv-corpus-14.0-delta-2023-06-23/ug/clips/common_voice_ug_26209345.mp3` = 5148
`cv-corpus-14.0-delta-2023-06-23/ug/clips/common_voice_ug_33366211.mp3` = 8280
`cv-corpus-14.0-delta-2023-06-23/ug/clips/common_voice_ug_27575639.mp3` = 5400
`cv-corpus-14.0-delta-2023-06-23/ug/clips/common_voice_ug_27130049.mp3` = 6516
`cv-corpus-14.0-delta-2023-06-23/ug/clips/common_voice_ug_33463703.mp3` = 6588
@neouyghur neouyghur added the Bug label Nov 27, 2023
@HarikalarKutusu
Copy link
Contributor

You are correct to assume it is from v14.0. In v14.0 there was a times.txt file, but in v15.0 it is provided as clip_durations.tsv, like the other meta files.

It is just a leftover, we should disregard it.

@neouyghur
Copy link
Contributor Author

@HarikalarKutusu I have not noticed clip_durations.tsv.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants