This is the web app for Mozilla Common Voice, a platform for collecting speech donations in order to create public domain datasets for training voice recognition-related tools.
By participating in this project, you're agreeing to uphold the Mozilla Community Participation Guidelines. If you need to report a problem, please see our CODE_OF_CONDUCT.md guide.
This repository is released under the Mozilla Public License (MPL) 2.0.
The majority of our sentence text in /server/data comes directly from user submissions in our Sentence Collector or is scraped from Wikipedia using our extractor tool, and is released under a CC0 public domain Creative Commons license.
Any files that follow the pattern europarl-VERSION-LANG.txt (such as europarl-v7-de.txt) were extracted, with our thanks, from the Europarl Corpus, which features transcripts of proceedings in the European Parliament.
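If you need to tell Europarl-derived files apart from the CC0 sentence files, the naming pattern above can be checked with a short script. The following is only a sketch: the regular expression and the helper name parse_europarl_name are assumptions made for illustration and are not part of this repository.

```python
import re

# Assumed reading of the pattern "europarl-VERSION-LANG.txt",
# for example "europarl-v7-de.txt". VERSION is taken to contain no hyphen.
EUROPARL_RE = re.compile(r"^europarl-(?P<version>[^-]+)-(?P<lang>[^.]+)\.txt$")


def parse_europarl_name(filename):
    """Return (version, lang) if the name follows the pattern, else None.

    Hypothetical helper, not part of the Common Voice code base.
    """
    match = EUROPARL_RE.match(filename)
    if match is None:
        return None
    return match.group("version"), match.group("lang")
```

For the example name given above, parse_europarl_name("europarl-v7-de.txt") yields the version "v7" and the language "de"; any name that does not fit the pattern yields None.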
There are many ways to get involved with Common Voice - you don't have to know how to code to contribute! For more information, check out CONTRIBUTING.md.
If you would like to submit new sentences or edit existing translations, please see this detailed guide on Discourse on how to do that. We do not accept direct pull requests for localization content. Check out your language on the Common Voice project through Mozilla's Pontoon localization system for more information.
For general discussion (feedback, ideas, random musings), head to our Discourse Category.
For technical problems or suggestions, please use the GitHub issue tracker.
Or come chat with us on Matrix.