Nexdata-AI/500-Hours-Henan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

500-Hours-Henan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

Format

16 kHz, 16-bit, uncompressed WAV, mono channel
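
As a minimal sketch, the stated format can be checked against a file's WAV header using Python's standard `wave` module. The function name `matches_stated_format` and the file name in the usage note are hypothetical and not part of the dataset; the sketch assumes the files are plain PCM WAV, which is what `wave` can read.

```python
import wave

# Expected values taken from the Format line above.
EXPECTED_SAMPLE_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1  # mono


def matches_stated_format(path):
    """Return True if the WAV header reports 16 kHz, 16-bit, mono, uncompressed PCM.

    Hypothetical helper for illustration; it inspects the header only,
    not the audio content.
    """
    with wave.open(path, "rb") as wav_file:
        return (
            wav_file.getframerate() == EXPECTED_SAMPLE_RATE_HZ
            and wav_file.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav_file.getnchannels() == EXPECTED_CHANNELS
            and wav_file.getcomptype() == "NONE"  # uncompressed PCM
        )
```

For example, `matches_stated_format("sample.wav")` (a placeholder file name) would return `False` for a stereo or 44.1 kHz recording, which may be useful as a quick sanity check before feeding audio to a speech recognition pipeline.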

Recording Environment

quiet indoor environment, without echo

Recording content

dozens of topics are specified, and the speakers converse on those topics while being recorded

Demographics

1,000 speakers, balanced by gender

Annotation

transcription text, speaker identification, and gender are annotated

Device

Android mobile phone, iPhone

Language

Henan Dialect

Application scenarios

speech recognition; voiceprint recognition

Accuracy rate

95%

Licensing Information

Commercial License