800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

Description

1730 Sichuan native speakers participated in the recording and face-to-face free talking in a natural way in wide fields without the topic specified. It is natural and fluency in speech, and in line with the actual dialogue scene. We transcribed the speech into text manually to ensure high accuracy.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1065?source=Github

Format

16kHz, 16bit, uncompressed wav, mono channel

Recording environments

quiet indoor environment, without echo

Recording content

no topic is specified, and the speakers make dialogue while the recording is performed

Demographics

1,730 people, 74% of which are female; 88% of 1,730 people are not more than 25 years old; people are from Sichuan or Chongqing

Annotations

annotating for the transcription text, speaker identification and gender

Device

Android mobile phone, iPhone

Language

Sichuan dialect

Applications

speech recognition, voiceprint recognition.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
01.txt		01.txt
01.wav		01.wav
02.txt		02.txt
02.wav		02.wav
03.txt		03.txt
03.wav		03.wav
04.txt		04.txt
04.wav		04.wav
05.txt		05.txt
05.wav		05.wav
README.md		README.md

Nexdata-AI/800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

Folders and files

Latest commit

History

Repository files navigation

800-Hours-Sichuan-Dialect-Conversational-Speech-Data-by-Mobile-Phone

Description

Format

Recording environments

Recording content

Demographics

Annotations

Device

Language

Applications

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks