Add KsponSpeech recipe #1353

whsqkaak · 2024-06-10T05:26:04Z

KsponSpeech is a large-scale spontaneous speech corpus of Korean.
This corpus contains 969 hours of open-domain dialog utterances,
spoken by about 2,000 native Korean speakers in a clean environment.

All data were constructed by recording the dialogue of two people
freely conversing on a variety of topics and manually transcribing the utterances.

Each transcript provides a dual transcription (orthographic and phonetic)
and disfluency tags that mark spontaneous-speech phenomena such as filler words, repeated words, and word fragments.

The original audio files have a .pcm extension.
During preprocessing, each file is converted to WAV format and saved as a new file.
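
For readers unfamiliar with this kind of conversion, here is a minimal sketch of wrapping headerless PCM bytes in a WAV container using only the Python standard library. It is not the code from this PR: the helper name `pcm_to_wav_sketch` is hypothetical, and the audio parameters (16 kHz, mono, 16-bit little-endian) are assumptions that should be checked against the corpus documentation.

```python
import wave
from pathlib import Path


def pcm_to_wav_sketch(pcm_path, wav_path, sampling_rate=16000,
                      num_channels=1, sample_width=2):
    """Wrap headerless PCM bytes in a WAV container (hypothetical helper).

    The defaults (16 kHz, mono, 16-bit samples) are assumptions.
    """
    raw = Path(pcm_path).read_bytes()
    # Drop a trailing partial frame so the payload holds whole frames only.
    frame_size = num_channels * sample_width
    raw = raw[: len(raw) - len(raw) % frame_size]
    with wave.open(str(wav_path), "wb") as wav_file:
        wav_file.setnchannels(num_channels)
        wav_file.setsampwidth(sample_width)
        wav_file.setframerate(sampling_rate)
        # The samples are copied unchanged; only a header is added.
        wav_file.writeframes(raw)
    return Path(wav_path)
```

Because the samples are copied as-is, the WAV file is about the same size as the PCM input plus a small header.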

KsponSpeech is publicly available on an open data hub run by the Korean government.
The dataset must be downloaded manually.

For more details, please visit:

Dataset: https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=123
Paper: https://www.mdpi.com/2076-3417/10/19/6936

pzelasko

Thanks, great work! I left 2 comments.

pzelasko · 2024-06-11T13:12:36Z

lhotse/recipes/ksponspeech.py

+)
+
+
+def normalize(


can we make normalization optional (and enabled by default) via a parameter in prepare_ksponspeech? you can see other recipes, some of them have a string option making it possible to choose different flavors of normalization (we can have "default" and "none" here)

Fixed in e7dc868.
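
To illustrate the request above, here is a rough sketch of a string option that switches normalization on or off. It is not the code from e7dc868: the function name `normalize_sketch`, the parameter name `normalize_text`, and the assumed `(orthographic)/(phonetic)` notation for dual transcription are all illustrative assumptions, and keeping the phonetic side is an arbitrary choice. Disfluency tags are not handled here.

```python
import re

# Assumed notation for a dual transcription: "(orthographic)/(phonetic)".
_DUAL_PATTERN = re.compile(r"\(([^()]*)\)/\(([^()]*)\)")


def normalize_sketch(text: str, normalize_text: str = "default") -> str:
    """Apply the selected normalization flavor (hypothetical helper)."""
    if normalize_text == "none":
        # Return the transcript untouched.
        return text
    if normalize_text != "default":
        raise ValueError(f"Unknown normalization flavor: {normalize_text!r}")
    # Keep the second (phonetic) variant of each dual transcription.
    text = _DUAL_PATTERN.sub(lambda match: match.group(2), text)
    # Collapse runs of whitespace left over from the substitution.
    return " ".join(text.split())


print(normalize_sketch("(3개)/(세 개)"))
print(normalize_sketch("(3개)/(세 개)", normalize_text="none"))
```

A string option, rather than a boolean, leaves room for adding further flavors later without changing the function signature.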

pzelasko · 2024-06-11T13:13:58Z

lhotse/recipes/ksponspeech.py

+    return manifests
+
+
+def pcm_to_wav(


could we convert to FLAC instead? 2x storage space savings

Fixed in 36aa16d.
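
As an illustration of the suggestion above, here is a minimal sketch of encoding raw PCM as FLAC with the third-party `numpy` and `soundfile` packages. It is not the code from 36aa16d: the helper name `pcm_to_flac_sketch` is hypothetical, and the input format (16 kHz, mono, 16-bit little-endian) is an assumption. FLAC is lossless, so the decoded samples match the input; the actual space saving depends on the audio.

```python
from pathlib import Path


def pcm_to_flac_sketch(pcm_path, flac_path, sampling_rate=16000):
    """Encode headerless 16-bit mono PCM as FLAC (hypothetical helper)."""
    # Imported lazily so this module loads without the optional packages.
    import numpy as np
    import soundfile as sf

    raw = Path(pcm_path).read_bytes()
    # Drop a stray trailing byte so the buffer holds whole 16-bit samples.
    raw = raw[: len(raw) - len(raw) % 2]
    # Assumption: samples are little-endian signed 16-bit integers.
    samples = np.frombuffer(raw, dtype="<i2").copy()
    sf.write(str(flac_path), samples, sampling_rate,
             format="FLAC", subtype="PCM_16")
    return Path(flac_path)
```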

whsqkaak · 2024-06-12T03:30:46Z

> Thanks, great work! I left 2 comments.

Thanks for your feedback! I'll address these suggestions shortly.

pzelasko · 2024-06-12T13:49:49Z

Thanks! LGTM

whsqkaak added 3 commits June 10, 2024 14:22

Add KsponSpeech recipe

c74d81c

Fix an error occurred during prepare ksponspeech.

66e9dbb

Merge branch 'master' into feature/ksponspeech-recipe

4572330

pzelasko reviewed Jun 11, 2024

whsqkaak added 3 commits June 12, 2024 12:33

Merge branch 'master' into feature/ksponspeech-recipe

f624884

Make normalization optional in prepare_ksponspeech

e7dc868

Modify pcm_to_wav -> pcm_to_flac

36aa16d

pzelasko approved these changes Jun 12, 2024

pzelasko enabled auto-merge (squash) June 12, 2024 13:49

pzelasko merged commit f9fb181 into lhotse-speech:master Jun 12, 2024
9 checks passed

whsqkaak deleted the feature/ksponspeech-recipe branch June 13, 2024 01:23

pzelasko modified the milestone: v1.24.0 Jun 24, 2024
