GitHub - seanghay/khmer-forced-aligner: Khmer Forced Aligner

git clone --recursive https://github.com/seanghay/khmer-forced-aligner.git

Inference

Resample the audio to be 16kHz

ffmpeg -i input_audio.ogg -ar 16000 -ac 1 -c:a pcm_s16le audio.wav

Align

python align_and_segment.py \
  -a audio.wav \
  -t audio.txt \
  --lang khm \
  --outdir results/audio \
  --uroman_path uroman/bin

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
uroman @ eab35dd		uroman @ eab35dd
.gitignore		.gitignore
.gitmodules		.gitmodules
align_and_segment.py		align_and_segment.py
align_utils.py		align_utils.py
norm_config.py		norm_config.py
punctuations.lst		punctuations.lst
readme.md		readme.md
requirements.txt		requirements.txt
segment.sh		segment.sh
text_normalization.py		text_normalization.py