speech-regconize

语音识别

1 环境安装

   pip install soundfile
   
   pip install tensorflow-gpu==1.12
   
   pip install python_speech_features
   
   pip install tqdm
   
   pip install easydict
   
   cuda9.0

2 测试

    python decoder.py

3 训练

    数据准备：
    
        见data文件夹 txt格式 音频路径+'\t' + label (label用空格分割)  //'\t'是指tab建不是字符
    
        config.py 中data_path+音频路径  为音频的绝对路径
    
        运行 python generate_data.py 不报错 则数据准备正确
    
    运行 python train.py 进行训练

4 模型冻结

    freeze_graph.py  修改ckpt_file为自己训练的checkpoint路径
                      
                     pb_file   生成的pb文件保存路径
    
    运行  python freeze_graph.py
    
    修改 config.py 中__C.PREDICT.pb = pb_file

运行python decode.py 测试

5 checkpoint模型地址链接：https://pan.baidu.com/s/1_CgXG3AvBDrXGTRr5_Rv8Q 提取码：ryqc

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
config		config
data		data
model		model
pb_save		pb_save
util		util
D4_750.wav		D4_750.wav
README.md		README.md
ctc_prefix_score.py		ctc_prefix_score.py
decoder.py		decoder.py
freeze_graph.py		freeze_graph.py
generate_data.py		generate_data.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speech-regconize

About

Releases

Packages

Languages

Titook/speech-recognize

Folders and files

Latest commit

History

Repository files navigation

speech-regconize

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages