Here are some useful shell commands that you may need when working on NLP tasks, such as counting words in a corpus, preparing a language model, processing file names, searching in files, extracting information from files, and cleaning text files.
- Clean and extract info from a text file
- Search in files
- Process file names; convert, merge, copy, and move files
- Count words in a corpus
- Split files
- Compress files and directories
- Configure language on Linux and reset swap
- Install CMU Sphinx on Linux
- Prepare a language model
- Process wav files
- Prepare a docker image
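
As a taste of the "Count words in a corpus" topic, here is a minimal sketch built only from standard POSIX tools. The function name `count_words` is hypothetical and not part of this project, and the sketch assumes a plain-text corpus in which words are runs of alphabetic characters. In the C locale `[:alpha:]` matches only ASCII letters, so set a suitable locale for other scripts.

```shell
# Hypothetical helper: print word frequencies for a plain-text corpus.
# Usage: count_words path/to/corpus.txt
count_words() {
    tr -cs '[:alpha:]' '\n' < "$1" |    # replace each run of non-letters with one newline
        grep -v '^$' |                  # drop the empty line left by leading punctuation
        tr '[:upper:]' '[:lower:]' |    # fold case so "The" and "the" count together
        sort |                          # group identical words for uniq
        uniq -c |                       # prefix each distinct word with its count
        sort -k1,1nr -k2                # most frequent first, ties in alphabetical order
}
```

Piping the result through `head -n 20` shows the twenty most frequent words, and `wc -l` gives the vocabulary size.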
Contributions that improve the code are welcome. Please follow the steps below.
- Fork the project.
- Modify the code and test it to make sure it works.
- Make a pull request.
If you need help with any of these steps, please consult the GitHub Help documentation.