Danish ASR and TTS models associated with the CoRal project.
Developers:
- Dan Saattrup Nielsen (dan.nielsen@alexandra.dk)
- Run
make install
, which installs Poetry (if it isn't already installed), sets up a virtual environment and all Python dependencies therein. - Run
source .venv/bin/activate
to activate the virtual environment.
If you're on MacOS and get an error saying something along the lines of "fatal error:
'lzma.h' file not found" then try the following and rerun make install
afterwards:
export CPPFLAGS="-I$(brew --prefix)/include"
To install new PyPI packages, run:
$ poetry add <package-name>
Simply write make
to display a list of the commands available. This includes the
above-mentioned make install
command, as well as building and viewing documentation,
publishing the code as a package and more.
- Poetry: Dependency management
- hydra: Manage configuration files
- pre-commit plugins: Automate code reviewing formatting
- pdoc: Automatically create an API documentation for your project
.
├── .github
│ └── workflows
│ ├── ci.yaml
│ └── docs.yaml
├── .gitignore
├── .pre-commit-config.yaml
├── LICENSE
├── README.md
├── config
│ ├── __init__.py
│ ├── config.yaml
│ ├── datasets
│ │ ├── alvenir_test_set.yaml
│ │ ├── common_voice_13_da.yaml
│ │ ├── common_voice_13_nn.yaml
│ │ ├── common_voice_13_sv.yaml
│ │ ├── common_voice_9_da.yaml
│ │ ├── fleurs_da.yaml
│ │ ├── fleurs_nb.yaml
│ │ ├── fleurs_sv.yaml
│ │ ├── ftspeech.yaml
│ │ ├── nota.yaml
│ │ ├── nst_da.yaml
│ │ └── test_dataset.yaml
│ ├── hydra
│ │ └── job_logging
│ │ └── custom.yaml
│ └── model
│ ├── test_wav2vec2.yaml
│ ├── test_whisper.yaml
│ ├── wav2vec2.yaml
│ ├── whisper_large.yaml
│ ├── whisper_medium.yaml
│ ├── whisper_small.yaml
│ ├── whisper_xsmall.yaml
│ └── whisper_xxsmall.yaml
├── docs
│ └── .gitkeep
├── makefile
├── poetry.lock
├── poetry.toml
├── pyproject.toml
├── src
│ ├── coral
│ │ ├── __init__.py
│ │ ├── compute_metrics.py
│ │ ├── data.py
│ │ ├── finetune.py
│ │ ├── model_setup.py
│ │ ├── plot.py
│ │ ├── prepare_raw_data.py
│ │ ├── protocols.py
│ │ ├── utils.py
│ │ ├── wav2vec2.py
│ │ └── whisper.py
│ └── scripts
│ ├── build_coral_data.py
│ ├── build_ftspeech.py
│ ├── build_nota.py
│ ├── build_nst_da.py
│ ├── download_ftspeech.py
│ ├── evaluate_model.py
│ ├── find_faulty_audio_clips.py
│ ├── finetune_model.py
│ ├── fix_dot_env_file.py
│ ├── plot_training_trajectory.py
│ ├── push_to_hub.py
│ ├── train_ngram_decoder.py
│ └── versioning.py
└── tests
├── __init__.py
├── conftest.py
├── test_compute_metrics.py
├── test_data.py
├── test_finetune.py
├── test_protocols.py
├── test_utils.py
├── test_wav2vec2.py
└── test_whisper.py