Skip to content

Latest commit

 

History

History
129 lines (83 loc) · 2.2 KB

api.rst

File metadata and controls

129 lines (83 loc) · 2.2 KB

API Reference

This page contains a comprehensive list of all classes and functions within lhotse.

Datasets

PyTorch Dataset wrappers for common tasks.

Speech Recognition

lhotse.dataset.speech_recognition

Source Separation

lhotse.dataset.source_separation

Unsupervised

lhotse.dataset.unsupervised

Voice Activity Detection

lhotse.dataset.vad

Diarization (experimental)

lhotse.dataset.diarization

Recording manifests

Data structures used for describing audio recordings in a dataset.

lhotse.audio

Supervision manifests

Data structures used for describing supervisions in a dataset.

lhotse.supervision

Feature extraction and manifests

Data structures and tools used for feature extraction and description.

Features API - extractor and manifests

lhotse.features.base

Torchaudio feature extractors

lhotse.features.fbank

lhotse.features.mfcc

lhotse.features.spectrogram

Feature storage

lhotse.features.io

Feature-domain mixing

lhotse.features.mixer

Augmentation

lhotse.augmentation

Cuts

Data structures and tools used to create training/testing examples.

lhotse.cut

Recipes

Convenience methods used to prepare recording and supervision manifests for standard corpora.

lhotse.recipes

Kaldi conversion

Convenience methods used to interact with Kaldi data directories.

lhotse.kaldi

Others

Helper methods used throughout the codebase.

lhotse.manipulation