Luxembourgish Vowel Classifier

This is a proof of concept to train an acoustic classifier for vowels. The ultimate aim is to use this in an app to assess pronunciations of language learners. The project focuses specifically on Luxembourgish vowels, which present unique challenges due to their distinctive phonetic characteristics and the language's complex vowel system. By leveraging state-of-the-art machine learning techniques, this classifier aims to provide accurate and real-time feedback to language learners, helping them improve their pronunciation of Luxembourgish vowels. The system is designed to be both educational and practical, serving as a valuable tool for language learning and phonetic research.

An AI-powered vowel classification web application for Luxembourgish using a fine-tuned HuBERT transformer model. This Streamlit application provides real-time vowel classification with multiple input methods.

🎯 Features

HuBERT Transformer Model: State-of-the-art speech recognition model fine-tuned for Luxembourgish vowels
Multiple Input Methods:
- 📁 Upload WAV audio files
- 🎤 Record live from microphone
- 📋 Use pre-existing example vowels
Real-time Classification: Instant vowel prediction with confidence scores
Debug Mode: Detailed audio processing visualization and information
User-friendly Interface: Clean Streamlit web interface

🔊 Supported Vowels

The system classifies 9 Luxembourgish vowel categories:

aː (long a)
eː (long e)
oː (long o)
ɑɪ
æːɪ
ɜɪ
əʊ
ɑʊ
æːʊ

🚀 Quick Start

Prerequisites

Python 3.8+
Git

Installation

Clone the repository:

git clone https://github.com/PeterGilles/luxembourgish-vowel-classifier.git
cd luxembourgish-vowel-classifier

Install dependencies:

pip install -r requirements.txt

Running the Application

streamlit run streamlit_hubert.py

The application will open in your browser at http://localhost:8501.

🖥️ How to Use

Choose Input Method:
- Upload WAV file: Select a .wav file containing a vowel sound
- Record from microphone: Click the record button and speak a vowel
- Use Example Vowels: Select from pre-loaded vowel samples
Get Prediction: The app automatically classifies the vowel and shows:
- Predicted vowel category
- Confidence percentage
- For examples: accuracy comparison with true label
Debug Mode: Enable in the sidebar for detailed information:
- Audio waveform visualization
- Processing parameters
- Technical details

🔬 Model Details

Base Model: Pre-trained HuBERT from Hugging Face (facebook/hubert-base-ls960)
Fine-tuning: Custom sequence classification head for Luxembourgish vowels
Training Data: 27,283 vowel segments from the Schnëssen corpus with the following distribution:
- aː: 7,905 samples
- eː: 4,812 samples
- oː: 3,703 samples
- ɜɪ: 3,384 samples
- ɑɪ: 2,588 samples
- æːɪ: 1,924 samples
- əʊ: 1,157 samples
- ɑʊ: 1,001 samples
- æːʊ: 809 samples
Sample Rate: 16kHz
Input Length: 90-300ms audio segments (max 250ms for training)
Training Configuration:
- Epochs: 8 (default)
- Batch Size: 8
- Learning Rate: 5e-5
- Optimizer: AdamW with weight decay (0.01)
- Augmentation: Time stretching (0-20% speed variation)
- Train/Validation Split: 80/20
- Loss Function: Cross-entropy with optional class weighting and label smoothing (0.1)
Audio Processing:
- Feature Extraction: Wav2Vec2FeatureExtractor with optimized parameters
- FFT Size: 1024
- Hop Length: 160 samples
- Window Length: 400 samples
- Padding: Max length with consistent strategy
Training Features:
- Automatic masking disabled for vowel classification
- Class weighting option for imbalanced data
- Best model selection based on validation accuracy
- Confusion matrix and classification report generation

🎵 Audio Requirements

For best results:

File format: WAV files
Duration: At least 0.5 seconds of sustained vowel sound
Quality: Clear recording without background noise
Content: Single vowel sound (not diphthongs or consonants)

🛠️ Technical Architecture

The application uses:

Streamlit: Web framework for the user interface
HuBERT: Transformer-based speech representation model
Librosa: Audio processing and analysis
PyTorch: Deep learning framework
st_audiorec: Browser-based audio recording

📞 Contact

Peter Gilles - @PeterGilles

Project Link: https://github.com/PeterGilles/luxembourgish-vowel-classifier

🔗 Model

The fine-tuned HuBERT model is available on Hugging Face: pgilles/vowel-classifier-hubert

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
examples		examples
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
streamlit_hubert.py		streamlit_hubert.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Luxembourgish Vowel Classifier

🎯 Features

🔊 Supported Vowels

🚀 Quick Start

Prerequisites

Installation

Running the Application

🖥️ How to Use

🔬 Model Details

🎵 Audio Requirements

🛠️ Technical Architecture

📞 Contact

🔗 Model

About

Uh oh!

Releases

Packages

Uh oh!

Languages

PeterGilles/luxembourgish-vowel-classifier

Folders and files

Latest commit

History

Repository files navigation

Luxembourgish Vowel Classifier

🎯 Features

🔊 Supported Vowels

🚀 Quick Start

Prerequisites

Installation

Running the Application

🖥️ How to Use

🔬 Model Details

🎵 Audio Requirements

🛠️ Technical Architecture

📞 Contact

🔗 Model

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages