Kasanoma ASR is a research-focused speech recognition project for African languages and atypical speech. The goal is to enable personalized, code-switched, and edge-deployable ASR systems with a focus on English & Twi.
- Personalized ASR for atypical or disordered speech
- Code-switched models for English and West African languages
- Edge deployment on low-power devices such as Raspberry Pi
- Assistive features for accessibility and disabled users
- project-app/: Next.js application, UI components, Firebase integration, and edge-ready frontend
- Dataset access and loading examples for the Kasa speech dataset
- Research and model development resources for code-switched ASR
The Project Kasa Dataset contains paired audio and text transcripts for ASR and NLP research. It is built to support code-switched speech recognition for West African language blends.
This dataset is hosted on Hugging Face as a gated repository. To access it:
- Create a Hugging Face account
- Request access on the repository page:
- Use a Hugging Face access token during download
Use the Hugging Face datasets library to load the data once access is granted:
```python
from datasets import load_dataset

# Authenticate first with `huggingface-cli login` or by setting HF_TOKEN
dataset = load_dataset(
    "Kennethdot/Ghana_English-Twi_Code-switching_ASR",
    token=True,  # replaces `use_auth_token`, which is deprecated in recent `datasets` releases
)

# Inspect the first training example
print(dataset["train"][0])
```

The dataset follows the standard Hugging Face speech format and includes fields such as:
- speaker_id
- audio (48 kHz waveform)
- transcription
- gender
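To make the field list concrete, here is a minimal sketch of checking a single record and computing its duration. It assumes the `audio` field uses the usual Hugging Face layout (a dict with `array` and `sampling_rate`), which has not been confirmed against the dataset card. The helper `summarize_example` and the synthetic record are hypothetical and exist only for illustration.

```python
from typing import Any, Dict

# Field names come from the list above. The nested "audio" layout
# ("array" + "sampling_rate") is an assumption based on the common
# Hugging Face audio convention.
EXPECTED_FIELDS = ("speaker_id", "audio", "transcription", "gender")


def summarize_example(example: Dict[str, Any]) -> Dict[str, Any]:
    """Check that a record has the expected fields and report basic stats."""
    missing = [name for name in EXPECTED_FIELDS if name not in example]
    if missing:
        raise KeyError(f"example is missing fields: {missing}")

    audio = example["audio"]
    sampling_rate = audio["sampling_rate"]
    num_samples = len(audio["array"])
    return {
        "speaker_id": example["speaker_id"],
        "sampling_rate": sampling_rate,
        "duration_seconds": num_samples / sampling_rate,
        "num_words": len(example["transcription"].split()),
    }


# Synthetic stand-in for dataset["train"][0]: one second of silence at 48 kHz.
# All values here are placeholders, not real dataset content.
fake_example = {
    "speaker_id": "spk_001",
    "audio": {"array": [0.0] * 48_000, "sampling_rate": 48_000},
    "transcription": "placeholder transcript text",
    "gender": "unspecified",
}
print(summarize_example(fake_example))
```

Once access is granted, the same helper could be called on `dataset["train"][0]` in place of the synthetic record, provided the real field names match the list above.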
- project-app/: main app folder
  - src/app/: Next.js pages and layout
  - src/components/: reusable UI components and audio recording interface
  - src/firebase/: Firebase config and authentication utilities
  - src/lib/: helper utilities, prompts, and placeholder assets
- notebooks/: exploratory notebooks for setup and Whisper experimentation
- requirements.txt / environment.yml: Python environment dependencies
- Clone the repository
- Install dependencies for the app and/or Python notebooks
- Configure Hugging Face access and Firebase credentials as needed
- Run the Next.js app from project-app/ or explore the dataset notebooks
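The setup steps above can be sanity-checked with a small preflight script. This is a sketch rather than part of the repository: the function `check_setup` is hypothetical, and it only looks for the folder and file names listed in this README plus the `HF_TOKEN` environment variable. Firebase credentials are not checked because their configuration is not described here.

```python
import os
from pathlib import Path
from typing import List, Mapping


def check_setup(repo_root: Path, env: Mapping[str, str] = os.environ) -> List[str]:
    """Return a list of human-readable problems; an empty list means ready."""
    problems = []

    # Folders named in this README.
    for folder in ("project-app", "notebooks"):
        if not (repo_root / folder).is_dir():
            problems.append(f"{folder}/ directory not found")

    # Either dependency file is enough for the Python environment.
    env_files = ("requirements.txt", "environment.yml")
    if not any((repo_root / name).is_file() for name in env_files):
        problems.append("neither requirements.txt nor environment.yml found")

    # A cached `huggingface-cli login` also works; this only checks the variable.
    if not env.get("HF_TOKEN"):
        problems.append("HF_TOKEN is not set")

    return problems


if __name__ == "__main__":
    for problem in check_setup(Path.cwd()):
        print("warning:", problem)
```

Run from the repository root, the script prints one warning per missing item and nothing when every check passes.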
The dataset is available under the CC BY-NC 4.0 license for research and educational use. Please cite the Project Kasa contributors if you use this work in academic or professional settings.