ASR Evaluation Pipeline for Clinical Applications with Older Adults

This repository contains the code and evaluation pipeline for the research paper: "Out of the Box, into the Clinic? Evaluating State-of-the-Art ASR for Clinical Applications for Older Adults".

Overview

This pipeline evaluates state-of-the-art Automatic Speech Recognition (ASR) models on Dutch speech from older adults, comparing their performance on both clinical conversation data (Welzijn.AI chatbot interactions) and general speech data (Mozilla Common Voice). The evaluation focuses on accuracy-speed trade-offs and model generalization capabilities.

What This Pipeline Does

Audio Processing: Converts and segments audio files, performs speaker diarization to separate different speakers
ASR Evaluation: Tests multiple ASR models including:
- Generic multilingual models (Whisper variants, Voxtral)
- Dutch-specific models (wav2vec2-xls-r-1b-dutch-3, whisper-native-elderly-9-dutch)
Performance Analysis: Computes Word Error Rate (WER) and processing time metrics
Comparative Study: Evaluates models on two datasets:
- Welzijn.AI (clinical conversations with older adults) - referred to as "Beatrix" in the code
- Mozilla Common Voice (general Dutch speech of older adults)

Key Findings

Generic multilingual models often outperform fine-tuned models
Model truncation helps balance accuracy-speed trade-offs
Some models show high WER due to hallucinations and mishearings

Data Privacy

Due to privacy concerns, no audio files or transcripts are provided in this repository. The pipeline is designed to work with your own audio data following the same structure as described in the paper.

Requirements

Install dependencies with:

pip install -r requirements.txt

Usage

The main analysis is conducted through the analysis.ipynb notebook, which processes audio data, runs ASR models, and generates performance comparisons and visualizations.

Paper Reference

This paper is currently a preprint at arXiv:2508.08684, but is accepted for publication at the HCINLP workshop @ EMNLP 2025.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.secrets		.secrets
data		data
output		output
utils		utils
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
analysis.ipynb		analysis.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ASR Evaluation Pipeline for Clinical Applications with Older Adults

Overview

What This Pipeline Does

Key Findings

Data Privacy

Requirements

Usage

Paper Reference

About

Uh oh!

Releases

Packages

Languages

bma-vandijk/asr_pipelines

Folders and files

Latest commit

History

Repository files navigation

ASR Evaluation Pipeline for Clinical Applications with Older Adults

Overview

What This Pipeline Does

Key Findings

Data Privacy

Requirements

Usage

Paper Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages