Skip to content

v0.0.5

Compare
Choose a tag to compare
@chainyo chainyo released this 30 Aug 19:27
· 57 commits to main since this release

🚀 ASR Providers

  • Simplified the launch/get_transcription function for each ASR provider #55
  • Implemented WER data preparation for 5 providers #66 #67

🎯 Evaluation

  • Implemented the WER evaluation process #70

📁 Datasets

  • Added a new dataset google/fleurs for WER #77
  • Fixes for AMI to avoid server overload #59
  • Added the number of files per dataset splits #52
  • Added compatible metrics for each dataset #60
  • Removed one audio type for AMI to simplify transcription #59
  • Added code to prepare AMI dataset for WER evaluation #57

💬 CLI commands

  • Created a command to create plots from evaluation results #73
  • Improved list command #60
  • Added an audio-length command to get the duration of a dataset per split #74

⚙️ Tests

  • Added a lot of unit tests #51