This repository contains the dataset for the first Arabic automotive speech recognition system, focusing on the Moroccan dialect. With 20 in-car commands meticulously selected to enhance safety and reduce distraction, it's tailored for Arabic speakers.

SoufiyaneOuali/Automative-Morrocan-Arabic-Speech-Command-Datset

Automative-Morrocan-Arabic-Speech-Command-Datset

This repository contains the dataset used to build the first automotive speech recognition system for an Arabic dialect, specifically Moroccan Arabic.
The dataset includes 20 commonly used in-car commands, selected to help Arabic-speaking drivers drive more comfortably and safely by reducing distraction.

The Moroccan Arabic Speech Command Dataset (MASCD) Description

The Moroccan Arabic Speech Command Dataset (MASCD) was curated to develop a robust speech recognition system capable of identifying in-car voice commands spoken in Moroccan Arabic.

Dataset Details:

  • Format: (X, Y), where X represents the audio (input) and Y denotes the label of the audio.
  • Classes: The dataset comprises 20 command classes.
  • Total Audio Files: 2800 labeled audio samples.
  • Audio Length: Each audio file is approximately two seconds long.
  • Sampling Rate: 16 kHz with 16 bits per sample and mono channel.
  • Noise Level: 25% of the recordings were made in noisy conditions (e.g., in cars, on roads, in traffic), while 75% were made in clean environments.
  • Repetitions: Each command was recorded 10 times by 14 contributors, resulting in a total of 140 audio files per command.
  • Total Size: The dataset size is approximately 180 MB.

This dataset is designed to facilitate a seamless driving experience for Arabic speakers and to assist individuals with disabilities. Our goal is also to contribute to the advancement of voice-controlled systems for Arabic dialects.
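The figures listed above can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the authors' tooling: it assumes every clip is exactly two seconds of uncompressed PCM audio (the README says "approximately"), ignores file headers, and uses constant names chosen here for illustration.

```python
# Figures taken from the dataset details above.
COMMANDS = 20
CONTRIBUTORS = 14
REPETITIONS = 10
SAMPLE_RATE_HZ = 16_000
BYTES_PER_SAMPLE = 2      # 16 bits per sample
CHANNELS = 1              # mono
NOISY_PERCENT = 25

# Assumption: each clip is exactly 2 s of uncompressed PCM (headers ignored).
CLIP_SECONDS = 2

files_per_command = CONTRIBUTORS * REPETITIONS
total_files = COMMANDS * files_per_command
noisy_files = total_files * NOISY_PERCENT // 100
clean_files = total_files - noisy_files

bytes_per_clip = CLIP_SECONDS * SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS
total_megabytes = total_files * bytes_per_clip / 1_000_000

print(f"files per command: {files_per_command}")
print(f"total files:       {total_files}")
print(f"noisy / clean:     {noisy_files} / {clean_files}")
print(f"estimated size:    {total_megabytes:.1f} MB")
```

Under these assumptions the estimate comes to roughly 179 MB, which agrees with the stated size of approximately 180 MB.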

Dataset Structure

The dataset is stored in the "Dataset" folder within the repository. It contains 14 folders, one per contributor. Each contributor's folder contains 20 sub-folders, one per command, and each command sub-folder holds the 10 repetitions of that command.

Dataset File Structure:

  • Contributor-1
    • Command-1
      • Command-1 Repetition-1
        ...
      • Command-1 Repetition-10
        ...
    • Command-20
      • Command-20 Repetition-1
        ...
      • Command-20 Repetition-10
  • ...
  • ...
  • ...
  • Contributor-14
    • Command-1
      • Command-1 Repetition-1
        ...
      • Command-1 Repetition-10
        ...
    • Command-20
      • Command-20 Repetition-1
        ...
      • Command-20 Repetition-10
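
Given this layout, the (X, Y) pairs can be collected by walking the tree and taking each command folder's name as the label. The following is a minimal sketch under several assumptions: the actual folder and file names are not documented here, the `.wav` extension is a guess, and `index_dataset` is a hypothetical helper name, not something shipped with the repository.

```python
from pathlib import Path


def index_dataset(root):
    """Collect (audio_path, label) pairs from a contributor/command tree.

    Assumes the layout root/<contributor>/<command>/<recording>.wav,
    where the command folder name serves as the class label.
    """
    pairs = []
    root = Path(root)
    for contributor_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for command_dir in sorted(p for p in contributor_dir.iterdir() if p.is_dir()):
            # The ".wav" extension is an assumption; adjust to the real files.
            for audio_path in sorted(command_dir.glob("*.wav")):
                pairs.append((audio_path, command_dir.name))
    return pairs
```

On the full dataset, `index_dataset("Dataset")` would be expected to return 14 × 20 × 10 = 2800 pairs; a different count would suggest missing recordings or a different file extension.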

The methodology used in our paper is outlined in the following flowchart.

[Figure: flowchart of the methodology]

Selected Commands

Our paper uses 20 commands. To ensure their relevance and utility, we surveyed 12 participants and reviewed well-established commands commonly used in cars.
The selected commands, along with their translations from Arabic to English, are listed in the tables below.
[Tables: selected commands with their English translations]

Note:

The dataset used in this research is still in active use and is not publicly available. However, researchers who wish to use the dataset or our code for research purposes are welcome to request access. Please send an email to my official address, and I will gladly provide you with these resources.

Email: soufiyane.ouali@usmba.ac.ma
