AliMeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario. The project, served as baseline, is divided into two parts, named speaker and asr. A detailed description of each system is in their respective README.md. The goal of this project is to simplify the training and evaluation procedure and make it easy and flexible for participants to carry out experiments and verify neural network based methods.

Setup

git clone https://github.com/qq379840315/AliMeeting.git

Introduction

ASR: Train and evaluate the asr model.
Speaker Diarization: Generate the speaker diarization results.

General steps

Generate training data for speaker diarization and asr model and evaluation data for both track.
Do speaker diarization to generate rttm which includes vad and speaker diarization information, and then generate DER results.
Train single-speaker and multi-speaker ASR model respectively.
Generate CER results for both ASR models.

Citation

If you use AliMeeting dataset and baseline system of M2MeT Challenge in a publication, please cite the following paper:

@article{yu2021m2met,
title={M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge},
author={Yu, Fan and Zhang, Shiliang and Fu, Yihui and Xie, Lei and Zheng, Siqi and Du, Zhihao and Huang, Weilong and Guo, Pengcheng and Yan, Zhijie and Ma, Bin and others},
journal={arXiv preprint arXiv:2110.07393},
year={2021}
}

The paper is available at https://arxiv.org/abs/2110.07393

The data will be sent to all challenge participants through email.

Contributors

Code license

Apache 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
asr		asr
speaker		speaker
.DS_Store		.DS_Store
README.md		README.md
alibaba.ai		alibaba.ai
alibaba.png		alibaba.png
fig_aishell.jpg		fig_aishell.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

asr

asr

speaker

speaker

.DS_Store

.DS_Store

README.md

README.md

alibaba.ai

alibaba.ai

alibaba.png

alibaba.png

fig_aishell.jpg

fig_aishell.jpg

Repository files navigation

AliMeeting

Setup

Introduction

General steps

Citation

Contributors

Code license

About

Releases

Packages

Languages

chang111/AliMeeting

Folders and files

Latest commit

History

Repository files navigation

AliMeeting

Setup

Introduction

General steps

Citation

Contributors

Code license

About

Resources

Stars

Watchers

Forks

Languages