alibaba-damo-academy/SpokenNLP

SpokenNLP

SpokenNLP is the official repository for the codebases of a wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

🔥 News

  • [2024-02-05]: SLD was accepted by ICASSP 2024. It is a novel approach that applies a KL divergence loss with smoothed labels on speech tokens for discrete-token-based ASR.
  • [2023-10-23]: Ditto was accepted by EMNLP 2023. It is a learning-free approach that uses model-based importance estimations to weight words and compute sentence embeddings from pre-trained model representations.
  • [2023-10-07]: Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling was accepted by EMNLP 2023. It enhances the pretrained language model’s ability to capture coherence from both structure and similarity perspectives, further improving topic segmentation performance.
  • [2023-05-22]: PoNet was added to the Hugging Face Hub. PoNet can now be used directly through the Transformers library.
  • [2022-12-02]: alimeeting4mug released the official baseline system codebase for the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)!
  • [2022-02-24]: MDERank was accepted by Findings of ACL 2022. It is a Masked Document Embedding Rank approach for unsupervised keyphrase extraction, which outperforms state-of-the-art unsupervised keyphrase extraction approaches, especially on long documents.
  • [2022-01-21]: PoNet was accepted by ICLR 2022. It is a novel Pooling Network (PoNet) for token mixing in long sequences with linear complexity, which achieves a good balance among transfer learning capability, accuracy, and complexity for long sequence modeling. Models are released on ModelScope (English and Chinese).
  • [2021-09-11]: SeqModel was accepted by IEEE ASRU 2021. It is a sequence model with a self-adaptive sliding window for efficient spoken document segmentation. A new Chinese Wikipedia-based document segmentation dataset, Wiki-zh, was also released. Models are released on ModelScope (English and Chinese).
  • [2019-02-28]: JointBERT was proposed for joint intent classification and slot filling with BERT. A third-party PyTorch implementation of JointBERT is available.
  • [2018-10-17]: ESIM ranked first on both datasets in the DSTC7 Noetic End-to-End Response Selection track!

📝 License

SpokenNLP is released under the Apache License 2.0. This project contains various third-party components under other open source licenses.
