Skip to content
View duyichao's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report duyichao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
duyichao/README.md

Hi there πŸ‘‹

🍻 Welcome stranger

  • πŸŽ“: I am a first second-year PhD student at the University of Science and Technology of China (USTC), supervised by Professor Enhong Chen. Currently, I am working as a research intern at Tencent AI Lab, mentored by Zhirui Zhang. Prior to that, I worked as a research intern in the Language Technology Lab at Alibaba DAMO Academy.
  • 🎯: I am working in the field of NLP & Data Mining. My research interests and past experience can be summarized as follows:
    • Pre-trained Language Models: Foundation Model; Multimodal LLM
    • Multilingual and Multimodal Translation: End-to-End Speech Translation; Multilingual Neural Machine Translation
    • Data mining in healthcare: ICD coding; Extreme Multi-Label Classification
    • Knowledge Graph: Entity Linking

πŸ“° What's New?

  • [2023-10-08] Three papers are accepted by EMNLP'2023, about theoretical understanding of kNN-MT, interactive machine translation and interventional rationalization!
  • [2023-07-10] Our MineTrans speech translation system is accepted by IWSLT'2023, which win the championship in the Speech-to-Speech Translation (End-to-End subtrack).
  • [2023-01-21] Two papers are accepted by ICLR'2023, about federated machine translation and efficient kNN-MT!
  • [2022-10-06] One paper is accepted by EMNLP'2022, about domain adaptation for speech translation!
  • [2022-09-13] One paper is accepted by NeurIPS'2022, about disentangled representation learning!

πŸŽ‰ Publications

2023

  • Xu Huang, Zhirui Zhang, Ruize Gao, Yichao Du, Lemao Liu, Guoping Huang, Shuming Shi, Jiajun Chen, Shujian Huang. IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems. In EMNLP'2023 (Oral, Top3%).
  • Ruize Gao, Zhirui Zhang, Yichao Du, Lemao Liu, Rui Wang. Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer. In EMNLP'2023.
  • Linan Yue, Qi Liu, Li Wang, Yanqing An, Yichao Du, Zhenya Huang. Interventional Rationalization. In EMNLP'2023.
  • Yichao Du, Zhengsheng Guo, Jinchuan Tian, Zhirui Zhang, Xing Wang, Jianwei Yu, Zhaopeng Tu, Tong Xu and Enhong Chen. The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks. In IWSLT'2023.
  • Yichao Du, Zhirui Zhang, Bingzhe Wu, Lemao Liu, Tong Xu, and Enhong Chen. Federated Nearest Neighbor Machine Translation. In ICLR'2023.
  • Yuhan Dai, Zhirui Zhang, Qiuzhi Liu, Qu Cui, Weihua Li, Yichao Du and Tong Xu. Simple and Scalable Nearest Neighbor Machine Translation. In ICLR'2023.

2022

  • Yichao Du, Weizhi Wang, Zhirui Zhang, Boxing Chen, Tong Xu, Jun Xie, and Enhong Chen. Non-Parametric Domain Adaptation for End-to-end Speech Translation. In EMNLP'2022 (Oral, Top3%).
  • Linan Yue, Qi Liu, Yichao Du, Yanqing An, Li Wang and Enhong Chen. DARE: Disentanglement-Augmented Rationale Extraction. In NeurIPS'2022 (Spotlight, Top8%).
  • Yichao Du, Zhirui Zhang, Weizhi Wang, Boxing Chen, Jun Xie, Tong Xu, and Enhong Chen. Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement. In AAAI'2022.

2021

  • Yichao Du, Pengfei Luo, Xudong Hong, Tong Xu, Zhe Zhang, Chao Ren, Yi Zheng, Enhong Chen. Inheritance-guided Hierarchical Assignment for Clinical Automatic Diagnosis. In DASFAA'2021.
  • Weizhi Wang, Zhirui Zhang, Yichao Du, Boxing Chen, Jun Xie, Weihua Luo. Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables. In EMNLP'2021.
  • Zhe Zhang, Tong Xu, Le Zhang, Yichao Du, Hui Xiong, and Enhong Chen. Knowledge Powered Cooperative Semantic Fusion for Patent Classification. In CICAI'2021. (Best Student Paper Finalist)

2020

  • Yichao Du, Tong Xu, Jianhui Ma, Enhong Chen, Yi Zheng, Tongzhu Liu, Guixian Tong. An Automatic ICD Coding Method for Clinical Records Based on Deep Neural Network. In Big Data Research.

πŸ† Competition

  • IWSLT 2023: ranked 1st (English->Chinese) in Speech-to-Speech Translation Track (End-to-End subtrack).

Popular repositories

  1. E2E-ST-TDA E2E-ST-TDA Public

    Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"

    Python 18 3

  2. MINETrans-IWSLT23 MINETrans-IWSLT23 Public

    Official implementation of our IWSLT 2023 paper "The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks"

    Python 12

  3. NPDA-KNN-ST NPDA-KNN-ST Public

    Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"

    Python 11 3

  4. floodlight floodlight Public

    Server cluster load balancing

    Java

  5. hexo-theme-tranquilpeak hexo-theme-tranquilpeak Public

    Forked from LouisBarranqueiro/hexo-theme-tranquilpeak

    πŸ’„ A gorgeous responsive theme for Hexo blog framework

    CSS

  6. GPT2 GPT2 Public

    Forked from ConnorJL/GPT2

    An implementation of training for GPT2, supports TPUs

    Python