I am doing PhD on NLP, especially on Efficient LLMs and Robust LLMs evaluation, at University of Surrey, UK.
I am currently a Student Researcher at Microsoft Research.
Publicaitons list at: https://liyucheng09.github.io/
I am doing PhD on NLP, especially on Efficient LLMs and Robust LLMs evaluation, at University of Surrey, UK.
I am currently a Student Researcher at Microsoft Research.
Publicaitons list at: https://liyucheng09.github.io/
[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
The first Chinese metaphor corpus serving for identification and generation. 中文比喻数据集. Presented at COLING 2022.
Lightweight tool to identify Data Contamination in LLMs evaluation
FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning. Presented at EACL 2023.