Popular repositories Loading
-
structure-gen
structure-gen PublicLet Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models
-
streambench-final-project
streambench-final-project PublicThe final project for Applied Deep Learning (ADL) 2024 @NTU lectured by Prof. Yun-Nung (Vivian) Chen. The project is based on the paper StreamBench: Towards Benchmarking Continuous Improvement of L…
-
i-need-help
i-need-help PublicWe propose a framework for LLMs to seek user support, design evaluation metrics to measure the trade-off between performance boost and user burden, and empirically assess this ability on Text-to-SQ…
Python 2
-
-
language-matters
language-matters PublicIn which language do these models reason when solving problems presented in different languages? Our findings reveal that, despite multilingual training, LRMs tend to default to reasoning in high-r…
Python 1
Repositories
- language-matters Public
In which language do these models reason when solving problems presented in different languages? Our findings reveal that, despite multilingual training, LRMs tend to default to reasoning in high-resource languages (e.g., English) at test time
appier-research/language-matters’s past year of commit activity - structure-gen Public
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models
appier-research/structure-gen’s past year of commit activity - robust-llm-finetunes Public
appier-research/robust-llm-finetunes’s past year of commit activity - streambench-final-project Public
The final project for Applied Deep Learning (ADL) 2024 @ntu lectured by Prof. Yun-Nung (Vivian) Chen. The project is based on the paper StreamBench: Towards Benchmarking Continuous Improvement of Language Agents.
appier-research/streambench-final-project’s past year of commit activity - i-need-help Public
We propose a framework for LLMs to seek user support, design evaluation metrics to measure the trade-off between performance boost and user burden, and empirically assess this ability on Text-to-SQL generation.
appier-research/i-need-help’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…