ryokamoi

Follow

Ryo Kamoi ryokamoi

Follow

PhD Student at Penn State University. Building trustworthy and reliable NLP systems.

47 followers · 36 following

Penn State University
https://ryokamoi.github.io
@ryokamoi
https://scholar.google.com/citations?user=4OWTLKAAAAAJ&hl=en

Achievements

Achievements

Highlights

Pro

ryokamoi/README.md

I am a Ph.D. student at Penn State University advised by Dr. Rui Zhang. I’m interested in building reliable and trustworthy NLP systems.

[Personal Website] [Google Scholar] [Semantic Scholar]

Datasets

ReaLMistake [huggingface dataset] [code]
- Paper: Evaluating LLMs at Detecting Errors in LLM Responses (COLM 2024)
- Benchmark for evaluating error detection methods that detect mistakes in LLM responses
- Expert error annotations on responses from GPT-4 and Llama 2 70B on three tasks
WiCE [dataset and code]
- Paper: WiCE: Real-World Entailment for Claims in Wikipedia (EMNLP2023)
- Dataset for document-level NLI
- Fine-grained textual entailment dataset built on natural claim and evidence pairs extracted from Wikipedia

Other Resources

Shortcomings of Question Answering Based Factuality Frameworks for Error Localization [human annotation]
- Paper: Shortcomings of Question Answering Based Factuality Frameworks for Error Localization (EACL2023)

Pinned Loading

psunlpgroup/ReaLMistake psunlpgroup/ReaLMistake Public

This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".

Python 25 3
wice wice Public

This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.

Python 39 1
QA-metrics-human-annotation QA-metrics-human-annotation Public

Human Generated Questions in ''Shortcomings of Question Answering Based Factuality Frameworks for Error Localization'' (EACL2023)

1
llm_wrapper llm_wrapper Public

This repository includes an unofficial wrapper for LLMs, such as OpenAI API and HuggingFace models, mainly for caching LLM responses to avoid duplicate inference with the identical prompts.

Python 1

BibTex file for LLMs

1

comment = {Proprietary Models}

2

3

comment = {GPT-4}

4

5

@article{openai2023gpt4,

BibTex file for LVLMs

1

comment = {Proprietary Models}

2

3

comment = {OpenAI}

4

5

comment = {GPT-4o}