Popular repositories
- mathtutorbench Public
Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors
Python 5
- verify-then-generate Public
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors, EMNLP 2024
Python 3
Repositories
Showing 10 of 12 repositories
- Productive-Failure-Problems Public
- mathgap-experiments Public
- verify-then-generate Public
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors, EMNLP 2024
- engage-your-readers Public
- solving-biases Public
Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners? ICML 2024