Skip to content

chillestt/Automated-IELTS-essay-evaluation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

A STUDY OF REINFORCEMENT LEARNING AND ITS APPLICATION IN IELTS WRITING TASK 2 EVALUATION

Project Overview

  • Project Name: A STUDY OF REINFORCEMENT LEARNING AND ITS APPLICATION IN IELTS WRITING TASK 2 EVALUATION
  • Team Size: 1 (Leader: Nguyen Minh Chi)
  • Time: December 2023 - March 2024

Demo

Description

This project focuses on leveraging the power of Large Language Models (LLMs) and reinforcement learning techniques for automated evaluation of IELTS Writing Task 2 essays. By fine-tuning LLMs and implementing reinforcement learning algorithms, we aim to develop a robust system capable of accurately assessing the quality of IELTS essays, providing valuable feedback to test takers and educators.

Activities

  • Web Scraping: Gathered IELTS essays from online sources for dataset creation.
  • Synthetic Data Generation: Utilized Gemini APIs, GPT-3.5-Turbo, and Groq for generating synthetic data to augment the dataset.
  • Model Fine-tuning: Fine-tuned Mistral-7b, Llama-2-7b, and Gemma-7b models using QLoRA technique. Training process conducted on a single GPU, leveraging Google Colab's free T4 Tesla GPU.
  • Model Selection: Identified Mistral-7b as the model with the best performance and forwarded it to the Direct Preference Optimization (DPO) step.
  • Preference Dataset Acquisition: Acquired preference dataset for DPO training.
  • Fine-tuning with DPO: Further fine-tuned the model using the preference dataset and DPO technique.

SFT Dataset

Preference Dataset

SFT model

DPO model

Additional Information

For more details on the project methodology, dataset preparation, and results, please refer to the project documentation and code repository.


This project was developed by Nguyen Minh Chi as part of a research study. For inquiries or collaboration opportunities, please contact Nguyen Minh Chi.

About

Automated Writing Evaluation system

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published