This repository hosts a study that investigates the convergence between Artificial Intelligence (AI) and educational robotics, proposing fine-tuning as a solution to the challenges faced by teams in this context. From its conception to its current advancements, AI plays a fundamental role in society, and educational robotics, focused on STEM, adopts innovative practices to educate students but faces obstacles such as resource scarcity and disparities between teams in competitions.
The main goal of this work is to explore the application of fine-tuning as a tool to disseminate knowledge equitably among educational robotics teams. We aim to overcome obstacles, drive development, and strengthen the community through this innovative approach.
All code and files related to the development of the application can be found here.
The project is divided into three main parts:
- Backend: Consumes the model created through the OpenAI API and exposes a route via Fastify for the frontend to connect to. It also handles business logic, such as data validation and information transmission.
- Frontend: Responsible for user interaction with the trained model, as well as the insertion of team information. Specific components were created for each part, using React hooks to manage state and send data in requests to the backend.
- Fine-tuning: Here, the fundamentals of Machine Learning are applied, along with the best practices recommended by OpenAI:
- First, data on the topic was collected and generated;
- Then, the data was split into 70% for training and 30% for testing, as in the sketch below;
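As an illustration of this step, here is a minimal sketch of a 70/30 split over a `.jsonl` dataset; the file names and the random seed are assumptions for the example, not values from this repository:

```python
import json
import random

# Load every example (one JSON object per line) from the source file
with open("dataset.jsonl", "r", encoding="utf-8") as f:  # hypothetical file name
    examples = [json.loads(line) for line in f if line.strip()]

# Shuffle with a fixed seed so the split is reproducible
random.seed(42)
random.shuffle(examples)

# 70% of the examples go to training, the remaining 30% to testing
cut = int(len(examples) * 0.7)
splits = {"train.jsonl": examples[:cut], "test.jsonl": examples[cut:]}

for name, subset in splits.items():
    with open(name, "w", encoding="utf-8") as f:
        for example in subset:
            f.write(json.dumps(example, ensure_ascii=False) + "\n")
```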
- Next, the data was structured as specified in the OpenAI documentation (see the example below or click here to view the entire file [in Portuguese]):
{"messages": [{"role": "system", "content": "You are a technical assistant helping a FIRST LEGO League team in the robot design process, which includes assembly with LEGO pieces (wheels, sensors, controllers, etc.), block programming, mission table strategy, documentation, presentation, continuous improvement, critical thinking, proactivity, and teamwork."}, {"role": "user", "content": "How can we adjust the PID controller settings to meet the specific needs of our robot, considering the mission strategy?"}, {"role": "assistant", "content": "Conduct practical tests, collect performance data, and adjust the PID parameters based on the results to optimize robot control."}]}
- After that, the data was validated and some metrics were generated, such as (example output, generated for the first training):

```
Num samples: 196
No errors found
Num examples missing system message: 0
Num examples missing user message: 0

Distribution of num_messages_per_example:
min / max: 3, 23
mean / median: 3.4591836734693877, 3.0
p5 / p95: 3.0, 3.0

Distribution of num_total_tokens_per_example:
min / max: 65, 761
mean / median: 121.99489795918367, 106.0
p5 / p95: 83.0, 135.0

Distribution of num_assistant_tokens_per_example:
min / max: 18, 450
mean / median: 58.755102040816325, 44.0
p5 / p95: 33.0, 86.5

0 examples may be over the 4096 token limit, they will be truncated during fine-tuning
Dataset has ~23911 tokens that will be charged for during training
By default, you'll train for 3 epochs on this dataset
By default, you'll be charged for ~71733 tokens
```
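Statistics like these come from a validation pass in the spirit of the OpenAI Cookbook's data-preparation example (linked at the end of this README). A minimal sketch of the core token counting, assuming the `tiktoken` library and a `dataset.jsonl` file:

```python
import json
import tiktoken

encoding = tiktoken.get_encoding("cl100k_base")

def num_tokens_from_messages(messages, tokens_per_message=3, tokens_per_name=1):
    """Approximate token count for one chat example (cookbook heuristic)."""
    num_tokens = 0
    for message in messages:
        num_tokens += tokens_per_message
        for key, value in message.items():
            num_tokens += len(encoding.encode(value))
            if key == "name":
                num_tokens += tokens_per_name
    return num_tokens + 3  # every reply is primed with an assistant header

with open("dataset.jsonl", "r", encoding="utf-8") as f:
    dataset = [json.loads(line) for line in f if line.strip()]

counts = [num_tokens_from_messages(ex["messages"]) for ex in dataset]
print(f"Num samples: {len(dataset)}")
print(f"min / max: {min(counts)}, {max(counts)}")
print(f"mean: {sum(counts) / len(counts)}")
```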
- With the data validated, the `.jsonl` file is uploaded to the OpenAI environment, where the new model will be trained;
- Finally, the model is trained on the uploaded data and becomes available for use in the OpenAI playground or through the API, which is the case for this project.
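These last two steps can be scripted with the OpenAI Python SDK (v1.x). A sketch, reusing the training split from earlier; the base model name is an assumption, since this README does not state which one was used:

```python
from openai import OpenAI  # assumes openai >= 1.0

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload the validated .jsonl training file
training_file = client.files.create(
    file=open("train.jsonl", "rb"),
    purpose="fine-tune",
)

# Start the fine-tuning job on top of a base chat model
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",  # assumption: the base model is not stated here
)
print(job.id, job.status)

# Once the job succeeds, the resulting model id (ft:...) can be queried
# through the regular chat completions API, as this project's backend does:
response = client.chat.completions.create(
    model="ft:gpt-3.5-turbo:your-org::abc123",  # placeholder fine-tuned model id
    messages=[{"role": "user", "content": "How should we plan our mission strategy?"}],
)
print(response.choices[0].message.content)
```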
Note: It is important to highlight that fine-tuning (PEFT) allows for the use of smaller amounts of data for training the model.
Metrics
The following graphs show the training loss for the first (Woodie) and the last (Woodie2) models generated.
The graphs plot the loss (y-axis) against the training steps (x-axis), with significant variation over the run. For the first model, the final loss of 0.9696 at step 581 follows an upward trend and then a decline, indicating that the loss could still be reduced significantly toward the ideal value of 0; the second graph already approaches it, reaching 0.0073 at step 1901.
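If you want to pull the per-step training loss yourself instead of reading it off the OpenAI dashboard, the job events can be listed through the SDK. A sketch, with a placeholder job id:

```python
from openai import OpenAI  # assumes openai >= 1.0

client = OpenAI()

# List the most recent events for a fine-tuning job; training-loss
# values are reported in the event messages as the job progresses
events = client.fine_tuning.jobs.list_events(
    fine_tuning_job_id="ftjob-abc123",  # placeholder job id
    limit=50,
)
for event in events.data:
    print(event.created_at, event.message)
```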
- Clone this repository;
- Create a new `.env` file to insert your credentials:

```
# BACKEND
DATABASE_URL="your_db_url"                 # I used a container with PostgreSQL
OPENAI_KEY="your_api_key_here"
MAIL_USER="your_mailing_service_user"      # optional
MAIL_PASS="your_mailing_service_password"  # optional

# FINE-TUNING
OPENAI_KEY="your_api_key_here"
FILE_ID="your_uploaded_file_id"

# FRONTEND
VITE_AUTH0_DOMAIN="your_auth0_domain"
VITE_AUTH0_CLIENT_ID="your_auth0_client_id"
```
2.1 You can also create two additional `.env` files, `.env.production` and `.env.development`, to better organize and separate your variables:

```
VITE_AUTH0_CALLBACK_URL=<callback_url_of_your_env_prod_or_dev>
VITE_CONNECTION_SERVER=<server_url_of_your_env_prod_or_dev>
```
- Next, to install the dependencies, run the following in each of the /frontend and /backend directories:

```
npm install
```
- Start your database.
  Note: If you are using Docker, remember to pull the image of your preferred database (the one set as the "provider" in `schema.prisma`) and insert the correct database URL in your backend `.env` file. Then, to start the container, run in your terminal:

```
docker start <container_id>
```

  or just press the play button in the Docker Desktop interface.
- To browse the database through the ORM abstraction (Prisma Studio's visual interface), run:

```
npx prisma studio
```
- Then, run the following command (in both the frontend and backend directories) to start the application:

```
npm run dev
```
After these steps, the application will be ready to use. Click here to see a demonstration video.
- Note: Endpoints can be tested directly through the `routes.http` file.
- Python (v3.10)
- Numpy
- OpenAI (API key and library)
- TypeScript (v5.0.2)
- ReactJS (v18.2.0)
- TailwindCSS
- Axios
- NodeJS (v18.17.1)
- Fastify (v4.23.0)
- PrismaORM (v5.2.0)
- Docker
- Database Optimization
- Automated Testing, CI/CD pipelines, and official release
- Save and Select Prompts for Consultation or Reuse
- Selection of Models for Different Competitions, Categories, etc.
- Customization of Models (Open-source)
- OpenAI: https://platform.openai.com/docs/guides/fine-tuning
- OpenAI Cookbook: https://cookbook.openai.com/examples/chat_finetuning_data_prep