MiniCPM_FT

Introduction

MiniCPM_FT is a repository for fine-tuning the base model MiniCPM-2B-sft-fp32 using datasets from Hugging Face Datasets and Cosmos. This repository provides the necessary tools and code to finetune the model and evaluate its performance on various datasets.

Paper

The paper associated with this repository is available here.

Dataset

This repository utilizes the following datasets:

Repository Structure

cleandata: Contains separately reconstructed datasets from Cosmos, Trivia QA Wikipedia, and Trivia QA Web. Data is reformatted into standard question-answer pairs.
mergedata: Represents a composite dataset split into train, development, and test datasets derived from the cleaned datasets.
finetune: Contains all the code necessary for the complete process of fine-tuning the base model. To produce a fine-tuned model, use bash {}_finetune.sh.
models: Stores examples of the base and fine-tuned models.
evaluate: Contains codes and sample data for evaluating the fine-tuned model's performance on given datasets. Results are stored in the result folder.

Usage

To finetune the base model, follow these steps:

Clone the repository:

git clone https://github.com/InezYu0928/MiniCPM_FT.git

Navigate to the Finetune folder:
```
cd MiniCPM_FT/finetune
```
Execute the finetuning script:
```
bash xxx_finetune.sh
```

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
dataclean		dataclean
datamerge		datamerge
evaluate		evaluate
finetune		finetune
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MiniCPM_FT

Introduction

Paper

Dataset

Repository Structure

Usage

About

Releases

Packages

Languages

InezYu0928/MiniCPM_FT

Folders and files

Latest commit

History

Repository files navigation

MiniCPM_FT

Introduction

Paper

Dataset

Repository Structure

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages