Model Card

Overview

This model is a 156M-parameter English-language causal language model trained on a large-scale text corpus and instruction-tuned for general question answering and task completion.

How to run

To run the test script (a simple question-answering session with the fine-tuned model):

Download the model weights from Hugging Face (https://huggingface.co/firdavsus/LLM-D2).
Place the weights in the fine_tuned_model/ folder.
Run "python3 test.py".
Type a question at the prompt; type 'stop' to exit.
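
The download step can also be scripted. The following is a minimal sketch, assuming the huggingface_hub package is installed and that test.py reads the weights from fine_tuned_model/ as described above. The helper name download_weights is hypothetical and is not part of this repository.

```python
from pathlib import Path

REPO_ID = "firdavsus/LLM-D2"           # Hugging Face repository named in the steps above
TARGET_DIR = Path("fine_tuned_model")  # folder the steps above say the weights go in


def download_weights(repo_id: str = REPO_ID, target: Path = TARGET_DIR) -> Path:
    """Fetch all files of the Hugging Face repo into `target` (hypothetical helper)."""
    # Imported lazily so this module still loads without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    target.mkdir(parents=True, exist_ok=True)
    snapshot_download(repo_id=repo_id, local_dir=str(target))
    return target
```

Calling download_weights() once before running "python3 test.py" should leave the weights where the script expects them, provided the file layout on Hugging Face matches what test.py loads.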

train.py is the training script.
fine_tune.py is the fine-tuning script.
eval.py is a script that evaluates the raw model on HellaSwag using loss.

Model Details

Model size: 156M parameters
Architecture: Transformer (causal LM)
Tokenizer: GPT-2 tokenizer
Languages: English only

Training Curves

See training_curves.png and loss.png in the repository.

Training Data

Pretraining

Dataset: The Pile (10B token subset)
Domain: mixed-domain text (web, books, articles, code, etc.)

Instruction Fine-tuning

Dataset: Alpaca (cleaned subset)
Size: ~50,000 instruction–response examples
Formatting: instruction-style prompt/response pairs
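
For illustration, an instruction-style prompt/response pair can be assembled as below. This sketch uses the widely known Alpaca template for examples without an input field; whether fine_tune.py uses exactly this template is an assumption, and the function names are hypothetical.

```python
# Standard Alpaca template for examples that have no separate "input" field.
# Assumption: the template used by fine_tune.py may differ from this one.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Format an instruction as the prompt shown to the model at inference time."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def build_training_text(instruction: str, response: str) -> str:
    """Concatenate prompt and target response into one training example."""
    return build_prompt(instruction) + response.strip()


example = build_training_text("Name the capital of France.", "Paris.")
print(example)
```

Using the same template at inference time as during fine-tuning matters for a model of this size, since instruction-following quality depends strongly on the prompt format.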

Training Setup

Pretraining

Steps: 218,000
Final training loss: 2.6

Post-training (Instruction Fine-tuning)

Steps: 2,500
Final training loss: 1.9
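
The reported losses can be read as perplexities. The sketch below assumes each loss is a mean per-token cross-entropy measured in nats (the usual convention for causal LM training, but not stated in this card), in which case perplexity is exp(loss).

```python
import math


def perplexity(mean_loss_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_loss_nats)


pretrain_ppl = perplexity(2.6)  # final pretraining loss from this card
finetune_ppl = perplexity(1.9)  # final fine-tuning loss from this card

print(f"pretraining perplexity: {pretrain_ppl:.1f}")  # 13.5
print(f"fine-tuning perplexity: {finetune_ppl:.1f}")  # 6.7
```

The two values are measured on different datasets (The Pile versus Alpaca), so they are not directly comparable.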

Evaluation

Benchmark	Score
HellaSwag	28.5
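
A common way to score HellaSwag with a language model is to compute the loss of each of the four candidate endings given the context and pick the ending with the lowest mean per-token loss. The sketch below illustrates that selection rule on made-up loss values. It assumes, without confirmation from the source, that eval.py follows this scheme and that the score above is accuracy in percent. The function names are hypothetical.

```python
from statistics import mean


def pick_ending(token_losses_per_ending):
    """Return the index of the ending with the lowest mean per-token loss."""
    avg = [mean(losses) for losses in token_losses_per_ending]
    return min(range(len(avg)), key=avg.__getitem__)


def accuracy(examples):
    """examples: list of (token_losses_per_ending, gold_index) pairs; returns percent."""
    correct = sum(pick_ending(losses) == gold for losses, gold in examples)
    return 100.0 * correct / len(examples)


# Made-up per-token losses for two items with four endings each (illustration only).
toy_examples = [
    ([[3.1, 2.9], [2.0, 2.2, 2.1], [3.5], [2.8, 3.0]], 1),  # model picks 1: correct
    ([[2.4, 2.6], [2.9, 3.1], [2.6, 2.8], [3.3]], 2),       # model picks 0: wrong
]
print(accuracy(toy_examples))  # 50.0
```

Since each item has four endings, random guessing gives about 25 under this scheme.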

Intended Use

Instruction-style prompting
Basic question answering
Text generation and summarization
Lightweight assistant-style tasks (English)

Limitations

Small model size limits reasoning and factual reliability
May produce incorrect or inconsistent answers
Instruction-following quality depends strongly on prompt format
Not suitable for high-stakes or safety-critical use

License: MIT

This model has not been safety-aligned. Please apply your own moderation and guardrails when deploying it ;)

For additional information, see info.txt (includes input and output examples).

