This model is a 156M-parameter English-language causal language model trained on a large-scale text corpus and instruction-tuned for general question answering and task completion.
to run test (simple question answer session on fine-tuned version)
- download model weights from huggingface (https://huggingface.co/firdavsus/LLM-D2)
- place weights in fine_tuned_model/ folder
- run "python3 test.py"
- just type a queation (if you want to exit type 'stop')
train.py is a training script fine_tune.py is a fine tuning script eval.py is a scipt to evaluate raw model using loss on hellaswag
- Model size: 156M parameters
- Architecture: Transformer (causal LM)
- Tokenizer: GPT-2 tokenizer
- Languages: English only
- Dataset: The Pile (10B token subset)
- Domain: mixed-domain text (web, books, articles, code, etc.)
- Dataset: Alpaca (cleaned subset)
- Size: ~50,000 instruction–response examples
- Formatting: instruction-style prompt/response pairs
- Steps: 218,000
- Final training loss: 2.6
- Steps: 2,500
- Final training loss: 1.9
| Benchmark | Score |
|---|---|
| HellaSwag | 28.5 |
- Instruction-style prompting
- Basic question answering
- Text generation and summarization
- Lightweight assistant-style tasks (English)
- Small model size limits reasoning and factual reliability
- May produce incorrect or inconsistent answers
- Instruction-following quality depends strongly on prompt format
- Not suitable for high-stakes or safety-critical use
This model has not been safety-aligned. Please apply your own moderation and guardrails when deploying it ;)
FOR ADDITIONAL INFO CHEKC INFO.TXT (with input and ouput examples)

