Poetron - AI Haiku Generator

A Python-based haiku generator using GPT-Neo-1.3B (run locally on your machine) with intelligent fallback to rule-based generation. It creates authentic 5-7-5 haikus with automatic validation and quality reporting. Everything is 100% local: no external API calls or cloud services are required, since the trained model runs on your CPU. CPU inference was chosen deliberately so the project works for people who don't have a GPU.

Quick Start

First Time Setup

python setup_and_run.py

This one-time setup will:

  1. Check your Python version (3.8+ required)
  2. Install all dependencies automatically
  3. Download GPT-Neo-1.3B model (~5GB, runs locally on your CPU)
  4. Launch the interactive haiku generator
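
Conceptually, the setup script does something like the following (a simplified sketch, not the actual contents of setup_and_run.py):

```python
import subprocess, sys

# 1. Check Python version
assert sys.version_info >= (3, 8), "Python 3.8+ is required"

# 2. Install dependencies
subprocess.check_call([sys.executable, "-m", "pip", "install", "-r", "requirements.txt"])

# 3. Trigger the one-time ~5GB model download (cached in ~/.cache/huggingface/)
from transformers import AutoModelForCausalLM, AutoTokenizer
AutoTokenizer.from_pretrained("EleutherAI/gpt-neo-1.3B")
AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-neo-1.3B")
```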

Subsequent Uses

After the first setup, you can run directly:

python interactive_haiku.py

This skips the dependency check and model download, launching immediately.

Features

  • 100% Local Execution: All AI inference runs on your machine - no API keys, no cloud services, no external calls
  • Dual-Generator System: AI-powered GPT-Neo (local) with rules-based fallback
  • Automatic Validation: Checks 5-7-5 syllable patterns
  • Quality Reporting: Shows both AI output and corrected alternatives
  • Interactive Interface: Easy-to-use command-line interface
  • Save to File: Export your haikus to text files
  • Adjustable Creativity: Control generation temperature (0.1-1.0)
  • Privacy Focused: Your topics and haikus never leave your computer

Example Output

Enter haiku topic (or 'quit' to exit): ocean

Haiku 1:
--------------------------------------------------
AI Model Output (Invalid 5-7-5):
The ocean waves crash on shore
in the dark of night
stars shimmer above the sea

--------------------------------------------------
Rules-Based Alternative (Valid 5-7-5):
Silent ocean waits
watching waves drift in spring rain
grace fills the shore
--------------------------------------------------

How It Works

Dual-Generator Architecture

Poetron uses a two-tier generation system:

1. Primary: GPT-Neo-1.3B (Local)

  • Model: EleutherAI/gpt-neo-1.3B (1.3 billion parameters)
  • Execution: Runs locally on your CPU with no external API calls
  • Method: Few-shot prompting with haiku examples
  • Download Size: ~5GB (downloaded once, cached locally)
  • Privacy: All generation happens on your machine
  • Output: Creative, varied haikus

Technical Details:

  • Downloaded from HuggingFace Hub on first run (cached in ~/.cache/huggingface/)
  • Uses causal language modeling with custom prompts
  • Few-shot examples guide the model toward haiku structure
  • Generates up to 60 new tokens per haiku
  • Parameters: temperature=0.1-1.0, top_p=0.9, top_k=40
  • Pure PyTorch inference - no API calls, no subscriptions, no usage limits
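
To make the few-shot setup concrete, here is a minimal sketch of local generation with the transformers library; the prompt text and function name are illustrative, not Poetron's exact code:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "EleutherAI/gpt-neo-1.3B"
tokenizer = AutoTokenizer.from_pretrained(MODEL)     # cached locally after first download
model = AutoModelForCausalLM.from_pretrained(MODEL)  # CPU inference, FP32

def generate_haiku(topic, temperature=0.8):
    # Few-shot prompt: a worked example nudges the model toward haiku form
    prompt = (
        "Write a haiku about the topic.\n"
        "Topic: autumn\nHaiku:\ncrisp leaves drift and fall\n"
        "golden light on quiet paths\nthe year exhales slow\n\n"
        f"Topic: {topic}\nHaiku:\n"
    )
    inputs = tokenizer(prompt, return_tensors="pt")
    out = model.generate(
        **inputs,
        max_new_tokens=60, do_sample=True,
        temperature=temperature, top_p=0.9, top_k=40,
        repetition_penalty=1.2, no_repeat_ngram_size=2,
        pad_token_id=tokenizer.eos_token_id,
    )
    new_tokens = out[0][inputs["input_ids"].shape[1]:]  # keep only the continuation
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip().split("\n\n")[0]
```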

2. Fallback: Rules-Based Generator

  • Method: Template-based with curated word banks
  • Validation: Always produces valid 5-7-5 structure
  • Features: Grammar-aware, theme detection, typo correction
  • Speed: Instant generation

Technical Details:

  • 6 themed word banks (spring, summer, autumn, winter, night, water)
  • Automatic verb conjugation (singular/plural agreement)
  • Pronoun selection (subjective/objective forms)
  • Syllable-counted templates with 3 variations per line type
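
A toy sketch of the template-plus-word-bank idea (these word lists and templates are invented for illustration; they are not Poetron's actual banks):

```python
import random

# Illustrative word bank; the real banks cover 6 themes and are grammar-aware
WORD_BANK = {
    "adj":  ["silent", "gentle"],    # 2 syllables each
    "noun": ["ocean", "meadow"],     # 2 syllables each
    "verb": ["drifts", "sleeps"],    # 1 syllable each
}

# Templates whose slots are syllable-budgeted so every line totals 5 or 7
def rules_based_haiku(bank=WORD_BANK):
    pick = lambda k: random.choice(bank[k])
    return "\n".join([
        f"{pick('adj')} {pick('noun')} waits",     # 2 + 2 + 1 = 5
        "watching waves drift in spring rain",     # fixed 7-syllable line
        f"{pick('noun')} {pick('verb')} at dawn",  # 2 + 1 + 1 + 1 = 5
    ])
```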

Validation System

When GPT-Neo generates a haiku, Poetron:

  1. Extracts Content: Removes instruction artifacts and formatting
  2. Counts Syllables: Uses heuristic-based counter with 50+ exception words
  3. Validates Structure: Checks for exact 5-7-5 pattern
  4. Reports Results:
    • If valid: Shows AI output only
    • If invalid: Shows both AI output + rules-based alternative
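
The structural check itself is compact; a sketch, assuming the `count_syllables` helper shown under Technical Specifications below:

```python
def validate_haiku(haiku, pattern=(5, 7, 5)):
    # Count syllables per non-empty line and compare to the target pattern
    lines = [l.strip() for l in haiku.splitlines() if l.strip()]
    counts = [sum(count_syllables(w) for w in line.split()) for line in lines]
    return counts == list(pattern), counts
```

When the check fails, the report shows the AI output with its actual counts alongside the rules-based alternative.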

Syllable Counter Features:

  • Exception dictionary for common words (e.g., 'beautiful': 3, 'ocean': 2)
  • Silent 'e' detection
  • -ed ending handling
  • Punctuation stripping

Why Both Generators?

GPT-Neo Strengths:

  • Creative and varied output
  • Natural language flow
  • Contextually rich imagery
  • Unpredictable (interesting) results

GPT-Neo Limitations:

  • Not trained specifically on poetry
  • Often produces invalid syllable counts
  • May generate 6-7-7 or 8-5-8 patterns
  • No guaranteed haiku structure

Rules-Based Strengths:

  • Always valid 5-7-5 structure
  • Perfect grammar and agreement
  • Instant generation
  • Predictable quality

Rules-Based Limitations:

  • Limited vocabulary (word banks)
  • Templated structure
  • Less creative variety
  • Formulaic output

Manual Installation

If you prefer to install manually or want to skip the setup script:

# 1. Install dependencies
pip install -r requirements.txt

# 2. Run the generator directly
python interactive_haiku.py

Note: After first-time setup with setup_and_run.py, you can always run python interactive_haiku.py directly. The model will be cached and dependencies already installed.


## System Requirements

- **Python**: 3.8 or higher
- **RAM**: 8GB+ recommended for GPT-Neo
- **Storage**: 6GB free space (for model cache)
- **Internet**: Required only for first-time model download (then fully offline)
- **No API Keys**: Everything runs locally on your machine

## Dependencies

```
torch>=2.1.0            # PyTorch for model inference
transformers>=4.44.0    # Hugging Face transformers library
tokenizers>=0.19.0      # Fast tokenization
huggingface_hub         # Model downloading
click>=8.0.0            # CLI utilities
```


## Troubleshooting

### "Model download is slow" (downloads ~5GB from HuggingFace)
- Model is cached in `~/.cache/huggingface/` after first run
- Subsequent runs load instantly from cache - no re-download
- After first setup, use `python interactive_haiku.py` to skip setup checks

### "Out of memory error"
- GPT-Neo requires ~4GB RAM during inference
- Close other applications to free memory
- On low-RAM systems, choose option 1 (rule-based generator) - it's instant and uses minimal RAM

### "Invalid syllable counts"
- This is normal! GPT-Neo isn't trained specifically on poetry
- The system shows you both versions automatically
- Use the rules-based alternative if you need strict 5-7-5

### "Import errors"
- Run: `pip install --upgrade -r requirements.txt`
- Ensure Python 3.8+ with: `python --version`

### "Can I use this offline?"
- Yes! After first model download, everything runs offline
- No internet connection needed after initial setup

## Technical Specifications

### GPT-Neo-1.3B Configuration
- **Execution**: 100% local - no API calls
- **Source**: EleutherAI via HuggingFace Hub
- **Cache Location**: `~/.cache/huggingface/hub/`
- **Architecture**: Transformer decoder (causal LM)
- **Parameters**: 1.3 billion
- **Context Window**: 2048 tokens
- **Vocabulary**: 50,257 tokens
- **Precision**: FP32 (CPU inference)

### Generation Parameters
```python
{
    'max_new_tokens': 60,
    'temperature': 0.1-1.0,    # User adjustable
    'top_p': 0.9,
    'top_k': 40,
    'do_sample': True,
    'repetition_penalty': 1.2,
    'no_repeat_ngram_size': 2
}
```

### Syllable Counter Algorithm

```python
import re
EXCEPTIONS = {'beautiful': 3, 'ocean': 2}  # sample of the 50+ exception words

def count_syllables(word):
    word = word.lower().strip(".,!?;:'\"")           # strip punctuation
    if word in EXCEPTIONS:                           # 1. Check exception dictionary
        return EXCEPTIONS[word]
    count = len(re.findall(r'[aeiouy]+', word))      # 2. Count vowel groups
    if word.endswith('e') and not word.endswith('le'):
        count -= 1                                   # 3. Adjust for silent 'e'
    if word.endswith('ed') and count > 1:
        count -= 1                                   # 4. Handle -ed endings
    return max(1, count)                             # 5. Return max(1, count)
```

## Example Session

### First Time

$ python setup_and_run.py
# Setup runs, model downloads, generator starts
Enter haiku topic: mountain
Number of haikus [1]: 1
Creativity (0.1-1.0) [0.8]: 0.8

### After Setup (Quick Launch)

$ python interactive_haiku.py
# Loads instantly from cache
Enter haiku topic: ocean
Number of haikus [1]: 2

Batch Generation

Number of haikus: 3   # Generate multiple variations
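
Under the hood this is presumably just repeated sampling; a sketch using the hypothetical generate_haiku function from the earlier example:

```python
# Hypothetical batch loop: one independently sampled haiku per request
for i in range(3):
    print(f"Haiku {i + 1}:\n{generate_haiku('ocean', temperature=0.8)}\n")
```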

Contributing

This project uses:

  • GPT-Neo-1.3B: EleutherAI's open-source language model
  • Transformers: Hugging Face's transformer library
  • PyTorch: Deep learning framework

Acknowledgments

  • EleutherAI: for GPT-Neo-1.3B model
  • Hugging Face: for transformers library
  • PyTorch Team: for the deep learning framework
  • Kaggle: for the dataset as well as the initial cloud training

Version

Current Version: 2.0

  • Dual-generator system with intelligent fallback
  • Automatic syllable validation
  • Grammar-corrected rules-based generator
