Pseudocode to Python Generator - Streamlit App

A Streamlit web application for converting pseudocode to Python code using a fine-tuned GPT-2 model.

Features

  • 🐍 Convert pseudocode to Python code using a fine-tuned GPT-2 model
  • ⚙️ Adjustable generation parameters (temperature, top-p, max length)
  • 🎨 Clean and intuitive user interface
  • 📋 One-click code copying
  • 💻 GPU support (CUDA) when available

Setup

  1. Install dependencies:

    pip install -r requirements.txt
  2. Ensure the model files are in the same directory as app.py:

    • model.safetensors (or pytorch_model.bin)
    • config.json
    • tokenizer_config.json
    • vocab.json
    • merges.txt
    • special_tokens_map.json
    • generation_config.json (optional)
  3. Run the Streamlit app:

    streamlit run app.py

    The app will open in your default web browser at http://localhost:8501
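If the app fails at startup, a quick way to verify step 2 is a small file check. A minimal sketch (the check_model_files helper is illustrative, not part of app.py):

```python
from pathlib import Path

# Files the app expects alongside app.py; the weights may be in either format
REQUIRED = [
    "config.json",
    "tokenizer_config.json",
    "vocab.json",
    "merges.txt",
    "special_tokens_map.json",
]
WEIGHTS = ["model.safetensors", "pytorch_model.bin"]

def check_model_files(model_dir="."):
    """Return a list of missing required files (empty list means all present)."""
    d = Path(model_dir)
    missing = [f for f in REQUIRED if not (d / f).exists()]
    if not any((d / w).exists() for w in WEIGHTS):
        missing.append("model.safetensors (or pytorch_model.bin)")
    return missing
```

Running this from the project directory before `streamlit run app.py` makes missing-file errors obvious up front.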

Usage

  1. Enter your pseudocode in the text area
  2. Adjust generation parameters in the sidebar (optional)
  3. Click "Generate Python Code" button
  4. Copy the generated code using the copy button
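When adjusting the sidebar parameters in step 2, it helps to keep them in ranges that sampling can handle. A sketch of such a clamp (the bounds here are illustrative assumptions, not values taken from app.py):

```python
def clamp_params(temperature, top_p, max_length):
    """Keep generation parameters inside ranges that sampling accepts.

    The bounds below are illustrative defaults, not values from app.py.
    """
    temperature = min(max(temperature, 0.1), 2.0)     # avoid 0 (degenerate sampling)
    top_p = min(max(top_p, 0.0), 1.0)                 # nucleus probability mass is in [0, 1]
    max_length = int(min(max(max_length, 16), 1024))  # GPT-2's context limit is 1024 tokens
    return temperature, top_p, max_length
```

Lower temperatures give more deterministic code; higher values and larger top-p produce more varied (and riskier) outputs.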

Example Pseudocode Inputs

  • create integer variable x
  • read input from user
  • if x greater than 5 print yes
  • for i from 0 to 10 print i
  • create list numbers

Model Information

  • Base Model: GPT-2 Small
  • Training Dataset: SPOC (Pseudocode to Code)
  • Task: Pseudocode → Python Code Generation
  • Architecture: GPT2LMHeadModel (12 layers, 768 hidden size)
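The figures above pin down the parameter count. As a back-of-the-envelope sketch, the total can be derived by hand (GPT-2 ties the output head to the token embeddings, so the head adds no extra weights):

```python
def gpt2_small_param_count(n_layer=12, n_embd=768, n_ctx=1024, vocab=50257):
    """Approximate parameter count for GPT2LMHeadModel (GPT-2 small)."""
    emb = vocab * n_embd + n_ctx * n_embd    # token + position embeddings
    attn = n_embd * 3 * n_embd + 3 * n_embd  # fused q/k/v projection (c_attn)
    attn += n_embd * n_embd + n_embd         # attention output projection (c_proj)
    mlp = n_embd * 4 * n_embd + 4 * n_embd   # feed-forward up-projection
    mlp += 4 * n_embd * n_embd + n_embd      # feed-forward down-projection
    ln = 2 * (2 * n_embd)                    # two layer norms per block
    return emb + n_layer * (attn + mlp + ln) + 2 * n_embd  # + final layer norm
```

This comes out to roughly 124M parameters, the commonly cited size of GPT-2 small.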

Requirements

  • Python 3.8+
  • PyTorch
  • Transformers library
  • Streamlit

Deployment to Streamlit Cloud

Step 1: Prepare Your Repository

  1. Upload all your files to a GitHub repository:
    • app.py
    • requirements.txt
    • All model files (model.safetensors, config.json, tokenizer_config.json, vocab.json, merges.txt, special_tokens_map.json)
    • .streamlit/config.toml (optional, for custom theming)

Step 2: Deploy on Streamlit Cloud

  1. Go to share.streamlit.io
  2. Sign in with your GitHub account
  3. Click "New app"
  4. Select your repository and branch
  5. Set the main file path to: app.py
  6. Click "Deploy"

Important Notes for Streamlit Cloud:

  • Git LFS Configuration: The repository includes .lfsconfig to ensure Git LFS uses HTTPS (not SSH) for downloads
  • Model Size: Model files are stored with Git LFS. Streamlit Cloud will automatically handle the download
  • Memory Limits: Streamlit Cloud has memory limits, so ensure your model fits within the constraints
  • Startup Time: First load may take a few minutes as Streamlit Cloud installs dependencies and loads your model
  • CPU Only: Streamlit Cloud runs on CPU, so GPU optimizations won't apply
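To gauge the memory-limit note above, a rough estimate of what the weights alone need at load time (a sketch; real usage is higher once PyTorch, Streamlit, and intermediate activations are counted):

```python
def fp32_memory_mb(n_params):
    """Approximate RAM for model weights stored in float32 (4 bytes per parameter)."""
    return n_params * 4 / 2**20

# GPT-2 small has ~124M parameters, so the float32 weights
# alone take on the order of half a gigabyte.
```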

If Git LFS Issues Persist:

If you see "Permission denied (publickey)" errors:

  1. Make sure the repository is public (recommended for Streamlit Cloud)
  2. Confirm the .lfsconfig file is committed to the repository
  3. Reboot the app from the Streamlit Cloud dashboard

Alternative: Using Cloud Storage for Large Models

If your model is too large for GitHub, modify app.py to download the weights from cloud storage instead:

# Add this to the load_model() function if needed
import requests

def download_model_from_url(url, local_path):
    """Stream a large file to disk without holding it all in memory."""
    response = requests.get(url, stream=True)
    response.raise_for_status()  # fail fast on a bad URL or permissions error
    with open(local_path, 'wb') as f:
        for chunk in response.iter_content(chunk_size=8192):
            f.write(chunk)

Local Development

If you want to test locally before deploying:

  1. Install dependencies:

    pip install -r requirements.txt
  2. Run the app:

    streamlit run app.py

Notes

  • The model automatically uses GPU if CUDA is available (local only; Streamlit Cloud uses CPU)
  • Generation parameters can be adjusted in the sidebar for different outputs
  • The model expects pseudocode in natural language format
