Wobble – RL Wordle™ Solver

Wobble is a Reinforcement Learning project that trains an agent to play Wordle™. It includes:

- Backend: Q-learning agent + FastAPI REST API
- Frontend: Astro-based web interface for interacting with the bot

Project Structure

wobble
│
├── backend/                - Environment, Q-learning agent, API server
│  └── app/
│      ├── data/            - Word lists used for training and guessing
│      └── models/          - Saved Q-learning models (Q-table)
│
└── frontend/               - Astro-based web interface
    └── src
        ├── components      - Reusable UI components
        ├── layouts         - Shared page layouts
        ├── pages           - Page-level views
        └── styles          - Global and component styles

How It Works

Wobble is powered by a reinforcement learning agent trained to solve Wordle-like puzzles using a simplified Q-learning algorithm.

Backend – Q-Learning Agent + API

The backend trains a Q-learning agent to guess 5-letter words based on feedback similar to Wordle™.
The agent interacts with a custom environment (WordleEnv) where:
- State: A tuple (greens, yellows) summarizing feedback from the previous guess: the number of letters in the correct position (greens) and the number of correct letters in the wrong position (yellows).
- Actions: A set of predefined word-picking strategies (e.g. frequency-based, position-based, constraint-filtered). The agent chooses a strategy, and the strategy picks the word.
- Reward: Points for greens and yellows in a guess, a bonus for solving the word, and a penalty for failing to solve it within 6 attempts.
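
As a rough illustration of the state and reward described above, the sketch below counts greens and yellows for a guess and combines them into a shaped reward. The function names and all reward weights are hypothetical and chosen only for the example; the values used in WordleEnv may differ.

```python
from collections import Counter


def score_guess(secret: str, guess: str) -> tuple[int, int]:
    """Return (greens, yellows) for a guess against the secret word."""
    # Greens: same letter in the same position.
    greens = sum(s == g for s, g in zip(secret, guess))
    # Letters left over once the green positions are set aside.
    secret_rest = Counter(s for s, g in zip(secret, guess) if s != g)
    guess_rest = Counter(g for s, g in zip(secret, guess) if s != g)
    # Yellows: overlap of the leftovers, which handles duplicate letters.
    yellows = sum((secret_rest & guess_rest).values())
    return greens, yellows


def shaped_reward(greens: int, yellows: int, solved: bool, failed: bool) -> float:
    """Hypothetical weights: 1 per green, 0.5 per yellow, +10 solve, -5 fail."""
    value = 1.0 * greens + 0.5 * yellows
    if solved:
        value += 10.0
    if failed:
        value -= 5.0
    return value
```

Because the state keeps only the two counts, many different boards map to the same state, which keeps the Q-table small at the cost of discarding which letters matched.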

Training Process

Training runs as a loop of simulated Wordle games (episodes):

Initialize – Load word lists, set up the environment (WordleEnv), initialize strategies, and start with an empty Q-table.
Episode Loop – For each game:
- Reset the environment with a new secret word.
- At each step, the agent:
  - Observes feedback (greens, yellows).
  - Chooses an action (a random one with probability ε to explore, otherwise the best-known action to exploit).
  - Picks a guess word using the chosen strategy.
  - Submits the guess to the environment.
Reward + Update – The environment returns feedback and a reward. The Q-value for the current state–action pair is updated using the Q-learning rule:

$$ Q(s, a) \leftarrow Q(s, a) + \alpha \cdot \Big( r + \gamma \cdot \max_{a'} Q(s', a') - Q(s, a) \Big) $$
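
The action choice and the update rule above can be sketched as follows. This is a minimal tabular version under stated assumptions: the Q-table is a plain dict keyed by the (greens, yellows) state, `N_ACTIONS`, `alpha` and `gamma` are placeholder values, and the future term is treated as zero on the final step of a game.

```python
import random

N_ACTIONS = 3  # hypothetical: one index per word-picking strategy


def choose_action(q, state, epsilon, rng=random):
    """Epsilon-greedy: random action with probability epsilon, else the best."""
    values = q.setdefault(state, [0.0] * N_ACTIONS)
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    return max(range(N_ACTIONS), key=values.__getitem__)


def q_update(q, state, action, reward, next_state, done, alpha=0.1, gamma=0.9):
    """Apply Q(s,a) <- Q(s,a) + alpha * (r + gamma * max Q(s',a') - Q(s,a))."""
    values = q.setdefault(state, [0.0] * N_ACTIONS)
    # Assumption: no future value is added once the episode has ended.
    next_best = 0.0 if done else max(q.setdefault(next_state, [0.0] * N_ACTIONS))
    td_target = reward + gamma * next_best
    values[action] += alpha * (td_target - values[action])
    return values[action]
```

With `epsilon=0.0` the agent always exploits, which is the behavior one would expect at inference time.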

Exploration Decay – Over episodes, $\varepsilon$ decays so the agent explores less and exploits learned strategies more.
Progress Tracking – Win rate and average rewards are logged every N episodes.
Model Saving – After training completes, the Q-table is saved to app/models/q_table.pkl for later use by the API.
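
The decay and saving steps might look like the sketch below. The multiplicative schedule, the `decay` and `floor` values, and the helper names are assumptions made for illustration; the text above only states that ε decays and that the Q-table is pickled.

```python
import pickle
from pathlib import Path


def decay_epsilon(epsilon, decay=0.995, floor=0.05):
    """Shrink epsilon by a constant factor, never going below the floor."""
    return max(floor, epsilon * decay)


def save_q_table(q, path):
    """Pickle the Q-table, creating the parent directory if needed."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("wb") as f:
        pickle.dump(q, f)


def load_q_table(path):
    """Load a previously saved Q-table."""
    with Path(path).open("rb") as f:
        return pickle.load(f)
```

A plain dict pickles cleanly. Only unpickle files you trust, since loading a pickle can execute arbitrary code.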

API Capabilities

POST /start: Start a new game with an optional secret word.
POST /step: Submit a user's guess and get feedback.
POST /bot-move: Request the bot to make the next guess using the learned Q-values and current constraints.
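
To give a sense of what "current constraints" could mean for /bot-move, here is a small sketch that keeps only the words consistent with every earlier (guess, feedback) pair and then applies a simple frequency-based pick. All names are hypothetical and the logic is an assumption, not the project's actual strategy code.

```python
from collections import Counter


def score_guess(secret, guess):
    """Return (greens, yellows) for a guess against a candidate secret."""
    greens = sum(s == g for s, g in zip(secret, guess))
    secret_rest = Counter(s for s, g in zip(secret, guess) if s != g)
    guess_rest = Counter(g for s, g in zip(secret, guess) if s != g)
    return greens, sum((secret_rest & guess_rest).values())


def filter_candidates(words, history):
    """Keep words that would have produced every observed feedback pair."""
    return [
        w for w in words
        if all(score_guess(w, guess) == feedback for guess, feedback in history)
    ]


def frequency_pick(candidates):
    """Hypothetical strategy: prefer words whose letters are most common."""
    counts = Counter(ch for w in candidates for ch in set(w))
    return max(candidates, key=lambda w: sum(counts[ch] for ch in set(w)))
```

In this sketch `history` is a list such as `[("slate", (2, 0))]`, and the bot would run the strategy selected from the Q-table on the filtered list.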

Frontend – Astro Web Interface

The frontend is built with Astro, offering an interactive Wordle-style UI.
Users can:
- Play the game manually by entering guesses.
- Let the bot take over and observe its strategy in action.
The interface communicates with the FastAPI backend to manage game state and display feedback in real time.

Installation

Requirements:

- Python 3.10+
- Node.js (for the frontend)

Backend Setup

cd backend
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Frontend Setup

cd frontend
npm install
npm run dev

Training the Agent

To train the Q-learning agent from scratch:

cd backend
python app/train.py

This trains the agent on the custom word lists in app/data/ and saves the Q-table to app/models/q_table.pkl.

API Usage (Example)

Start the backend server:

uvicorn app.api:app --reload

Then use an API tool (like Postman or cURL):

Start a game (the secret field is optional):

POST /start
{
  "secret": "crane"
}

Submit a guess:

POST /step
{
  "game_id": "1234",
  "guess": "slate"
}

Bot makes a move:
```
POST /bot-move
{
  "game_id": "1234"
}
```
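
The same requests can also be sent from Python using only the standard library. This is a sketch: it assumes the server is listening on uvicorn's default address (127.0.0.1:8000), and the response fields are not documented here, so they are returned undecoded beyond JSON parsing. The helper names are hypothetical.

```python
import json
from urllib import request

BASE_URL = "http://127.0.0.1:8000"  # assumption: uvicorn's default host and port


def build_post(path, payload, base_url=BASE_URL):
    """Build a JSON POST request for one of the API endpoints."""
    body = json.dumps(payload).encode("utf-8")
    return request.Request(
        base_url + path,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call(path, payload):
    """Send the request and parse the JSON reply (requires a running server)."""
    with request.urlopen(build_post(path, payload)) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage once the backend is up:
#   call("/start", {"secret": "crane"})
#   call("/step", {"game_id": "1234", "guess": "slate"})
#   call("/bot-move", {"game_id": "1234"})
```

The "1234" game ID is the placeholder from the examples above; in practice, use whatever identifier the API gives you for the game.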

Note:

No official Wordle™ code, data, or other resources are used.
All training is done locally with custom word lists and environments.

Wordle is a trademark of The New York Times Company. This project is not affiliated with or endorsed by The New York Times Company.
