GitHub Issue Analyzer

A production-ready FastAPI application that asynchronously ingests GitHub issues from public repositories, caches them in a local SQLite database, and uses OpenAI's GPT models to derive natural-language insights from repository data.

πŸš€ Features

  • GitHub Integration: Asynchronously fetches and paginates through GitHub issues using httpx.
  • Intelligent Analysis: Summarizes and analyzes issue context using OpenAI's gpt-4o-mini.
  • Local Caching: Stores issues in an SQLite database using SQLAlchemy to minimize API calls and improve performance.
  • Production-Grade API: Built with FastAPI, featuring structured logging, telemetry middleware, and Pydantic validation.
  • Code Quality: Enforced via ruff for linting/formatting and mypy for static type checking.
  • Containerization: Fully Dockerized for easy deployment.
  • Developer Experience: Includes a Makefile for automating common tasks.
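
The GitHub integration follows GitHub's standard Link-header pagination. A minimal, stdlib-only sketch of how the "next page" URL can be extracted (the function name is illustrative, not the project's actual API):

```python
def parse_next_link(link_header):
    """Extract the rel="next" URL from a GitHub `Link` header, if present.

    GitHub returns headers like:
        <https://api.github.com/...?page=2>; rel="next", <...?page=5>; rel="last"
    """
    if not link_header:
        return None
    for part in link_header.split(","):
        pieces = part.split(";")
        if len(pieces) < 2:
            continue
        url = pieces[0].strip().strip("<>")
        # A link may carry several parameters; we only care about rel="next".
        if any(p.strip() == 'rel="next"' for p in pieces[1:]):
            return url
    return None  # no further pages
```

A fetch loop would keep requesting `parse_next_link(response.headers.get("Link"))` until it returns `None`.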

πŸ› οΈ Tech Stack

  • Framework: FastAPI, Uvicorn
  • Database: SQLAlchemy, SQLite
  • Async HTTP: HTTPX
  • LLM Integration: OpenAI API
  • Validation: Pydantic v2
  • Testing: Pytest, Pytest-Asyncio, Pytest-Cov
  • Linting & Typing: Ruff, Mypy
  • Infrastructure: Docker

πŸ—οΈ Architecture & Design Decisions

  • Modular Design: Structured into clear layers (Routers, Services, Models) to separate concerns, ensuring scalability and maintainability.
  • Asynchronous Processing: Uses httpx for non-blocking I/O, allowing the server to handle concurrent requests efficiently while waiting for GitHub API responses.
  • Local Caching Strategy (SQLite):
    • Reasoning: Chosen for its zero-configuration, serverless architecture which simplifies local development and testing.
    • Benefit: It acts as a reliable persistence layer without the overhead of spinning up a separate Docker container for Postgres/MySQL.
    • Trade-off: While excellent for this standalone service, a distributed system would require migrating to a client-server DB (like Postgres) for handling multiple writer instances.
  • LLM Optimization: Applies intelligent truncation to issue bodies to fit within context windows while preserving key information, balancing cost and analysis quality.
  • Observability: Features custom telemetry middleware to log request metrics (duration, status), enabling performance monitoring and debugging.
  • Project Tooling: Utilizes a Makefile and strict linting (ruff, mypy) to enforce a standardized, production-grade development workflow.
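
The telemetry middleware described above can be sketched as a small framework-agnostic ASGI wrapper; the class name and log format here are illustrative, not the project's actual implementation:

```python
import time

class TimingMiddleware:
    """Minimal ASGI middleware that records request duration and status code."""

    def __init__(self, app, log=print):
        self.app = app
        self.log = log

    async def __call__(self, scope, receive, send):
        if scope["type"] != "http":
            await self.app(scope, receive, send)
            return
        start = time.perf_counter()
        status = {"code": None}

        async def send_wrapper(message):
            # The status code arrives on the http.response.start message.
            if message["type"] == "http.response.start":
                status["code"] = message["status"]
            await send(message)

        await self.app(scope, receive, send_wrapper)
        duration_ms = (time.perf_counter() - start) * 1000
        self.log(f"{scope['method']} {scope['path']} -> {status['code']} in {duration_ms:.1f}ms")
```

In FastAPI, this style of middleware is typically registered with `app.add_middleware(...)` or an `@app.middleware("http")` handler.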

πŸ“‹ Prerequisites

  • Python 3.10+
  • Docker (optional)
  • OpenAI API Key

βš™οΈ Configuration

  1. Clone the repository:

    git clone https://github.com/sk31Dev/github_issue_analyzer.git
    cd github_issue_analyzer
  2. Environment Setup: Copy the example environment file (if available) or create a .env file in the root directory:

    touch .env

    Add the following variables to .env:

    OPENAI_API_KEY=your_openai_api_key_here
    DATABASE_URL=sqlite:///./issues.db
    GITHUB_API_URL=https://api.github.com
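
The application reads these variables at startup. A stdlib-only illustration of loading them with the defaults shown above (the real project may well use Pydantic settings instead; this sketch only mirrors the variable names from the .env example):

```python
import os

def load_settings() -> dict:
    """Read configuration from the environment, applying the defaults
    shown in the .env example for the optional values."""
    api_key = os.environ.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY must be set")
    return {
        "openai_api_key": api_key,
        "database_url": os.environ.get("DATABASE_URL", "sqlite:///./issues.db"),
        "github_api_url": os.environ.get("GITHUB_API_URL", "https://api.github.com"),
    }
```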

πŸ“¦ Installation & Running Locally

This project uses a Makefile to simplify development workflows.

  1. Create and Activate a Virtual Environment (Recommended):

    python -m venv venv

    # Windows (Git Bash)
    source venv/Scripts/activate
    # Windows (PowerShell)
    .\venv\Scripts\Activate
    # Mac/Linux
    source venv/bin/activate
  2. Install Dependencies: Installs both production and development dependencies.

    make install

    Alternatively: pip install -r requirements.txt && pip install -r requirements-dev.txt

  3. Run the Application: Starts the FastAPI server with hot-reload enabled.

    make run

    Alternatively: python -m app.main

    The API will be available at http://localhost:8000.
    API Documentation (Swagger UI): http://localhost:8000/docs

πŸ§ͺ Development

Running Tests

Run the test suite with coverage reporting:

make test

Linting & Formatting

Ensure code quality before committing:

make lint    # Checks for linting errors and type issues
make format  # Auto-formats code using Ruff

🐳 Docker Support

Build and run the application as a container.

  1. Build the Image:

    make docker-build
  2. Run the Container: Runs the container on port 8000 using your local .env file.

    make docker-run

πŸ”Œ API Endpoints

1. Scan Repository

Fetches open issues from a public GitHub repository and caches them. In this example, we scan the OpenAI Python SDK repository.

  • URL: /scan

  • Method: POST

  • Body:

    {
      "repo": "openai/openai-python"
    }
  • Response:

    {
      "repo": "openai/openai-python",
      "issues_fetched": 287,
      "cached_successfully": true
    }
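
The /scan endpoint can also be exercised from a short script rather than Swagger UI. A stdlib-only client sketch, assuming the server is running locally on port 8000 (helper names are illustrative):

```python
import json
import urllib.request

def build_scan_request(repo, base_url="http://localhost:8000"):
    """Build a POST /scan request carrying a JSON body like the example above."""
    payload = json.dumps({"repo": repo}).encode()
    return urllib.request.Request(
        f"{base_url}/scan",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def scan_repo(repo):
    """Send the request and return the parsed JSON response."""
    with urllib.request.urlopen(build_scan_request(repo)) as resp:
        return json.load(resp)
```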

2. Analyze Issues

Sends cached issues to the LLM for summarization or analysis. Here we ask about connection-timeout themes in the httpx library.

  • URL: /analyze

  • Method: POST

  • Body:

    {
      "repo": "encode/httpx",
      "prompt": "Identify common themes related to connection timeouts in the last 50 issues."
    }
  • Response:

    {
      "analysis": "Based on the recent issues, users are frequently experiencing connection timeouts when using proxies. Key themes include:\n\n1. **Proxy Authentication**: Several reports indicate timeouts specifically when digest auth is enabled with proxies.\n2. **Keep-Alive defaults**: Users migrating from requests are encountering changes in default keep-alive behavior causing hanging connections.\n3. **Async Contexts**: Improper closure of async contexts leading to pool exhaustion."
    }
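
Before issues reach the model, the service truncates overlong bodies to fit the context window (see Architecture above). One common approach, shown here as an illustrative sketch rather than the project's exact logic, keeps the head and tail of the text:

```python
def truncate_middle(text, max_chars, marker="\n...[truncated]...\n"):
    """Trim the middle of overlong text, keeping the opening and closing
    portions, which usually carry the problem statement and resolution."""
    if len(text) <= max_chars:
        return text
    keep = max_chars - len(marker)
    head = keep // 2
    tail = keep - head
    return text[:head] + marker + text[-tail:]
```

Keeping both ends tends to preserve the issue's problem statement and any resolution notes, while the dropped middle is often long logs or back-and-forth discussion.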

πŸ“‚ Project Structure

github_issue_analyzer/
β”œβ”€β”€ app/
β”‚   β”œβ”€β”€ routers/       # API endpoints
β”‚   β”œβ”€β”€ services/      # Business logic (GitHub, LLM)
β”‚   β”œβ”€β”€ utils/         # Utilities (Logging)
β”‚   β”œβ”€β”€ main.py        # App entry point & middleware
β”‚   β”œβ”€β”€ models.py      # Database models
β”‚   β”œβ”€β”€ schemas.py     # Pydantic models
β”‚   └── database.py    # DB Setup
β”œβ”€β”€ tests/             # Pytest suite
β”œβ”€β”€ .env               # Environment variables (GitIgnored)
β”œβ”€β”€ .gitignore         # Ignored files
β”œβ”€β”€ Dockerfile         # Docker configuration
β”œβ”€β”€ Makefile           # Task automation
β”œβ”€β”€ pyproject.toml     # Tool configuration (Ruff, Mypy)
└── requirements.txt   # Production dependencies

πŸ€– Development Prompts (LLM Usage)

This project was developed using an iterative prompting strategy to simulate a pair-programming environment with an AI assistant. The development process involved initial architectural planning with Gemini 3 Pro (Web) followed by implementation and refinement using Gemini 3 Pro on GitHub Copilot.

Why Gemini 3 Pro (Web)? I chose to use the Gemini web interface for the high-level planning phase because the GitHub Copilot integration (Gemini 3 Pro) is currently in preview. I wanted to ensure the foundational architectural decisions were made using the most stable and feature-rich version of the model available.

Phase 1: Planning & Architecture (Gemini 3 Pro web)

  1. Initial Strategy & Tech Stack Selection: "I started by sharing the requirement spec and asked for a recommendation on the best language and framework..."

    Prompt: "I am preparing for an interview assignment [Attached Requirement Spec]. First, suggest the most suitable language and framework for this task. List all options with their pros and cons, and explain the reasoning behind the final choice. For every design or technical decision, provide a clear justification."

  2. Project Scaffolding & Initial Code: "After agreeing on the stack, I asked for the project structure and the initial code skeleton..."

    Prompt: "I agree with the proposed stack and will use an OpenAI API key. Please generate the code, but first define the project structure. Ensure you follow Python best practices and coding standards, including SOLID principles, clean code, and performance optimizations."

  3. Demo Planning: "To prepare for the demo, I asked for suggestions on open-source repositories to scan..."

    Prompt: "Suggest an example repository I can use to demo this project. Also, how much credit should I add to my OpenAI account for testing and the demo?"

  4. Testing Strategy: "I asked about the most appropriate unit tests for this kind of application..."

    Prompt: "What types of unit tests should I generate for this project? Provide step-by-step instructions for adding these unit tests."

Phase 2: Implementation & Refinement (GitHub Copilot)

  1. Scaffolding & Implementation: "I instructed Copilot to implement the folder structure and code skeleton provided by Gemini..."

    Prompt: "Create the following structure [github_issue_analyzer tree] with app/ (routers, services, models), tests/, and configuration files."

  2. Dependency Management (Production vs. Dev): "I asked to generate separate requirement files for production and development..."

    Prompt: "Generate a requirements.txt for production dependencies and a separate requirements-dev.txt for development tools, ensuring the dev file imports the main one."

  3. Core Logic Refinement: "I took the core logic provided by Gemini for github_service.py and llm_service.py and asked Copilot to refine it..."

    Prompt: "Implement github_service.py to fetch issues using httpx and cache them in SQLite. Ensure pagination is handled. Then implement llm_service.py to read from the DB, truncate content, and send to OpenAI."

  4. Feature Implementation (Telemetry): "I needed to track performance, so I asked to implement logging middleware..."

    Prompt: "Implement logging middleware in main.py. Then create a corresponding test in tests/test_logging.py to verify that logs are captured correctly."

  5. Automation (Makefile): "To automate the workflow, I asked for a Makefile..."

    Prompt: "Create a Makefile including targets for install, run, test, lint, and docker-build. Also, create a pyproject.toml to configure generic linting settings with Ruff."

  6. Code Quality & Refactoring: "I ran MyPy and fed the errors back to Copilot..."

    Prompt: "Run MyPy on the codebase, analyze the type errors, and recursively apply fixes to all files until valid."

  7. Requirement Verification: "I requested a formal review of the implemented code against the original assignment spec..."

    Prompt: "Review the current codebase against the provided assignment requirements. Verify that all functional requirements and edge cases (like 'repo not found') are strictly met."

  8. Troubleshooting: "When I ran into a 'pytest not found' error, I asked for help debugging it..."

    Prompt: "I am encountering the error: 'bash: pytest: command not found'. Also, how can I add test statistics, such as the number of tests executed and a coverage report?"

  9. Documentation: "Finally, I asked for a professional README..."

    Prompt: "Generate a detailed, professional README for this project. Populate the API documentation with real-world usage examples for both endpoints, using repositories like openai/openai-python and httpx."

Phase 3: System Prompt Engineering (GitHub Copilot)

  1. System Prompt Refinement: "I iterated on the system prompt in llm_service.py, specifically asking to make the persona more 'expert-level'..."

    Prompt: "Refine the system prompt in llm_service.py to be more expert-level. Instruct the model to focus on patterns, be actionable, use evidence (issue IDs), and use strict Markdown formatting. Do not hallucinate issues."

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.
