JobScraper API

A FastAPI microservice that scrapes job listings using JobSpy, persists them in MySQL, and exposes a REST API for querying.
It uses Poetry for dependency and environment management. JobScraper is used with an n8n workflow to automate the job application process.

🚀 Features

- `POST /scrape` → Run job scraping and persist results
- `GET /jobs` → Query stored job postings with filters & pagination
- `GET /jobs/{id}` → Fetch an individual job
- Logging (Loguru)
- Alembic migrations
- MySQL database
- Poetry-based dependency management
- Docker & Docker Compose support
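
As a rough illustration of how a client might query the stored postings, the sketch below builds a `GET /jobs` URL with filters and pagination using only the Python standard library. The query parameter names (`search`, `location`, `page`, `page_size`) and the base URL are assumptions made for this example, not the confirmed API; the Swagger docs of the running service list the real ones.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "http://localhost:8000"  # assumed local dev address


def build_jobs_url(base_url: str = BASE_URL, **filters) -> str:
    """Build a GET /jobs URL, dropping filters whose value is None."""
    # Parameter names are hypothetical; the real API may use different ones.
    params = {key: value for key, value in filters.items() if value is not None}
    query = urlencode(params)
    return f"{base_url}/jobs?{query}" if query else f"{base_url}/jobs"


def fetch_jobs(**filters):
    """Fetch GET /jobs and decode the JSON response (needs a running API)."""
    with urlopen(build_jobs_url(**filters), timeout=10) as response:
        return json.load(response)
```

With the API running locally, a call such as `fetch_jobs(search="python", page=1, page_size=20)` would return the decoded JSON response, provided the service accepts those parameters.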

⚙️ Local Development (with Poetry)

Install Poetry
```
curl -sSL https://install.python-poetry.org | python3 -
export PATH="$HOME/.local/bin:$PATH"
```

Install dependencies
```
poetry install
```
Run migrations
```
poetry run alembic upgrade head
```

Start the API
```
poetry run uvicorn app.main:app --reload
```

Visit docs
- Swagger: http://localhost:8000/docs
- Redoc: http://localhost:8000/redoc

🐳 Docker Deployment

Build the container
```
docker compose build
```
Run
```
docker compose up
```
- API: http://localhost:8000
- MySQL: port 3306 (credentials are read from `.env`)

Apply migrations in the container
```
docker compose exec api poetry run alembic upgrade head
```

🧰 Environment Variables (`.env`)

Example:
```
APP_NAME=JobScraper API
APP_ENV=dev
DB_HOST=db
DB_PORT=3306
DB_USER=jobs
DB_PASSWORD=jobs_pw
DB_NAME=jobsdb
```
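
To show how the `DB_*` variables above could fit together, here is a minimal sketch that assembles a SQLAlchemy-style connection URL from them. The function name `build_database_url`, the `mysql+pymysql` driver prefix, and the fallback defaults are assumptions for illustration; the project's own `config.py` may do this differently.

```python
import os
from urllib.parse import quote_plus


def build_database_url(env=os.environ) -> str:
    """Assemble a SQLAlchemy-style MySQL URL from the DB_* variables."""
    # Fallback values are illustrative assumptions, not project defaults.
    user = env.get("DB_USER", "jobs")
    password = quote_plus(env.get("DB_PASSWORD", ""))  # escape special characters
    host = env.get("DB_HOST", "localhost")
    port = env.get("DB_PORT", "3306")
    name = env.get("DB_NAME", "jobsdb")
    # "mysql+pymysql" is an assumed driver; the project may use another one.
    return f"mysql+pymysql://{user}:{password}@{host}:{port}/{name}"
```

Passing the environment in as an argument keeps the function easy to test with a plain dictionary instead of the real process environment.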

🧠 Common Commands

| Task | Command |
| --- | --- |
| Add new dependency | `poetry add <package>` |
| Add dev dependency | `poetry add --group dev <package>` |
| Remove dependency | `poetry remove <package>` |
| Run migrations | `poetry run alembic upgrade head` |
| Start dev server | `poetry run uvicorn app.main:app --reload` |
| Run tests | `poetry run pytest` |

📦 Project Structure

```
app/
├─ main.py
├─ models.py
├─ crud.py
├─ schemas.py
├─ db.py
├─ config.py
└─ scraper.py
docs/
└─ openapi.yaml
migrations/
├─ env.py
└─ versions/
tests/
└─ test_smoke.py
```

🧩 Alembic Migrations

Alembic is already configured to autogenerate migrations from the models in `app.models`.

Generate a new migration
```
poetry run alembic revision --autogenerate -m "add new columns"
```
Apply migrations
```
poetry run alembic upgrade head
```

✅ Health Check

```
curl http://localhost:8000/health
```
Response:
```
{"status": "ok"}
```
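
The same check can be scripted, for example to wait for the API to come up before running a scrape. The sketch below is one possible approach using only the standard library; the helper names `is_healthy` and `wait_until_healthy`, the retry count, and the delay are assumptions chosen for illustration.

```python
import json
import time
from urllib.error import URLError
from urllib.request import urlopen


def is_healthy(body: str) -> bool:
    """Return True if the response body is JSON with status equal to "ok"."""
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return False
    return isinstance(payload, dict) and payload.get("status") == "ok"


def wait_until_healthy(url="http://localhost:8000/health", attempts=10, delay=1.0) -> bool:
    """Poll the health endpoint until it reports ok or attempts run out."""
    for _ in range(attempts):
        try:
            with urlopen(url, timeout=5) as response:
                if is_healthy(response.read().decode("utf-8")):
                    return True
        except (URLError, OSError):
            pass  # service not reachable yet; retry after the delay
        time.sleep(delay)
    return False
```

Calling `wait_until_healthy()` after `docker compose up` returns `True` once the endpoint responds with the body shown above, and `False` if every attempt fails.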

🧹 Cleaning Up

```
docker compose down -v
```
This removes the containers, networks, and volumes defined in the Compose file, so any database data stored in those volumes is lost.
