πŸ€– LinkedIn AI Agent

Intelligent LinkedIn opportunity analysis and automated response generation powered by AI

Automate your LinkedIn job search with enterprise-grade AI. Analyze recruiter messages, score opportunities, and generate personalized responses - all while you focus on what matters.

Python 3.11+ | FastAPI | DSPy | Docker | License: MIT | Code Coverage


🎯 Quick Start

git clone <repository-url> && cd nexton

# Choose one:
make start-lite  # Lite: backend + frontend + postgres (~500MB RAM)
make start       # Full: all services including Celery, Redis (~2GB RAM)

That's it! Open http://localhost:3000

Version Comparison

| Command | Services | Features | RAM |
|---|---|---|---|
| `make start-lite` | 3 (postgres, backend, frontend) | Manual scraping, no emails | ~500MB |
| `make start` | 8 (+ Redis, Celery, Mailpit, Flower) | Scheduled jobs, emails, monitoring | ~2GB |

Note: Both versions require editing .env with your LinkedIn credentials. The make start* commands auto-create it from .env.example.

For running without Docker, see Lite Version without Docker

Web Dashboard Preview

| Dashboard | Opportunities | Responses |
|---|---|---|
| Stats, charts, scan button | Filter, search, score breakdown | Approve, edit, decline AI responses |


🎨 Visual Overview

πŸ–₯️ System Dashboards

| Grafana Monitoring | Jaeger Tracing | API Documentation |
|---|---|---|
| Track opportunities, pipeline performance | Visualize complete request flows | Test endpoints in the browser |

| Celery Flower | Prometheus | PostgreSQL |
|---|---|---|
| Monitor background jobs | Query custom metrics | Persistent opportunity storage |

🎯 The Problem

Job searching on LinkedIn is time-consuming:

  • 50+ recruiter messages per month that need individual responses
  • Manual analysis of each opportunity (salary, tech stack, company)
  • Context switching between LinkedIn, research, and drafting responses
  • Missed opportunities due to delayed responses
  • Repetitive work that could be automated

πŸ’‘ The Solution

LinkedIn AI Agent is an intelligent automation system that:

  1. πŸ“₯ Scrapes your LinkedIn messages once daily (9 AM)
  2. 🧠 Analyzes each opportunity using AI (DSPy + LLM)
  3. πŸ“Š Scores opportunities based on your preferences (tech stack, salary, location)
  4. ✍️ Generates personalized responses adapted to your professional situation
  5. πŸ“§ Sends ONE daily summary email with all new opportunities
  6. πŸš€ Sends approved responses back to LinkedIn

All running on your infrastructure with full observability and production-grade reliability.


✨ Key Features

πŸ€– Intelligent Analysis Pipeline

  • AI-Powered Extraction: Automatically extracts company, role, salary, tech stack from messages
  • Smart Scoring: Multi-dimensional scoring (tech match, salary, seniority, company quality)
  • Tiered Classification: A/B/C/D tier system for opportunity prioritization
  • Multi-Model Support: Use OpenAI, Anthropic, or Ollama (local/free) for LLM processing
  • Context-Aware Responses: Generates human-like responses that mirror language and tone
  • Real-time Granular Streaming: Watch the AI analyze messages step-by-step (extracting, scoring, drafting) in real-time via Server-Sent Events (SSE)

πŸ”„ Complete Automation Workflow

Daily at 9 AM:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  LinkedIn Messages β†’ Scraper β†’ AI Analysis β†’ Score & Tier      β”‚
β”‚         ↓                                                       β”‚
β”‚  Generate Personalized Response (based on your job status)     β”‚
β”‚         ↓                                                       β”‚
β”‚  Store in Database                                              β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                              ↓
              ONE Daily Summary Email with ALL opportunities
                              ↓
              Review β†’ Edit β†’ Approve β†’ Send to LinkedIn

πŸŽ›οΈ Production-Ready Features

| Feature | Description |
|---|---|
| Daily Scraping | Playwright-based LinkedIn scraper runs once daily at 9 AM |
| Smart Caching | Redis-based multi-layer caching reduces LLM calls by 60% |
| Background Jobs | Celery Beat schedules daily scraping and cleanup tasks |
| Daily Summary Email | ONE beautiful HTML email with all new opportunities |
| Mailpit Integration | Local email testing in development (catches all emails) |
| Response Workflow | Review, edit, approve, and send responses via REST API |
| Rate Limiting | Respects LinkedIn limits to avoid account restrictions |
| Session Management | Persistent cookies for reliable long-term operation |
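One common way such LLM-result caching is implemented (a sketch under assumptions, not the project's actual cache code) is to key Redis entries on a hash of the model name plus the raw message, so identical messages skip the LLM entirely:

```python
# Sketch of content-addressed caching for LLM analysis results.
# The key scheme, TTL, and function names are illustrative assumptions.
import hashlib
import json

def cache_key(model: str, message: str) -> str:
    """Derive a stable cache key from the model name and raw message text."""
    digest = hashlib.sha256(f"{model}:{message}".encode()).hexdigest()
    return f"llm:analysis:{digest}"

def get_or_analyze(redis, model: str, message: str, analyze, ttl: int = 86400):
    """Return a cached analysis if present; otherwise compute and store it."""
    key = cache_key(model, message)
    cached = redis.get(key)
    if cached is not None:
        return json.loads(cached)
    result = analyze(message)
    redis.setex(key, ttl, json.dumps(result))  # expire after `ttl` seconds
    return result
```

Because recruiter messages are often near-duplicates of templates, even exact-match hashing like this can yield high hit rates; the TTL bounds staleness.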

πŸ“Š Enterprise-Grade Observability

| Tool | Purpose | Access |
|---|---|---|
| Prometheus | Metrics collection (30+ custom metrics) | :9090 |
| Grafana | Pre-configured dashboards | :3001 |
| Jaeger | Distributed tracing (OpenTelemetry) | :16686 |
| Loki | Log aggregation | via Grafana |
| Flower | Celery task monitoring | :5555 |

Track everything:

  • Pipeline execution time
  • LLM token usage and costs
  • Cache hit rates
  • Opportunity distribution by tier
  • System health metrics

πŸ§ͺ Comprehensive Testing

  • 85% code coverage with 140+ tests
  • Unit tests for all core modules
  • Integration tests for end-to-end workflows
  • Load testing with Locust
  • Automated CI/CD pipeline

πŸ–₯️ Web Dashboard

A modern React dashboard for managing your LinkedIn opportunities - no command line needed!

Access

After starting the application, open http://localhost:3000

Pages

| Page | URL | Description |
|---|---|---|
| Dashboard | /dashboard | Overview with stats, charts, and "Scan LinkedIn" button |
| Opportunities | /opportunities | Browse all opportunities with filters and search |
| Opportunity Detail | /opportunities/:id | Full details, score breakdown, AI response |
| Responses | /responses | Approve, edit, or decline pending AI responses |
| Profile | /profile | Configure your preferences (tech stack, salary, etc.) |
| Settings | /settings | LLM config, LinkedIn credentials, notifications |

Key Features

  • Scan LinkedIn Button: Trigger scraping directly from the dashboard
  • Real-time Status: See scraping progress and health status
  • Toast Notifications: User-friendly messages for scraping results (success, no messages, errors)
  • Smart Filtering: Filter opportunities by tier, status, score, company
  • Response Management: Review AI responses before sending
  • Profile Editor: Visual editor for all your preferences
  • Mobile Responsive: Works on desktop and mobile

Tech Stack (Frontend)

  • React 18 + TypeScript
  • Vite for fast development
  • Tailwind CSS + shadcn/ui components
  • React Query for server state
  • Zustand for UI state
  • Recharts for visualizations

See docs/USER_GUIDE.md for the complete user guide.


πŸ—οΈ Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                         LinkedIn AI Agent                           β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                                     β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚
β”‚  β”‚                     Frontend (React)                          β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β”‚   Dashboard ─── Opportunities ─── Responses ─── Profile      β”‚ β”‚
β”‚  β”‚       β”‚              β”‚                β”‚            β”‚          β”‚ β”‚
β”‚  β”‚       β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜          β”‚ β”‚
β”‚  β”‚                           β”‚                                    β”‚ β”‚
β”‚  β”‚                      REST API                                  β”‚ β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚
β”‚                              β–Ό                                      β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚
β”‚  β”‚                    Application Layer                          β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β”‚   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”          β”‚ β”‚
β”‚  β”‚   β”‚ FastAPI  │◄────►│ Service  │◄────►│   DSPy   β”‚          β”‚ β”‚
β”‚  β”‚   β”‚   API    β”‚      β”‚  Layer   β”‚      β”‚ Pipeline β”‚          β”‚ β”‚
β”‚  β”‚   β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜          β”‚ β”‚
β”‚  β”‚        β”‚                 β”‚                  β”‚                β”‚ β”‚
β”‚  β”‚        β–Ό                 β–Ό                  β–Ό                β”‚ β”‚
β”‚  β”‚   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”          β”‚ β”‚
β”‚  β”‚   β”‚PostgreSQLβ”‚      β”‚  Redis   β”‚      β”‚  Ollama  β”‚          β”‚ β”‚
β”‚  β”‚   β”‚    DB    β”‚      β”‚  Cache   β”‚      β”‚   LLM    β”‚          β”‚ β”‚
β”‚  β”‚   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜          β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚
β”‚                                                                     β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚
β”‚  β”‚                 Background Processing                         β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β”‚   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚ β”‚
β”‚  β”‚   β”‚  Celery  β”‚   β”‚Playwrightβ”‚   β”‚  Email   β”‚   β”‚ Flower  β”‚  β”‚ β”‚
β”‚  β”‚   β”‚ Workers  β”‚   β”‚ Scraper  β”‚   β”‚  Sender  β”‚   β”‚ Monitor β”‚  β”‚ β”‚
β”‚  β”‚   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚
β”‚                                                                     β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚
β”‚  β”‚              Observability Stack (Optional)                   β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β”‚   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚ β”‚
β”‚  β”‚   β”‚Prometheus│───│ Grafana  │───│   Loki   │───│ Jaeger  β”‚  β”‚ β”‚
β”‚  β”‚   β”‚ Metrics  β”‚   β”‚Dashboard β”‚   β”‚   Logs   β”‚   β”‚ Traces  β”‚  β”‚ β”‚
β”‚  β”‚   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚ β”‚
β”‚  β”‚                                                               β”‚ β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚
β”‚                                                                     β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Data Flow

  1. Daily Scraping: Celery Beat triggers scraping at 9 AM daily
  2. Analysis: DSPy pipeline analyzes each message β†’ extracts info β†’ scores β†’ classifies tier
  3. Response Generation: AI generates personalized response based on your professional status
  4. Storage: All opportunities stored in PostgreSQL with their AI responses
  5. Daily Summary: ONE email sent with ALL new opportunities (uses Mailpit in development)
  6. User Action: Review responses in email β†’ Approve/Edit/Decline via API
  7. Send: Approved responses sent back to LinkedIn
  8. Monitoring: All operations tracked with metrics, traces, and logs
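Steps 2-4 of the data flow chain together per message roughly like this. All function and parameter names here are hypothetical, chosen for illustration; the real orchestration lives in the service layer under `app/services`.

```python
# Illustrative orchestration of analyze -> score -> respond -> store
# for one scraped message. Names are hypothetical, not the project's API.

def process_message(message: dict, analyzer, scorer, responder, repo) -> dict:
    """Run one scraped LinkedIn message through the analysis pipeline."""
    extracted = analyzer(message["raw_message"])   # company, role, salary, stack
    score, tier = scorer(extracted)                # multi-dimensional score + tier
    response = responder(extracted, tier)          # personalized draft reply
    opportunity = {**extracted, "score": score, "tier": tier, "response": response}
    repo.append(opportunity)                       # persist (PostgreSQL in the real app)
    return opportunity
```

Keeping each stage an injected callable like this is what lets the project swap LLM providers per module and unit-test each stage in isolation.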

πŸ› οΈ Tech Stack

Core Application

  • FastAPI - Modern async Python web framework
  • DSPy - Structured LLM programming framework
  • PostgreSQL 15 - Primary database with async support
  • Redis 7 - Caching and Celery broker
  • Celery 5 - Distributed task queue
  • Playwright - Browser automation for LinkedIn

AI/ML

  • Ollama - Local LLM runtime (free, private)
  • OpenAI - GPT-4, GPT-3.5-turbo support
  • Anthropic - Claude 3 support


πŸš€ Quick Start

Prerequisites

  • Docker and Docker Compose (recommended)
  • Python 3.11+ (for local development)
  • LinkedIn credentials (for scraping)
  • 8GB+ RAM recommended

Option 1: Docker (Recommended)

# 1. Clone repository
git clone https://github.com/yourusername/linkedin-ai-agent.git
cd linkedin-ai-agent

# 2. Configure environment
cp .env.example .env
nano .env  # Add your LinkedIn credentials and settings

# 3. Start all services (one command!)
./scripts/start.sh

# 4. Verify deployment
curl http://localhost:8000/health
# Expected: {"status":"healthy","timestamp":"..."}

That's it! The system is now:

  • βœ… Scheduled to scrape LinkedIn daily at 9 AM
  • βœ… Analyzing opportunities with AI (considering your professional status)
  • βœ… Caching results in Redis
  • βœ… Sending ONE daily summary email (view at http://localhost:8025 in dev)
  • βœ… Ready to generate personalized responses

Option 2: Local Development

# 1. Create virtual environment
python3.11 -m venv .venv
source .venv/bin/activate  # or .venv\Scripts\activate on Windows

# 2. Install dependencies
pip install -r requirements.txt
playwright install chromium

# 3. Start dependencies (Postgres, Redis, Ollama)
docker-compose up -d postgres redis ollama

# 4. Run migrations
alembic upgrade head

# 5. Start FastAPI server
uvicorn app.main:app --reload

# 6. Start Celery worker (separate terminal)
celery -A app.tasks.celery_app worker --loglevel=info

πŸ“± Usage Examples

Access the API

# Interactive API documentation
open http://localhost:8000/docs

List Opportunities

# Get all A-tier opportunities
curl "http://localhost:8000/api/v1/opportunities?tier=A&limit=10"

# Get opportunities above score 80
curl "http://localhost:8000/api/v1/opportunities?min_score=80"

Process a Message

# Manually process a LinkedIn message
curl -X POST http://localhost:8000/api/v1/opportunities \
  -H "Content-Type: application/json" \
  -d '{
    "recruiter_name": "Jane Smith",
    "raw_message": "Hi! Senior Python Engineer role at Google. $180k-$220k, remote. Interested?"
  }'

Review & Approve Response

# Get pending response for opportunity
curl http://localhost:8000/api/v1/responses/123

# Approve and send
curl -X POST http://localhost:8000/api/v1/responses/123/approve

# Edit before sending
curl -X POST http://localhost:8000/api/v1/responses/123/edit \
  -H "Content-Type: application/json" \
  -d '{"edited_response": "Thanks Jane! I would love to learn more..."}'

# Decline (no message sent)
curl -X POST http://localhost:8000/api/v1/responses/123/decline

View Analytics

# Get opportunity statistics
curl http://localhost:8000/api/v1/opportunities/analytics/stats

# Response:
# {
#   "total": 150,
#   "by_tier": {"A": 12, "B": 45, "C": 68, "D": 25},
#   "avg_score": 62.5,
#   "last_updated": "2024-01-18T..."
# }

πŸ“Š Observability

Access Monitoring Tools

Once the system is running, access these dashboards:

| Service | URL | Credentials | Purpose |
|---|---|---|---|
| Web Dashboard | http://localhost:3000 | - | Main application UI |
| API Docs | http://localhost:8000/docs | - | Interactive API testing |
| Mailpit | http://localhost:8025 | - | View emails in development |
| Flower | http://localhost:5555 | admin/admin | Celery task monitoring |
| Grafana | http://localhost:3001 | admin/admin | Metrics dashboards (with monitoring stack) |
| Prometheus | http://localhost:9090 | - | Raw metrics queries |
| Jaeger | http://localhost:16686 | - | Request tracing |

Key Metrics

Business Metrics:

  • opportunities_created_total - Total opportunities by tier
  • opportunity_score_distribution - Score histogram
  • opportunities_by_tier - Current distribution

Performance Metrics:

  • dspy_pipeline_execution_time_seconds - Pipeline latency
  • llm_api_latency_seconds - LLM response time
  • llm_tokens_used_total - Token usage and costs

Cache Metrics:

  • cache_operations_total - Hit/miss rates
  • cache_hit_rate - Percentage of cache hits

System Metrics:

  • db_query_latency_seconds - Database performance
  • scraper_operations_total - Scraping success/failure

Example Queries

# Average pipeline execution time (last 1h)
rate(dspy_pipeline_execution_time_seconds_sum[1h]) /
rate(dspy_pipeline_execution_time_seconds_count[1h])

# Cache hit rate
sum(rate(cache_operations_total{status="hit"}[5m])) /
sum(rate(cache_operations_total[5m])) * 100

# Opportunities created per day by tier
sum by (tier) (increase(opportunities_created_total[1d]))

Grafana Dashboards

Pre-configured dashboards available in monitoring/grafana/dashboards/:

  • LinkedIn Agent Overview - Main business metrics
  • System Performance - CPU, memory, network
  • DSPy Pipeline - AI/ML performance
  • Database & Cache - Data layer metrics

βš™οΈ Configuration

Environment Variables

Key configuration options (see .env.example for all options):

# === Application ===
ENV=development
LOG_LEVEL=INFO

# === LinkedIn Credentials ===
LINKEDIN_EMAIL=your@email.com
LINKEDIN_PASSWORD=your-password

# === Database ===
DATABASE_URL=postgresql+asyncpg://user:pass@localhost:5432/linkedin_agent

# === Redis ===
REDIS_URL=redis://localhost:6379/0

# === AI/ML Configuration ===
# Choose provider: ollama (local/free), openai, anthropic
LLM_PROVIDER=ollama
LLM_MODEL=llama3.2

# Ollama (local)
OLLAMA_URL=http://localhost:11434
OLLAMA_MODEL=llama3.2

# OpenAI (paid)
OPENAI_API_KEY=sk-...
OPENAI_MODEL=gpt-4-turbo

# Anthropic (paid)
ANTHROPIC_API_KEY=sk-ant-...
ANTHROPIC_MODEL=claude-3-sonnet-20240229

# Per-module configuration (optional)
ANALYZER_LLM_PROVIDER=ollama
ANALYZER_LLM_MODEL=llama3.2
RESPONSE_LLM_PROVIDER=openai
RESPONSE_LLM_MODEL=gpt-4-turbo

# === Email Notifications ===
# Development: Use Mailpit (local email catcher)
SMTP_HOST=localhost
SMTP_PORT=1025
SMTP_USE_TLS=false
SMTP_USERNAME=
SMTP_PASSWORD=
SMTP_FROM_EMAIL=noreply@linkedin-agent.local
NOTIFICATION_EMAIL=you@example.com

# Production: Use real SMTP (Gmail, SendGrid, etc.)
# SMTP_HOST=smtp.gmail.com
# SMTP_PORT=587
# SMTP_USE_TLS=true
# SMTP_USERNAME=your_email@gmail.com
# SMTP_PASSWORD=your_app_password

# Only notify for these tiers
NOTIFICATION_TIER_THRESHOLD=["A", "B"]
NOTIFICATION_SCORE_THRESHOLD=60

# === Scraper Settings ===
SCRAPER_HEADLESS=true
SCRAPER_MAX_REQUESTS_PER_MINUTE=10
SCRAPER_MIN_DELAY_SECONDS=3.0

# === Observability (Optional) ===
OTEL_ENABLED=true
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4318
PROMETHEUS_MULTIPROC_DIR=/tmp/prometheus
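The two notification thresholds above combine roughly as sketched below. This is an assumption about how the gating works (the real filtering lives in the email task), using stdlib parsing of the JSON-style tier list:

```python
# Sketch of how NOTIFICATION_TIER_THRESHOLD and NOTIFICATION_SCORE_THRESHOLD
# might gate which opportunities reach the daily summary email.
# The parsing and function name are illustrative assumptions.
import json
import os

def should_notify(tier: str, score: float) -> bool:
    """Include an opportunity only if its tier is allowed AND its score clears the bar."""
    allowed_tiers = json.loads(
        os.environ.get("NOTIFICATION_TIER_THRESHOLD", '["A", "B"]')
    )
    min_score = float(os.environ.get("NOTIFICATION_SCORE_THRESHOLD", "60"))
    return tier in allowed_tiers and score >= min_score
```

With the defaults shown, a B-tier opportunity scoring 75 is emailed, while an A-tier one scoring 50 is not: both conditions must hold.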

User Profile Configuration

Configure your preferences in config/profile.yaml:

# Personal Information
name: "Your Name"

# Skills and Experience
preferred_technologies:
  - Python
  - FastAPI
  - PostgreSQL
  - Docker
  - React

years_of_experience: 5
current_seniority: "Senior"  # Junior/Mid/Senior/Staff/Principal

# Compensation Expectations (USD)
minimum_salary_usd: 80000
ideal_salary_usd: 120000

# Work Preferences
preferred_remote_policy: "Remote"  # Remote/Hybrid/On-site/Flexible
preferred_locations:
  - "Remote"
  - "United States"

# Company Preferences
preferred_company_size: "Mid-size"  # Startup/Mid-size/Enterprise
industry_preferences:
  - "Technology"
  - "AI/ML"
  - "SaaS"

Professional Status & AI Response Personalization

The system adapts AI-generated responses based on your current professional situation. Configure the job_search_status section to personalize how responses are generated:

# Professional Status (used for AI response generation)
job_search_status:
  currently_employed: true
  actively_looking: false  # true = actively searching, false = only exceptional opportunities

  # Urgency level determines response tone
  # Options: urgent, moderate, selective, not_looking
  urgency: "selective"

  # Your current situation (free text - be specific!)
  situation: |
    Currently employed and happy, but open to exceptional opportunities.
    Only considering roles with 4-day work week.
    Focused on AI/ML engineering positions.

  # Deal-breakers - opportunities missing these will be politely declined
  must_have:
    - "4-day work week (mandatory)"
    - "Remote-first company"
    - "Focus on AI/ML projects"
    - "Senior or Staff level position"

  # Nice to have - will express interest if present
  nice_to_have:
    - "Equity compensation"
    - "Conference/learning budget"
    - "Modern tech stack"
    - "Flexible hours"

  # Automatic rejection criteria - will decline opportunities matching these
  reject_if:
    - "Agencies or consulting firms"
    - "Cryptocurrency/blockchain only"
    - "Early-stage startups (pre-seed)"
    - "5-day work week requirement"
    - "Full-time on-site"

How urgency affects responses:

| Urgency Level | Response Behavior |
|---|---|
| `urgent` | Proactive, enthusiastic responses. Express strong interest in good matches. |
| `moderate` | Balanced responses. Show interest and ask clarifying questions. |
| `selective` | Reserved responses. Emphasize specific requirements before proceeding. |
| `not_looking` | Polite but firm. Only engage with truly exceptional opportunities. |

Example response behaviors:

  • HIGH_PRIORITY opportunity + selective urgency: Express interest but ask about must-have requirements (e.g., "Before we proceed, does the role offer a 4-day work week?")
  • INTERESANTE opportunity + not_looking urgency: Politely acknowledge but mention you're not actively looking unless it meets specific criteria
  • Any opportunity missing must_have items: Politely decline and mention the specific requirement that wasn't met
  • Opportunity matching reject_if criteria: Automatic polite decline with brief explanation
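The decision order described above (reject_if first, then must_have, then urgency-driven tone) can be sketched as follows. Note this uses naive substring matching purely for illustration; the actual system evaluates these criteria with the LLM, not keyword checks.

```python
# Sketch of the decision order above: reject_if -> must_have -> urgency tone.
# Naive substring matching, illustrative only; the real system uses
# LLM-based evaluation of the criteria.

TONE = {
    "urgent": "enthusiastic",
    "moderate": "balanced",
    "selective": "reserved",
    "not_looking": "polite-decline-leaning",
}

def decide(message: str, must_have: list, reject_if: list, urgency: str) -> str:
    """Pick a response strategy for one opportunity message."""
    text = message.lower()
    if any(r.lower() in text for r in reject_if):
        return "decline"                      # automatic polite decline
    missing = [m for m in must_have if m.lower() not in text]
    if missing:
        return "ask-or-decline"               # surface the unmet requirement
    return TONE.get(urgency, "balanced")      # tone follows urgency level
```

Checking deal-breakers before must-haves matters: a role that matches `reject_if` is declined outright, without spending a clarifying question on it.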

Daily Summary Email

Instead of sending individual emails for each opportunity, the system sends ONE daily summary email at 9 AM containing all new opportunities found.

Email includes for each opportunity:

  • Tier classification (HIGH_PRIORITY, INTERESANTE, POCO_INTERESANTE, NO_INTERESA)
  • Score breakdown (tech stack, salary, seniority, company)
  • Extracted information (company, role, salary range, tech stack)
  • AI-generated response (personalized to your professional status)
  • Action buttons: Approve / Edit / Decline

Development with Mailpit:

In development, emails are captured by Mailpit instead of being sent to real addresses:

# Mailpit is included in docker-compose.yml
# View captured emails at:
open http://localhost:8025

Mailpit captures all outgoing emails, making it easy to test and preview the daily summary without configuring a real SMTP server.


πŸ§‘β€πŸ’» Development

Local Setup

# Install development dependencies
pip install -r requirements-dev.txt

# Setup pre-commit hooks
pre-commit install

# Run tests
pytest tests/ -v --cov=app

# Run linters
black app/ tests/
ruff check --fix app/ tests/
mypy app/

# Security scan
bandit -r app/
safety check

Project Structure

linkedin-ai-agent/
β”œβ”€β”€ app/
β”‚   β”œβ”€β”€ api/                    # REST API endpoints
β”‚   β”‚   └── v1/
β”‚   β”‚       β”œβ”€β”€ opportunities.py
β”‚   β”‚       β”œβ”€β”€ responses.py
β”‚   β”‚       └── health.py
β”‚   β”œβ”€β”€ cache/                  # Redis caching layer
β”‚   β”œβ”€β”€ core/                   # Configuration & utilities
β”‚   β”œβ”€β”€ database/               # SQLAlchemy models & repos
β”‚   β”œβ”€β”€ dspy_pipeline/          # AI analysis pipeline
β”‚   β”‚   β”œβ”€β”€ opportunity_analyzer.py
β”‚   β”‚   └── response_generator.py
β”‚   β”œβ”€β”€ observability/          # Metrics & tracing
β”‚   β”œβ”€β”€ scraper/                # LinkedIn scraper
β”‚   β”œβ”€β”€ services/               # Business logic layer
β”‚   β”œβ”€β”€ tasks/                  # Celery background tasks
β”‚   └── main.py                 # FastAPI application
β”œβ”€β”€ tests/
β”‚   β”œβ”€β”€ unit/                   # Unit tests
β”‚   β”œβ”€β”€ integration/            # Integration tests
β”‚   └── performance/            # Load tests
β”œβ”€β”€ monitoring/                 # Observability configs
β”‚   β”œβ”€β”€ grafana/
β”‚   β”œβ”€β”€ prometheus/
β”‚   └── loki/
β”œβ”€β”€ infrastructure/
β”‚   └── docker/                 # Dockerfiles
β”œβ”€β”€ scripts/                    # Automation scripts
β”œβ”€β”€ config/                     # Configuration files
β”œβ”€β”€ docs/                       # Documentation
β”œβ”€β”€ docker-compose.yml
└── requirements.txt

Testing

# Run all tests with coverage
pytest tests/ -v --cov=app --cov-report=html

# Run specific test categories
pytest tests/unit/ -v              # Unit tests only
pytest tests/integration/ -v       # Integration tests
pytest -k "cache" -v               # Cache tests only

# View coverage report
open htmlcov/index.html

# Load testing
locust -f tests/performance/locustfile.py --host=http://localhost:8000

Test Coverage:

  • βœ… 140+ tests
  • βœ… 85% code coverage
  • βœ… All core modules tested
  • βœ… Integration tests for workflows
  • βœ… Performance benchmarks

🚒 Deployment

Docker Compose (Recommended)

# Development
docker-compose up -d

# Production
docker-compose -f docker-compose.prod.yml up -d

# With monitoring stack
docker-compose up -d
docker-compose -f docker-compose.monitoring.yml up -d

Manual Deployment

See docs/DEPLOYMENT.md for:

  • Cloud deployment (AWS, GCP, Azure)
  • Kubernetes manifests
  • CI/CD setup with GitHub Actions
  • Secrets management
  • Backup/restore procedures
  • Scaling strategies

Resource Requirements

Minimum:

  • 2 CPU cores
  • 4GB RAM
  • 20GB disk

Recommended:

  • 4 CPU cores
  • 8GB RAM
  • 50GB disk

Production:

  • 8+ CPU cores
  • 16GB+ RAM
  • 100GB+ SSD

πŸ“š Documentation

Comprehensive documentation available in docs/:

| Document | Description |
|---|---|
| USER_GUIDE.md | Start here! Complete user guide with frontend |
| ARCHITECTURE.md | System design and data flow |
| API.md | Complete API reference |
| DEPLOYMENT.md | Production deployment guide |
| DEVELOPMENT.md | Development workflow |
| TESTING_GUIDE.md | Testing strategies |
| MULTI_LLM_GUIDE.md | Multi-model LLM setup |
| NOTIFICATIONS_AND_RESPONSES.md | Email & response workflow |
| SCRAPER.md | LinkedIn scraper details |


🀝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

Development Workflow

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

Code Standards

  • Python: Black formatting, Ruff linting, MyPy type checking
  • Tests: 80%+ coverage required
  • Commits: Conventional commits format
  • Documentation: Update relevant docs with changes

πŸ› Troubleshooting

Common Issues

Ollama not responding:

docker-compose restart ollama
docker-compose logs ollama

Database connection errors:

docker-compose exec postgres pg_isready
# Check DATABASE_URL in .env

Playwright browser issues:

playwright install chromium --with-deps

LinkedIn login failing:

  • Check credentials in .env
  • Verify LinkedIn account isn't locked
  • Try with SCRAPER_HEADLESS=false to debug

See docs/TROUBLESHOOTING.md for more solutions.


πŸ“ˆ Performance

Benchmarks

Based on testing (M1 MacBook Pro, 16GB RAM):

| Metric | Performance |
|---|---|
| API Response Time (no LLM) | p95 < 100ms |
| Pipeline Execution | 2-4s per message |
| Throughput | ~15 messages/min (single worker) |
| Cache Hit Rate | ~60% steady state |
| Database Queries | p95 < 10ms |

Optimization Tips

  1. Increase workers: Scale Celery workers for higher throughput
  2. Batch processing: Process multiple messages in batches
  3. Use cheaper models: Ollama/Llama for analysis, GPT-4 for responses
  4. Cache aggressively: Longer TTLs for stable data
  5. Connection pooling: Reuse DB connections
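For tip 2, batch processing can be as simple as chunking messages before dispatching them to workers. A minimal helper (the batch size is an illustrative default, not a project setting):

```python
# Minimal chunking helper for batch processing (optimization tip 2).
# Batch size of 10 is an illustrative default, not a project setting.
from itertools import islice

def batched(items, size: int = 10):
    """Yield successive lists of at most `size` items from any iterable."""
    it = iter(items)
    while chunk := list(islice(it, size)):
        yield chunk
```

Dispatching one Celery task per batch instead of per message amortizes broker round-trips and lets a worker reuse one LLM session across the batch.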

πŸ›‘οΈ Security

  • βœ… Secrets Management: All credentials in environment variables
  • βœ… Input Validation: Pydantic models for all inputs
  • βœ… SQL Injection: SQLAlchemy ORM with parameterized queries
  • βœ… Rate Limiting: Prevents LinkedIn account restrictions
  • βœ… Dependency Scanning: Automated with safety and trivy
  • βœ… Container Scanning: Docker image vulnerability checks
  • βœ… Non-root Containers: All containers run as non-root users

πŸ“ License

This project is licensed under the MIT License - see the LICENSE file for details.


πŸ™ Acknowledgments

Built with amazing open-source tools:

  • DSPy - Stanford NLP's structured prompting framework
  • FastAPI - Modern Python web framework
  • Ollama - Local LLM runtime
  • Playwright - Browser automation

Special thanks to the open-source community for making projects like this possible.



⭐ Star this repo if you find it useful!

Built with ❀️ for automating the job search

Report Bug β€’ Request Feature β€’ Documentation
