🤖 ConsensusWatch (Bot Detector MVP)

Detects synthetic consensus and coordinated narrative steering in Reddit threads.

🎯 What It Does

Given a Reddit thread or subreddit over a time window, ConsensusWatch outputs a Synthetic Consensus Score and human-readable explanation indicating whether the apparent consensus is organic or potentially coordinated.

🚀 Features

Real-time Reddit Analysis: Fetch and analyze threads using PRAW
Semantic Similarity Detection: Uses sentence transformers to identify coordinated messaging
Temporal Pattern Analysis: Detects suspicious timing and burst patterns
Graph-based Coordination Detection: Identifies clusters of accounts with coordinated behavior
Explainable Results: Human-readable rationales for detection decisions
Multiple Interfaces: CLI, REST API, and comprehensive web dashboard
Interactive Visualizations: Rich graphs and charts for analysis results
Batch Analysis: Analyze multiple threads across subreddits

📊 Synthetic Consensus Index

The system calculates a Synthetic Consensus Score (0-100) based on:

Semantic Similarity: Unusually high similarity between comments
Temporal Clustering: Coordinated posting patterns and timing bursts
User-Comment Graphs: Network analysis of user interactions
Language Patterns: Repetitive phrasing and template usage

See docs/synthetic_consensus_index.md for methodology.

🛠️ Installation

# Clone the repository
git clone https://github.com/yourusername/consensuswatch.git
cd consensuswatch

# Install dependencies
pip install -r requirements.txt

# Or install with pip
pip install -e .

⚙️ Configuration

Create a .env file with your Reddit API credentials:

REDDIT_CLIENT_ID=your_client_id
REDDIT_CLIENT_SECRET=your_client_secret
REDDIT_USER_AGENT=ConsensusWatch/0.1.0

Get Reddit API credentials here

🚀 Quick Start & Deployment

Fastest Way to Get Started

# Deploy to Streamlit Cloud (FREE):
# 1. Fork this repo
# 2. Visit https://share.streamlit.io
# 3. Deploy dashboard/app_new.py
# 4. Add your Reddit API secrets
# Done! Live in 2 minutes.

See QUICKSTART.md for detailed guides.

Local Development

# Quick start
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
streamlit run dashboard/app_new.py

Docker Deployment

# One command to run everything
docker-compose up --build

# Or use the deployment helper
./deploy.sh

See DEPLOYMENT.md for all deployment options.

🎮 Usage

CLI

# Analyze a specific thread
consensuswatch analyze --url "https://www.reddit.com/r/worldnews/comments/..."

# Analyze a subreddit over time
consensuswatch analyze --subreddit worldnews --hours 24

# With detailed output
consensuswatch analyze --url "..." --verbose --output report.json

Python API

from ingestion.thread_fetcher import fetch_thread
from analysis.consensus_score import calculate_consensus_score
from explainability.rationale import generate_rationale

# Fetch thread data
thread_data = fetch_thread("https://www.reddit.com/r/...")

# Calculate consensus score
score, metrics = calculate_consensus_score(thread_data)

# Get explanation
explanation = generate_rationale(score, metrics)

print(f"Synthetic Consensus Score: {score}/100")
print(f"Analysis: {explanation}")

REST API

# Start the API server
uvicorn api.main:app --reload

# Analyze a thread
curl -X POST "http://localhost:8000/analyze" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://www.reddit.com/r/..."}'

Web Dashboard

# Launch comprehensive web interface
streamlit run dashboard/app_new.py

# Or use the original simple dashboard
streamlit run dashboard/app.py

Features:

🏠 Home: Interactive introduction and feature showcase
🔍 Analyze Thread: Single thread analysis with comprehensive visualizations
📊 Batch Analysis: Analyze multiple threads from a subreddit
📚 Documentation: Complete methodology and usage guide
ℹ️ About: Project information and ethics

Visualizations Include:

Score gauge with risk levels
Component score breakdowns (bar & radar charts)
Similarity heatmaps
Temporal burst detection graphs
Network interaction graphs
Cluster visualizations (2D projections)
Metrics comparison tables

📁 Project Structure

consensuswatch/
├── ingestion/          # Reddit data fetching & normalization
├── embeddings/         # Text embedding & caching
├── analysis/           # Core detection algorithms
├── explainability/     # Human-readable explanations
├── api/                # FastAPI REST endpoints
├── dashboard/          # Streamlit web interface
├── cli/                # Command-line tool
├── tests/              # Unit tests
└── docs/               # Documentation

🧪 Testing

# Run all tests
pytest

# With coverage
pytest --cov=. --cov-report=html

# Run specific test
pytest tests/test_consensus_score.py

📖 Documentation

🔬 Example Case Studies

⚠️ Limitations

False positives: Genuine grassroots movements may appear coordinated
Language support: Currently optimized for English
API limits: Reddit API rate limiting applies
Not definitive proof: Results are probabilistic indicators, not proof

See docs/limitations.md for full discussion.

🤝 Contributing

Contributions welcome! Please read our contributing guidelines and submit PRs.

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

Built with:

PRAW - Reddit API wrapper
Sentence Transformers - Semantic embeddings
HDBSCAN - Clustering
FastAPI - REST API
Streamlit - Dashboard

⚖️ Ethics & Responsible Use

This tool is designed for research and transparency. Users should:

Respect Reddit's Terms of Service
Not harass or dox suspected bot accounts
Consider context and avoid false accusations
Use findings to improve platform health, not witch hunts

Status: MVP - Active Development

Version: 0.1.0

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.do		.do
.github/workflows		.github/workflows
.railway		.railway
.streamlit		.streamlit
analysis		analysis
api		api
cli		cli
dashboard		dashboard
data		data
docs		docs
embeddings		embeddings
examples		examples
explainability		explainability
ingestion		ingestion
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
DEPLOYMENT.md		DEPLOYMENT.md
DEPLOYMENT_WALKTHROUGH.md		DEPLOYMENT_WALKTHROUGH.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Procfile		Procfile
QUICKSTART.md		QUICKSTART.md
README.md		README.md
app.json		app.json
deploy-complete.sh		deploy-complete.sh
deploy.sh		deploy.sh
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
runtime.txt		runtime.txt
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 ConsensusWatch (Bot Detector MVP)

🎯 What It Does

🚀 Features

📊 Synthetic Consensus Index

🛠️ Installation

⚙️ Configuration

🚀 Quick Start & Deployment

Fastest Way to Get Started

Local Development

Docker Deployment

🎮 Usage

CLI

Python API

REST API

Web Dashboard

📁 Project Structure

🧪 Testing

📖 Documentation

🔬 Example Case Studies

⚠️ Limitations

🤝 Contributing

📄 License

🙏 Acknowledgments

⚖️ Ethics & Responsible Use

About

Uh oh!

Releases

Packages

Languages

License

etelford32/Bot_Detector

Folders and files

Latest commit

History

Repository files navigation

🤖 ConsensusWatch (Bot Detector MVP)

🎯 What It Does

🚀 Features

📊 Synthetic Consensus Index

🛠️ Installation

⚙️ Configuration

🚀 Quick Start & Deployment

Fastest Way to Get Started

Local Development

Docker Deployment

🎮 Usage

CLI

Python API

REST API

Web Dashboard

📁 Project Structure

🧪 Testing

📖 Documentation

🔬 Example Case Studies

⚠️ Limitations

🤝 Contributing

📄 License

🙏 Acknowledgments

⚖️ Ethics & Responsible Use

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages