Your personal AI intelligence feed. Zero cost. Adversarially verified.
python run.py --topic "LLM agents"One command. Verified digest. No subscriptions. No cloud. No cost.
A four-stage Python pipeline that fetches AI research from arXiv, RSS feeds, and HuggingFace Papers, ranks it by relevance to your topic, extracts one factual claim per item, and runs a reasoning model to verify whether each claim is actually supported by the evidence cited.
Every item gets a label: VERIFIED, DISPUTED, or UNVERIFIED.
The output is a structured markdown report saved to digest_output/.
- Not a RAG system. It does not index your documents.
- Not a chatbot. You cannot ask follow-up questions.
- Not real-time. arXiv has a 24-hour lag.
- Not a memory system. Each run starts fresh.
- Not a full-paper reader. It works from abstracts and summaries.
These are honest limitations, not bugs. The architecture is intentionally minimal.
arXiv API ──┐
RSS feeds ──┼──► fetch_all() ──► rank_items() ──► summarize_all() ──► verify_all() ──► digest
HF Papers ──┘
(zero cost) (priority score) (fast model) (reasoning model)
Stage 0 — Fetch. Pulls from three sources. No LLM calls. Zero cost.
Stage 0.5 — Rank. Scores each item: S = Authority + Semantic Density - Temporal Decay. ArXiv papers score highest (60), HuggingFace papers next (40), RSS last (20). Each keyword match in the title adds 25. Transparent weights you can change in two lines.
Stage 1 — Summarize. Fast model (Gemini Flash / Qwen2.5) extracts one factual claim and the verbatim evidence supporting it. Does not verify. Just extracts.
Stage 2 — Verify. Reasoning model (DeepSeek-R1) receives only the claim and evidence. It cannot see Stage 1's confidence. It applies five rules and returns a verdict.
Stage 3 — Digest. Formats everything as structured markdown. Saves to digest_output/.
- Evidence directly supports claim with numbers or verbatim quotes → VERIFIED
- Claim uses a superlative not present in the evidence → DISPUTED
- Single social media source, no corroboration → UNVERIFIED
- Benchmark claim without benchmark name AND score → UNVERIFIED
- Blog post or press release without linked primary data → UNVERIFIED
Hard rules. Not soft heuristics. The verifier cannot decide to be lenient.
# 1. Get a free API key at openrouter.ai
# 2. Clone and install
git clone https://github.com/your-handle/ai-feed
cd ai-feed
pip install -r requirements.txt
# 3. Add your key
cp .env .env.local
# edit .env.local: OPENROUTER_API_KEY=your_key_here
# 4. Run
python run.py --topic "speculative decoding"curl https://ollama.ai/install.sh | sh
ollama pull qwen2.5:7b
ollama pull deepseek-r1:7b
# edit scripts/llm.py: switch to ollama backend
python run.py --topic "transformer architecture"python run.py --profile profiles/ml_engineer.yaml
python run.py --profile profiles/fmcg_data_scientist.yaml
python run.py --profile profiles/agent_builder.yamlEdit any profile in profiles/. Add your topics, your RSS feeds, your email.
══════════════════════════════════════════════════════════════
AI FEED — LLM AGENTS — 2026-05-10
══════════════════════════════════════════════════════════════
🟢 VERIFIED CLAIMS
──────────────────
📄 EAGLE3: 3.1x inference speedup with zero accuracy loss on LLaMA-3-70B
Claim: wall-clock speedup 3.1x, perplexity unchanged at 5.68
Source: arxiv.org/abs/2605.07891
🟡 DISPUTED CLAIMS
──────────────────
📄 "Our method outperforms all existing speculative decoding approaches"
Flag: Superlative not supported. Paper compares to 3 methods, not comprehensive survey.
⚪ UNVERIFIED
─────────────
📄 OpenAI planning native speculative decoding in API this quarter
Source: twitter.com — single source, no corroboration
══════════════════════════════════════════════════════════════
1 verified / 1 disputed / 1 unverified
══════════════════════════════════════════════════════════════
# Every morning at 7am
0 7 * * * cd /path/to/ai-feed && python run.py --profile profiles/ml_engineer.yaml| Machine | Recommended setup |
|---|---|
| 8GB RAM laptop | OpenRouter free tier (default) |
| 16GB RAM / M1 Mac | Ollama: qwen2.5:7b + deepseek-r1:7b |
| 32GB RAM | Ollama: qwen2.5:14b + deepseek-r1:14b |
ai-feed/
├── run.py ← entry point
├── scripts/
│ ├── llm.py ← LLM interface (OpenRouter / Ollama)
│ ├── fetch.py ← fetch + rank
│ ├── summarize.py ← Stage 1: claim extraction
│ ├── verify.py ← Stage 2: adversarial verification
│ └── digest.py ← Stage 3: markdown output
├── profiles/
│ ├── ml_engineer.yaml
│ ├── fmcg_data_scientist.yaml
│ └── agent_builder.yaml
├── digest_output/ ← auto-created on first run
├── .env
└── requirements.txt
To understand the what why and how,read this.
- Cross-session memory and contradiction detection
- Full PDF verification (not just abstracts)
- Source reputation scoring
- Citation velocity weighting via Semantic Scholar API
- Web interface
Pull requests welcome on all of these.
MIT. Fork it. Extend it. Use it commercially.