First public release of Sift — published to PyPI as getsift.
Install
pip install getsift
sift init # writes config.yaml + .env with prompts for API keys
sift # launches the interactive terminal UIHighlights
- Interactive Rich-based terminal UI with setup wizard, plus scripted
sift analyze/sift scrapecommands. - Multi-source scrapers: G2, App Store, Play Store, YouTube, Hacker News, GitHub issues, Product Hunt, Stack Overflow, Dev.to, support forums, changelogs, Discord/LinkedIn exports. (Reddit is wired up but muted by default until API approval.)
- ML pipeline: hash-based dedup → relevance gate → sentence embeddings (
all-MiniLM-L12-v2) → UMAP + HDBSCAN clustering → LLM cluster analysis with structured JSON fallbacks. - Multi-product comparison reports (Markdown + JSON).
- OpenAI-compatible LLM client with configurable
base_url— no vendor lock-in. - Token-bucket rate limiting with exponential backoff and jitter; usernames stripped at ingestion (only source links retained).
- Tested on Python 3.11 and 3.12, MIT licensed.
The full feature list is in CHANGELOG.md.