Community-driven projects for analyzing 20,000+ Epstein Estate files released by the House Oversight Committee
This repository catalogs community-developed projects for analyzing the 20,000+ pages of Epstein files released by the House Oversight Committee on November 12, 2025.
Content Notice: Documents contain sensitive material related to criminal investigations.
⚠️ Verification Warning: Many projects use generative models to analyze documents. These models can make errors, miss information, or hallucinate false details. Always cross-verify findings against the actual source documents. If you encounter erroneous information, please report it in the Safety repository.
New to the dataset? Start with our beginner-friendly Jupyter notebook:
Getting_Started_w_Dataset.ipynb
This notebook provides:
- Basic exploration and analysis examples
- Sample queries and filtering techniques
Well suited to users interested in developing new analysis tools or conducting research with the Hugging Face dataset.
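For a rough idea of what a filtering query can look like before opening the notebook, here is a minimal sketch of a case-insensitive keyword filter. It runs on in-memory placeholder records rather than the real dataset. The field names `filename` and `text`, the sample records, and the helper `filter_by_keyword` are all assumptions made for illustration; the actual schema may differ, so treat the notebook as the reference.

```python
# Placeholder records standing in for dataset rows.
# ASSUMPTION: the "filename" and "text" fields are hypothetical;
# check the notebook for the dataset's real column names.
SAMPLE_RECORDS = [
    {"filename": "placeholder_0001.txt", "text": "Placeholder text about a Meeting agenda."},
    {"filename": "placeholder_0002.txt", "text": "Placeholder text about a budget."},
    {"filename": "placeholder_0003.txt", "text": "Placeholder notes from a meeting."},
]


def filter_by_keyword(records, keyword, text_field="text"):
    """Return the records whose text field contains the keyword, ignoring case."""
    needle = keyword.casefold()
    return [
        record
        for record in records
        # A missing field is treated as empty text, so it never matches.
        if needle in str(record.get(text_field, "")).casefold()
    ]


matches = filter_by_keyword(SAMPLE_RECORDS, "meeting")
for record in matches:
    print(record["filename"])
```

The same predicate could likely be reused with the Hugging Face `datasets` library, whose `Dataset.filter` method accepts a function applied to each row. Any match is only a pointer: confirm it against the source document itself.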
- Purpose: Browse and search document collection
- Key Features: Advanced search, filtering, bookmarking
- Privacy: Browser-based interface
- Purpose: Query documents using natural language
- Key Features: AI-powered Q&A, conversational interface
- Privacy: Configurable deployment
- Purpose: Identify key documents by relevance
- Key Features: Algorithm-based ranking, prioritized lists
- Privacy: Local processing
- Purpose: Discover investigative leads
- Key Features: AI scoring (0-100), entity extraction, categorization
- Privacy: Runs entirely offline
- Purpose: Visual analysis and patterns
- Key Features: Interactive charts, relationship mapping
- Privacy: Client-side processing
Contributions are always welcome!
- Fork this repository
- Add your project using this format:
#### [Project Name](url)
- **Purpose**: Brief description
- **Key Features**: Main capabilities
- **Privacy**: Privacy approach
- Submit a pull request
- ✅ Legitimate research/journalism purposes
- ✅ Clear documentation
- ✅ Open-source preferred
- ✅ Transparent data handling
- Datasets Repository - Dataset documentation and access
- Safety Repository - Report issues with projects or data
- House Oversight Release - Original source
Community-maintained • Not affiliated with any official investigation