
EF20K Projects: Investigative Tools

Community-driven projects for analyzing the 20,000+ Epstein Estate files released by the House Oversight Committee



🌟 About

This repository catalogs community-developed projects for analyzing the 20,000+ pages of Epstein files released by the House Oversight Committee on November 12, 2025.

Content Notice: Documents contain sensitive material related to criminal investigations.

⚠️ Verification Warning: Many projects use generative models to analyze documents; these models can make errors, miss information, or hallucinate false details. Always cross-verify findings against the actual source documents. If you encounter erroneous information, please report it in the Safety repository.
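
One simple way to start cross-verifying is to check whether a passage quoted by a tool actually appears in the source text. The sketch below is only an illustration under stated assumptions: `normalize` and `quote_in_source` are hypothetical helpers, not part of any listed project, and the sample text is a placeholder rather than a real document.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so stray line breaks do not block a match."""
    return re.sub(r"\s+", " ", text).strip().casefold()


def quote_in_source(quote: str, source_text: str) -> bool:
    """Return True if the quoted passage appears in the source after normalization."""
    needle = normalize(quote)
    if not needle:
        # An empty quote would trivially match anything, so treat it as unverified.
        return False
    return needle in normalize(source_text)


# Placeholder text standing in for a source document (not real data).
source = "The meeting was moved\nto   Thursday at noon."

print(quote_in_source("moved to Thursday", source))  # True
print(quote_in_source("moved to Friday", source))    # False
```

A failed match does not prove a claim was fabricated (OCR errors or paraphrasing can also cause it), and a successful match does not confirm a tool's interpretation; either way, read the source document itself.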

🚀 Getting Started

Quick Start with Python

New to the dataset? Start with our beginner-friendly Jupyter notebook:

Getting_Started_w_Dataset.ipynb

This notebook provides:

  • Basic exploration and analysis examples
  • Sample queries and filtering techniques

Ideal for users interested in developing new analysis tools or conducting research with the Hugging Face dataset.
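
As a rough illustration of the kind of keyword filtering such a notebook might cover (this is not the notebook's own code), the sketch below filters records in memory. It assumes each record is a dict with hypothetical `filename` and `text` fields; the actual column names in the dataset may differ, and the records shown are placeholders, not real documents.

```python
from typing import Iterable


def filter_by_keyword(records: Iterable[dict], keyword: str, field: str = "text") -> list[dict]:
    """Return the records whose `field` contains `keyword`, ignoring case."""
    needle = keyword.casefold()
    return [r for r in records if needle in str(r.get(field, "")).casefold()]


# Placeholder rows with an assumed schema (hypothetical, for illustration only).
sample_records = [
    {"filename": "doc_0001.txt", "text": "Meeting schedule for March."},
    {"filename": "doc_0002.txt", "text": "Invoice for office supplies."},
    {"filename": "doc_0003.txt", "text": "Revised SCHEDULE, page 2."},
]

matches = filter_by_keyword(sample_records, "schedule")
for record in matches:
    print(record["filename"])
```

To run the same filter on the real data, one option is to load the rows with the `datasets` library's `load_dataset` function and pass them in; take the repository ID, split name, and column names from the dataset page rather than from this sketch.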

🧰 Projects

🔍 Search & Analysis

  • Purpose: Browse and search document collection
  • Key Features: Advanced search, filtering, bookmarking
  • Privacy: Browser-based interface

  • Purpose: Query documents using natural language
  • Key Features: AI-powered Q&A, conversational interface
  • Privacy: Configurable deployment

  • Purpose: Identify key documents by relevance
  • Key Features: Algorithm-based ranking, prioritized lists
  • Privacy: Local processing

  • Purpose: Discover investigative leads
  • Key Features: AI scoring (0-100), entity extraction, categorization
  • Privacy: Runs entirely offline

📊 Visualization

  • Purpose: Visual analysis and patterns
  • Key Features: Interactive charts, relationship mapping
  • Privacy: Client side processing

👋 Contributing

Contributions are always welcome!

Adding Your Project

  1. Fork this repository
  2. Add your project using this format:
#### [Project Name](url)
- **Purpose**: Brief description
- **Key Features**: Main capabilities
- **Privacy**: Privacy approach
  3. Submit a pull request
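
If it helps to avoid formatting slips, an entry in the format above can be generated with a few lines of Python. This is only a convenience sketch: `format_project_entry` is a hypothetical helper, and the project name, URL, and descriptions are placeholders.

```python
def format_project_entry(name: str, url: str, purpose: str, features: str, privacy: str) -> str:
    """Render one project entry in the contribution format shown above."""
    return "\n".join([
        f"#### [{name}]({url})",
        f"- **Purpose**: {purpose}",
        f"- **Key Features**: {features}",
        f"- **Privacy**: {privacy}",
    ])


# Placeholder values (hypothetical project, for illustration only).
entry = format_project_entry(
    name="Example Viewer",
    url="https://example.com/viewer",
    purpose="Browse documents by date",
    features="Timeline view, keyword filter",
    privacy="Runs locally",
)
print(entry)
```

Paste the printed block under the matching category heading before opening the pull request.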

Guidelines

  • ✅ Legitimate research/journalism purposes
  • ✅ Clear documentation
  • ✅ Open-source preferred
  • ✅ Transparent data handling

💎 Resources


Community-maintained • Not affiliated with any official investigation

About

Projects using EPSTEIN_FILES_20K on Hugging Face
