DocFlow is a privacy-first desktop application that automates file organization using a hybrid architecture of Static Rules and Local LLM (Llama 3).
DocFlow solves the problem of digital clutter without compromising data privacy. Unlike cloud-based solutions, this tool processes sensitive documents (contracts, CVs, financial reports) locally on your machine.
It utilizes a Waterfall Filtering System to optimize performance:
- Binary & Archives: Instantly sorted by extension (O(1) complexity).
- Media: Organized chronologically (Year/Month) using file metadata.
- Documents & Code: Analyzed by AI (Llama 3) for semantic understanding and context-aware sorting.
- Privacy First: No data leaves your computer. Powered by the local
llama3model. - Context-Aware Sorting: The AI reads the content of PDFs, DOCX, PPTX, and Code files to determine the best category (e.g., classifying a Python script as
Programming/Pythonor a Resume asCareer/CV). - Hybrid Architecture: Uses hardcoded rules for binary files to ensure speed, using AI only where necessary.
- Smart Media Handling: Automatically organizes photos and videos into
Year/Monthfolders based on creation date. - Intelligent Renaming: AI generates clean, descriptive filenames in
snake_casebased on document content. - Modern GUI: Built with
CustomTkinterfor a clean, dark-mode interface with real-time logging and multithreading.
The system processes files in the following priority order:
- Archives (
.zip,.rar) → Moved to/Archives. - Executables (
.exe,.msi) → Moved to/Apps. - Media (
.jpg,.mp4) → Metadata Extraction → Moved to/Media/2024/05_May. - Config Files (
.yml,.log) → Moved to/Others. - Documents (
.pdf,.docx,.py) → Text Extraction → Llama 3 Analysis → JSON Response → Moved to Semantic Category (e.g.,/Career/Contracts).
This application requires Ollama installed and running locally.
- Download Ollama from ollama.com.
- Pull the Llama 3 model:
ollama pull llama3
- Go to the Releases section.
- Download the latest
DocFlow.exe. - Run the application.
- Clone the repository:
git clone [https://github.com/YOUR_USERNAME/DocFlow.git](https://github.com/YOUR_USERNAME/DocFlow.git) cd DocFlow - Install dependencies:
pip install customtkinter ollama PyPDF2 python-pptx python-docx
- Run the app:
python app_gui.py
- Language: Python 3
- GUI: CustomTkinter
- AI Engine: Ollama (Llama 3)
- File Analysis:
python-docx,python-pptx,PyPDF2
Distributed under the MIT License. See LICENSE for more information.
