🧠 DocuLens

A complete AI-powered document analysis platform with a web interface and CLI tools

Transform your document collection into an intelligent, searchable knowledge base using AI. Upload documents, get instant analysis, and query your content with natural language.

🚀 Quick Start

Web Application (Recommended)

# 1. Install dependencies
pip install -r requirements.txt

# 2. Start the web server
python app.py

# 3. Open your browser (the open command is macOS-only; on other systems, visit the URL directly)
open http://localhost:3000

Command Line Interface

# Run the CLI analyzer
python document_analyzer.py

✨ Features

🌐 Web Application

📁 Document Upload: Drag & drop interface for .txt, .md, and .pdf files (up to 16MB)
📊 Real-time Analysis: Instant document statistics and keyword extraction
🔍 AI-Powered Search: Natural language querying with LLMware's capabilities
📈 Analytics Dashboard: Comprehensive document library statistics
🎨 Modern UI: Responsive Bootstrap-based interface with dark mode support
🤖 Model Integration: Access to LLMware's model catalog
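
The upload limits listed above (file types and the 16MB cap) could be enforced with a check along these lines. This is a minimal sketch, not the code in app.py: the function name `is_acceptable_upload` and both constants are hypothetical, and a Flask app would typically also set the `MAX_CONTENT_LENGTH` config value so oversized requests are rejected before they reach the handler.

```python
from pathlib import Path

# Assumed values, taken from the feature list: accepted types and the 16MB cap.
ALLOWED_EXTENSIONS = {".txt", ".md", ".pdf"}
MAX_UPLOAD_BYTES = 16 * 1024 * 1024


def is_acceptable_upload(filename: str, size_bytes: int) -> bool:
    """Return True if the file has an allowed extension and fits the size cap."""
    # Compare extensions case-insensitively so "NOTES.TXT" is accepted too.
    extension = Path(filename).suffix.lower()
    return extension in ALLOWED_EXTENSIONS and 0 < size_bytes <= MAX_UPLOAD_BYTES
```

For example, `is_acceptable_upload("report.pdf", 2_000_000)` passes, while a `.png` file or a 17MB PDF is rejected.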

💻 Command Line Interface

Document Parsing: Parse and analyze text documents
Batch Processing: Handle multiple documents simultaneously
Query Engine: Search through document collections
Statistics Export: Generate detailed analysis reports

🧠 AI Capabilities

Natural Language Processing: Understand context and meaning
Semantic Search: Find relevant content beyond keyword matching
Document Chunking: Intelligent text segmentation
Keyword Extraction: Automatic identification of important terms
Statistics Generation: Generate summary statistics for document libraries
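
To make the chunking and keyword extraction ideas concrete, here is a small standard-library sketch. It is an illustration only and does not reproduce how LLMware or this project segments text: the overlapping word-window strategy, the tiny stopword list, and the names `chunk_words` and `top_keywords` are all assumptions.

```python
import re
from collections import Counter
from typing import List

# A deliberately tiny, hypothetical stopword list; real systems use larger ones.
STOPWORDS = {"the", "and", "for", "with", "that", "this", "are", "from"}


def chunk_words(text: str, chunk_size: int = 100, overlap: int = 20) -> List[str]:
    """Split text into word windows of chunk_size that share overlap words."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once a window has reached the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks


def top_keywords(text: str, limit: int = 5) -> List[str]:
    """Return the most frequent non-stopword terms longer than two letters."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(t for t in tokens if len(t) > 2 and t not in STOPWORDS)
    return [word for word, _ in counts.most_common(limit)]
```

Overlapping windows keep a sentence that straddles a chunk boundary visible in both neighbours, which is one common reason to chunk this way before indexing.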

📁 Project Structure

llmware-project/
├── 🌐 Web Application
│   ├── app.py                    # Flask web server
│   ├── templates/
│   │   ├── base.html            # Base template
│   │   ├── index.html           # Homepage
│   │   ├── upload.html          # File upload
│   │   ├── analyze.html         # Analysis view
│   │   └── query.html           # Search interface
│   └── static/
│       ├── css/style.css        # Custom styling
│       └── js/main.js           # JavaScript utilities
├── 💻 Command Line Tools
│   └── document_analyzer.py     # CLI analyzer
├── 📄 Sample Documents
│   ├── ai_research.txt          # AI research paper
│   └── tech_trends.txt          # Tech trends report
├── ⚙️ Configuration
│   ├── requirements.txt         # Python dependencies
│   ├── README.md               # This file
│   └── WEB_APP_README.md       # Detailed web app docs
└── 📋 Other
    ├── uploads/                 # File upload storage
    └── test_app.py              # Web app tests

Installation

Install the required dependencies:

pip install -r requirements.txt

Usage

Run the document analyzer:

python document_analyzer.py

The script will:

1. Create a new LLMware library
2. Add all .txt files from the documents/ folder
3. Parse and analyze each document
4. Perform sample queries on the document collection
5. Display library statistics
6. List available LLMware models

Sample Output

The script provides detailed analysis including:

Word and character counts for each document
Keyword detection (AI, technology, computing, machine learning)
Document previews
Query results for various search terms
Library statistics
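
The per-document figures listed above can be approximated with a few lines of standard-library Python. This is a sketch under stated assumptions rather than the logic in document_analyzer.py: the function name `analyze_text`, the whole-word matching rule, and the 80-character preview length are hypothetical choices.

```python
import re
from typing import Dict

# Keywords taken from the list above; matching is case-insensitive.
KEYWORDS = ["AI", "technology", "computing", "machine learning"]


def analyze_text(text: str, preview_chars: int = 80) -> Dict[str, object]:
    """Return word count, character count, detected keywords, and a preview."""
    lowered = text.lower()
    # Match whole words only, so "AI" is not detected inside "said".
    found = [
        keyword
        for keyword in KEYWORDS
        if re.search(r"\b" + re.escape(keyword.lower()) + r"\b", lowered)
    ]
    return {
        "words": len(text.split()),
        "characters": len(text),
        "keywords": found,
        "preview": text[:preview_chars],
    }
```

Calling `analyze_text` on the contents of each file yields one small report per document, which can then be printed or aggregated.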

Adding Your Own Documents

To analyze your own documents:

1. Place .txt files in the documents/ directory
2. Run the script again
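
Before re-running the script, it can help to confirm which files will be picked up. The sketch below lists the .txt files in a folder; the helper name `find_text_files` is hypothetical, and the default folder name `documents` is taken from the step above, not from the script itself.

```python
from pathlib import Path
from typing import List


def find_text_files(folder: str = "documents") -> List[Path]:
    """Return the .txt files directly inside folder, sorted by path."""
    directory = Path(folder)
    if not directory.is_dir():
        # A missing folder simply means there is nothing to analyze yet.
        return []
    return sorted(
        path
        for path in directory.iterdir()
        if path.is_file() and path.suffix.lower() == ".txt"
    )
```

Printing `[p.name for p in find_text_files()]` from the project root shows the documents a run would be expected to include, assuming the script reads only that folder.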

LLMware Features Demonstrated

Library Management: Creating and managing document collections
Document Parsing: Processing text files and extracting content
Text Querying: Searching through document collections
Model Catalog: Accessing available LLMware models

Requirements

Python 3.7+
LLMware 0.4.2+
Dependencies listed in requirements.txt

License

This project is for educational and demonstration purposes.

HavelCS/DocuLens

Folders and files

Latest commit

History

Repository files navigation

🧠 DocuLens

🚀 Quick Start

Web Application (Recommended)

Command Line Interface

✨ Features

🌐 Web Application

💻 Command Line Interface

🧠 AI Capabilities

📁 Project Structure

Installation

Usage

Sample Output

Adding Your Own Documents

LLMware Features Demonstrated

Requirements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages