A complete AI-powered document analysis platform with web interface and CLI tools
Transform your document collection into an intelligent, searchable knowledge base using AI. Upload documents, get instant analysis, and query your content with natural language.
# 1. Install dependencies
pip install -r requirements.txt
# 2. Start the web server
python app.py
# 3. Open your browser
open http://localhost:3000
# Run the CLI analyzer
python document_analyzer.py
- 📁 Document Upload: Drag & drop interface for .txt, .md, and .pdf files (up to 16MB)
- 📊 Real-time Analysis: Instant document statistics and keyword extraction
- 🔍 AI-Powered Search: Natural language querying with LLMware's capabilities
- 📈 Analytics Dashboard: Comprehensive document library statistics
- 🎨 Modern UI: Responsive Bootstrap-based interface with dark mode support
- 🤖 Model Integration: Access to LLMware's model catalog
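The upload constraints listed above (three file types, 16MB cap) can be sketched as a small check. This is a minimal illustration using only the standard library, not the code in `app.py`; the names `ALLOWED_EXTENSIONS`, `MAX_UPLOAD_BYTES`, and `validate_upload` are hypothetical.

```python
from pathlib import Path
from typing import Tuple

# Assumed constants mirroring the limits stated in the feature list.
ALLOWED_EXTENSIONS = {".txt", ".md", ".pdf"}
MAX_UPLOAD_BYTES = 16 * 1024 * 1024  # 16MB


def validate_upload(filename: str, size_bytes: int) -> Tuple[bool, str]:
    """Return (accepted, reason) for a candidate upload (hypothetical helper)."""
    suffix = Path(filename).suffix.lower()  # case-insensitive extension check
    if suffix not in ALLOWED_EXTENSIONS:
        return False, f"unsupported file type: {suffix or '(none)'}"
    if size_bytes > MAX_UPLOAD_BYTES:
        return False, "file exceeds the 16MB limit"
    return True, "ok"
```

In a Flask app the size cap is usually enforced with the `MAX_CONTENT_LENGTH` config value as well, so oversized requests are rejected before the file is read.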
- Document Parsing: Parse and analyze text documents
- Batch Processing: Handle multiple documents simultaneously
- Query Engine: Search through document collections
- Statistics Export: Generate detailed analysis reports
- Natural Language Processing: Understand context and meaning
- Semantic Search: Find relevant content beyond keyword matching
- Document Chunking: Intelligent text segmentation
- Keyword Extraction: Automatic identification of important terms
- Statistics Generation: Generate summary statistics for document libraries
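To make "document chunking" concrete, here is a rough sketch of sentence-based segmentation. It is a simplified stand-in, not LLMware's own parser; the function name `chunk_text` and the `max_words` parameter are assumptions for illustration.

```python
import re
from typing import List


def chunk_text(text: str, max_words: int = 50) -> List[str]:
    """Group whole sentences into chunks of at most max_words words.

    A single sentence longer than max_words is kept whole in its own chunk.
    """
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current: List[str] = []
    count = 0
    for sentence in sentences:
        n = len(sentence.split())
        # Start a new chunk when adding this sentence would exceed the limit.
        if current and count + n > max_words:
            chunks.append(" ".join(current))
            current, count = [], 0
        current.append(sentence)
        count += n
    if current:
        chunks.append(" ".join(current))
    return chunks
```

Splitting on sentence boundaries rather than fixed character offsets keeps each chunk readable on its own, which tends to help search results.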
llmware-project/
├── 🌐 Web Application
│   ├── app.py                    # Flask web server
│   ├── templates/
│   │   ├── base.html             # Base template
│   │   ├── index.html            # Homepage
│   │   ├── upload.html           # File upload
│   │   ├── analyze.html          # Analysis view
│   │   └── query.html            # Search interface
│   └── static/
│       ├── css/style.css         # Custom styling
│       └── js/main.js            # JavaScript utilities
├── 💻 Command Line Tools
│   └── document_analyzer.py      # CLI analyzer
├── 📄 Sample Documents
│   ├── ai_research.txt           # AI research paper
│   └── tech_trends.txt           # Tech trends report
├── ⚙️ Configuration
│   ├── requirements.txt          # Python dependencies
│   ├── README.md                 # This file
│   └── WEB_APP_README.md         # Detailed web app docs
└── 📋 Other
    ├── uploads/                  # File upload storage
    └── test_app.py               # Web app tests
- Install the required dependencies:
pip install -r requirements.txt
- Run the document analyzer:
python document_analyzer.py
The script will:
- Create a new LLMware library
- Add all .txt files from the documents/ folder
- Parse and analyze each document
- Perform sample queries on the document collection
- Display library statistics
- List available LLMware models
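The steps above can be sketched roughly as follows. This is not the contents of `document_analyzer.py`; it is an outline based on llmware's `Library`, `Query`, and `ModelCatalog` classes as I understand them, so check the calls against the version you have installed. The function name `run_pipeline`, the library name, and the query terms are assumptions.

```python
def run_pipeline(library_name: str = "demo_library", folder: str = "documents") -> None:
    """Create a library, ingest a folder, run sample queries, print stats."""
    # Imported lazily so this module still loads where llmware is not installed.
    from llmware.library import Library
    from llmware.retrieval import Query
    from llmware.models import ModelCatalog

    # Create a new library and parse the files in the folder.
    # Note: add_files ingests every supported file type it finds, not only .txt.
    library = Library().create_new_library(library_name)
    library.add_files(input_folder_path=folder)

    # Sample text queries (the terms here are illustrative).
    for term in ("AI", "machine learning"):
        results = Query(library).text_query(term, result_count=5)
        print(f"{term!r}: {len(results)} results")

    # Library statistics and the model catalog.
    print(library.get_library_card())
    models = ModelCatalog().list_all_models()
    print(f"{len(models)} models in the catalog")
```

Calling `run_pipeline()` from the project root would mirror the script's flow, assuming llmware is installed and its default data store is available.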
The script provides detailed analysis including:
- Word and character counts for each document
- Keyword detection (AI, technology, computing, machine learning)
- Document previews
- Query results for various search terms
- Library statistics
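The per-document figures listed above (word and character counts, keyword detection, previews) can be approximated with the standard library alone. This is a minimal sketch, not the script's actual implementation; `analyze_text`, `KEYWORDS`, and `preview_chars` are hypothetical names.

```python
import re
from typing import Dict

# Keywords named in the list above.
KEYWORDS = ("AI", "technology", "computing", "machine learning")


def analyze_text(text: str, preview_chars: int = 80) -> Dict[str, object]:
    """Compute simple statistics for one document (hypothetical helper)."""
    lowered = text.lower()
    # Count case-insensitive whole-word (or whole-phrase) matches per keyword.
    keyword_counts = {
        kw: len(re.findall(r"\b" + re.escape(kw.lower()) + r"\b", lowered))
        for kw in KEYWORDS
    }
    return {
        "words": len(text.split()),
        "characters": len(text),
        "keywords": keyword_counts,
        "preview": text[:preview_chars],
    }
```

The word-boundary match keeps "AI" from being counted inside words such as "again" or "maintain".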
To analyze your own documents:
- Place .txt files in the documents/ directory
- Run the script again
- Library Management: Creating and managing document collections
- Document Parsing: Processing text files and extracting content
- Text Querying: Searching through document collections
- Model Catalog: Accessing available LLMware models
- Python 3.7+
- LLMware 0.4.2+
- Dependencies listed in requirements.txt
This project is for educational and demonstration purposes.