A tool for analyzing and comparing Excel files. It highlights differences and tracks changes over time, and aims to be faster and more intuitive than writing comparison formulas in Excel.
- Side-by-side comparison of two Excel files
- Color-coded highlighting of differences and matches
- Detailed difference reports showing exactly what changed
- Multiple comparison methods: exact, fuzzy, and numeric threshold
- Natural language queries for intuitive analysis
- Track changes across multiple versions of Excel files over time
- Cell-level change history showing how each value evolved
- Timeline visualization of match percentages across versions
- Identify most frequently changed columns and patterns
- Comprehensive change reports for auditing and analysis
- ✅ Detects fields present in one file but not the other
- ✅ Handles numeric differences with configurable thresholds
- ✅ Text matching with fuzzy comparison for variations
- ✅ Missing value detection and handling
- ✅ Export results to Excel with detailed breakdowns
- ✅ Visual charts and statistics
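The three comparison methods listed above (exact, fuzzy, and numeric threshold) can be illustrated with a minimal sketch. This is not the application's actual code: the function name `compare_cells`, its parameters, and the default values are assumptions made for illustration, and fuzzy matching is approximated here with the standard library's `difflib.SequenceMatcher`. Missing values are represented as `None` (pandas would typically use `NaN` instead).

```python
from difflib import SequenceMatcher
from numbers import Number


def compare_cells(a, b, method="exact", threshold=0.01, min_ratio=0.85):
    """Return True if two cell values count as a match under the chosen method.

    Hypothetical helper: names and defaults are illustrative assumptions.
    """
    # Two missing values match; a value never matches a missing one.
    if a is None or b is None:
        return a is None and b is None

    if method == "exact":
        return a == b

    if method == "numeric":
        # Numbers match when their absolute difference is within the threshold.
        if isinstance(a, Number) and isinstance(b, Number):
            return abs(a - b) <= threshold
        return a == b

    if method == "fuzzy":
        # Text matches when the similarity ratio (0.0 to 1.0) is high enough.
        left, right = str(a).strip().lower(), str(b).strip().lower()
        return SequenceMatcher(None, left, right).ratio() >= min_ratio

    raise ValueError(f"unknown method: {method}")


# Example calls on individual cell values.
print(compare_cells(10.0, 10.005, method="numeric"))
print(compare_cells("Acme Corp", "acme corp ", method="fuzzy"))
print(compare_cells("Acme Corp", "acme corp ", method="exact"))
```

An absolute threshold suits prices and counts; for values that span several orders of magnitude, a relative tolerance (for example via `math.isclose`) may be a better fit.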
- Install dependencies: `pip install -r requirements.txt`
- Run the application: `streamlit run app.py`
- Upload your files:
  - Choose "Two Files" mode for standard comparison
  - Choose "Multiple Files (3+)" mode to track changes over time
- Analyze:
  - View highlighted differences
  - Explore change history
  - Export comprehensive reports
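To give a rough idea of what a "Two Files" comparison might compute, here is a small sketch under stated assumptions. It is not taken from `app.py`: the `diff_tables` function, the `sku` key column, and the sample rows are all hypothetical, and rows are plain dicts rather than a pandas DataFrame so the example runs without extra dependencies.

```python
def columns_of(rows):
    """Collect every column name that appears in a list of row dicts."""
    cols = set()
    for row in rows:
        cols.update(row)
    return cols


def diff_tables(left, right, key):
    """Compare two lists of row dicts that share a key column (hypothetical helper)."""
    left_cols, right_cols = columns_of(left), columns_of(right)
    shared = sorted((left_cols & right_cols) - {key})
    left_by_key = {row[key]: row for row in left}
    right_by_key = {row[key]: row for row in right}

    changes = []
    compared = matched = 0
    # Only rows present in both tables are compared cell by cell.
    for k in sorted(left_by_key.keys() & right_by_key.keys()):
        for col in shared:
            old, new = left_by_key[k].get(col), right_by_key[k].get(col)
            compared += 1
            if old == new:
                matched += 1
            else:
                changes.append((k, col, old, new))

    return {
        "only_in_left": sorted(left_cols - right_cols),
        "only_in_right": sorted(right_cols - left_cols),
        "changes": changes,
        "match_pct": 100.0 * matched / compared if compared else 100.0,
    }


# Made-up sample data for illustration only.
source = [
    {"sku": "A1", "price": 9.99, "stock": 5},
    {"sku": "B2", "price": 4.50, "stock": 0},
]
target = [
    {"sku": "A1", "price": 10.49, "supplier": "Acme"},
    {"sku": "B2", "price": 4.50, "supplier": "Bolt"},
]
report = diff_tables(source, target, key="sku")
print(report)
```

Here `stock` is reported as present only in the first table, `supplier` only in the second, and the single shared column `price` differs for one of the two rows.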
- Version Control: Track how data changes across monthly/quarterly updates
- Data Validation: Compare source and target files to identify discrepancies
- Audit Trails: Monitor modifications to sensitive data over time
- Quality Assurance: Validate data transformations and migrations
- Price Tracking: Monitor product price changes across catalogs
- Inventory Management: Track stock level changes over time
For detailed feature documentation, see FEATURES.md
- pandas: Data manipulation and analysis
- openpyxl: Excel file processing
- streamlit: Interactive web interface
- plotly: Beautiful visualizations and charts
- No formula writing required - intuitive point-and-click interface
- Visual highlighting - instantly see what changed
- Multi-file tracking - follow changes across 3+ files, which is impractical with Excel formulas alone
- Comprehensive reports - automatic generation of detailed analysis
- Timeline views - understand change patterns over time
- Natural language - ask questions in plain English
Compare three monthly product catalogs:
- Upload `products_jan.xlsx`, `products_feb.xlsx`, `products_mar.xlsx`
- Click "Compare All Versions"
- See exactly which products changed, when, and how
- Export a complete change report
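The idea behind the catalog example above, a cell-level history across versions, can be sketched as follows. This is an illustrative approximation rather than the tool's implementation: `cell_history`, the `sku` and `price` columns, and the sample values are hypothetical, and each catalog is an in-memory list of dicts standing in for rows that would normally be loaded from the `.xlsx` files.

```python
def cell_history(versions, key, column):
    """Map each key to the (label, value) pairs at which its value changed.

    `versions` is an ordered list of (label, rows) pairs, oldest first.
    Hypothetical helper for illustration.
    """
    history = {}
    for label, rows in versions:
        for row in rows:
            value = row.get(column)
            entries = history.setdefault(row[key], [])
            # Record the first value seen and every later change.
            if not entries or entries[-1][1] != value:
                entries.append((label, value))
    return history


# Made-up catalog data standing in for the three monthly files.
versions = [
    ("jan", [{"sku": "A1", "price": 9.99}, {"sku": "B2", "price": 4.50}]),
    ("feb", [{"sku": "A1", "price": 10.49}, {"sku": "B2", "price": 4.50}]),
    ("mar", [{"sku": "A1", "price": 10.49}, {"sku": "B2", "price": 3.99},
             {"sku": "C3", "price": 7.00}]),
]

history = cell_history(versions, key="sku", column="price")

# Products whose price changed at least once after first appearing.
changed = {sku: h for sku, h in history.items() if len(h) > 1}
for sku, entries in changed.items():
    print(sku, entries)
```

Storing only the points of change keeps the history compact: a product whose price never moves has a single entry, while `C3`, which first appears in March, is recorded as new rather than changed.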
Contributions are welcome! Please feel free to submit a Pull Request.
This project is open source and available under the MIT License.