A Python application for tagging and categorizing images.
This project provides functionality for automatically or manually tagging images with relevant labels and categories.
- Clone the repository:
git clone <repository-url>
cd ImageTagging- Create a virtual environment (recommended):
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate- Install dependencies:
pip install -r requirements.txtBefore building any models, it's recommended to run the EDA pipeline to understand your data:
python run_eda.pyThis will:
- Load all JSON files from
entities_dataset_v2/ - Analyze schema and data structure
- Analyze tag frequencies and distributions
- Analyze image-level statistics
- Generate visualizations
- Save summary reports to
eda_output/
Outputs:
- Console reports with key insights
- CSV files with detailed statistics (tag frequencies, co-occurrences, etc.)
- JSON files with summary statistics
- Visualization plots (PNG files)
Run the main application:
python main.pyImageTagging/
├── main.py # Main entry point
├── run_eda.py # EDA pipeline script
├── requirements.txt # Python dependencies
├── entities_dataset_v2/ # JSON dataset files
├── eda/ # EDA modules
│ ├── __init__.py
│ ├── data_loader.py # Data loading and normalization
│ ├── schema_analyzer.py # Schema analysis
│ ├── tag_analyzer.py # Tag/entity analysis
│ ├── image_analyzer.py # Image-level analysis
│ └── visualizer.py # Visualization generation
├── eda_output/ # Generated EDA outputs (created after running)
└── README.md # This file
Add your image tagging logic in main.py or create additional modules as needed.
[Add your license here]
[Add contribution guidelines here]