-
Notifications
You must be signed in to change notification settings - Fork 3
Katie Zelvin edited this page Aug 6, 2024
·
3 revisions
- Object Detection with YOLOv8 model for accurate detection and cropping of equations from document images: Enhanced efficiency and precision in image processing, ensuring all equations are correctly identified and extracted.
- Image to LaTeX conversion with integration of MathPix OCR: Streamlined extraction and conversion process, ensuring precise translation of equations into text.
- Similarity Calculation using a fuzzy matching approach leveraging Python-Levenshtein: Enhanced text comparison and matching accuracy.
- yolov8m.pt: Final weights for fine-tuned YoloV8 model.
- eqn-detect.yaml: YAML file with configuration for fine-tuning YoloV8 model (assumes binary labels, big-eqn i.e. LaTeX math display mode, and inline-eqn).
- yolov8_training.py: Python script to load pre-trained model and fine-tune using YAML file configuration.
- cropped_data: Contains cropped equations detected by fine-tuned YoloV8 model.
- runs/detect/train: Data generated by YoloV8 during fine-tuning.
- datasets/eqn-images-dataset/labels: Custom dataset of our hand-labelled equations (contains bounding box coordinates in YoloV8 formatting). Can be directly used for fine-tuning YoloV8 model.