Stars

Vision

27 repositories

Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)

Python 982 63 Updated Mar 7, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 20,643 1,332 Updated Feb 27, 2025

Ultimate camera streaming application with support for RTSP, RTMP, HTTP-FLV, WebRTC, MSE, HLS, MP4, MJPEG, HomeKit, FFmpeg, etc.

Go 8,455 615 Updated Mar 13, 2025

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 9,701 2,275 Updated Nov 20, 2024

Implementation of YOLO v11 in C++17 over OpenCV and ONNX Runtime

Kotlin 14 3 Updated Mar 11, 2025

Inference for all YOLOv8 model types, YOLOv9 object detection models, and the full YOLO11 model series using the DNN module in OpenCV

C++ 73 7 Updated Oct 10, 2024

Multi-Object Tracking with Ultralytics YOLO11

Roff 10 2 Updated Oct 5, 2024

The YOLOv11 C++ TensorRT Project in C++ and optimized using NVIDIA TensorRT

C++ 69 6 Updated Oct 13, 2024

NVR with realtime local object detection for IP cameras

TypeScript 21,509 1,980 Updated Mar 13, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,140 626 Updated Feb 10, 2025

Witness the aha moment of VLM with less than $3.

Python 3,235 254 Updated Mar 1, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 623 42 Updated Mar 12, 2025

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 1,977 109 Updated Mar 14, 2025

🖼️ Image Toolbox is a powerful app for advanced image manipulation. It offers dozens of features, from basic tools like crop and draw to filters, OCR, and a wide range of image processing options

Kotlin 5,762 262 Updated Mar 14, 2025

Vision agent

Python 4,270 468 Updated Mar 13, 2025

This repository offers a TensorFlow-based anomaly detection system for cell images using adversarial autoencoders, capable of identifying anomalies even in contaminated datasets. Check out our code…

Jupyter Notebook 9 Updated Jun 17, 2024

Train and test image anomaly detection models with Anomalib. Examples on a custom dataset

Python 14 Updated May 5, 2024

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 4,122 716 Updated Mar 6, 2025

Anomaly detection on images using features from pretrained neural networks.

Jupyter Notebook 75 21 Updated Jul 15, 2024

image anomaly detection

Python 39 11 Updated Nov 22, 2021

PyTorch implementation of "Sub-Image Anomaly Detection with Deep Pyramid Correspondences"

Python 242 44 Updated Dec 27, 2022

A Vision Transformer Network for Image Anomaly Detection and Localization

Python 113 26 Updated Sep 16, 2021

SimpleNet: A Simple Network for Image Anomaly Detection and Localization

Python 32 8 Updated Jul 3, 2023

YOLOv12: Attention-Centric Real-Time Object Detectors

Python 1,172 144 Updated Mar 10, 2025

YOLOv12 Inference Using CPP and ONNX Runtime

C++ 18 2 Updated Feb 26, 2025

OCR & Document Extraction using vision models

TypeScript 10,273 674 Updated Mar 13, 2025