Curated educational list for computer vision

Reference Books

Online Courses

Uni Courses

YouTube Playlists

Python Libraries

MATLAB/Octave Libraries

Repos

Dataset Collections

Task Management Tools

Annotation Tools

Awesome Lists

Misc

Conferences

Journals

Summer Schools

YouTube Channels

Mailing Lists

Reference Books

Book	Links
Nixon, Mark, and Alberto Aguado. “Feature extraction and image processing for computer vision.” Academic press, (2019).	goodreads
González, Rafael Corsino and Richard E. Woods. “Digital image processing, 4th Edition.” (2018).	goodreads
Prince, Simon. “Computer Vision: Models, Learning, and Inference.” (2012).	goodreads
Forsyth, David Alexander and Jean Ponce. “Computer Vision - A Modern Approach, Second Edition.” (2011).	goodreads
Szeliski, Richard. “Computer Vision - Algorithms and Applications.” Texts in Computer Science (2010).	goodreads
Bishop, Charles M.. “Pattern recognition and machine learning, 5th Edition.” Information science and statistics (2007).	goodreads
Harltey, Andrew and Andrew Zisserman. “Multiple view geometry in computer vision (2. ed.).” (2003).	goodreads
Stockman, George C. and Linda G. Shapiro. “Computer Vision.” (2001).	goodreads

Online Courses

Course	Tags	Platform
Version Control with Git	`Git`	Udacity
Git Essential Training	`Git`	LinkedIn Learning
Learning GitHub	`Git`	LinkedIn Learning
Introduction to Python Programming	`Programming`	Udacity
Learning Python	`Programming`	LinkedIn Learning
Intro to Data Science	`Data Science`	Udacity
Intro to Data Analysis	`Data Science`	Udacity
Python Data Analysis	`Data Science`	LinkedIn Learning
Segmentation and Clustering	`Data Science`	Udacity
Python for Data Science Essential Training Part 1	`Data Science`	LinkedIn Learning
Python for Data Science Essential Training Part 2	`Data Science`	LinkedIn Learning
Introduction to Machine Learning Course	`Machine Learning`	Udacity
Machine Learning with Scikit-Learn	`Machine Learning`	LinkedIn Learning
Intro to Deep Learning with PyTorch	`Deep Learning`	Udacity
Introduction to Computer Vision	`Computer Vision`	Udacity
OpenCV for Python Developers	`Computer Vision`	LinkedIn Learning

Uni Courses

Course	Tags	University
Introduction to Computer Vision	`Computer Vision`	Brown
Advances in Computer Vision	`Computer Vision`	MIT
Deep Learning for Computer Vision	`Computer Vision` `Deep Learning`	Stanford

YouTube Playlists

Course	Year	Instructor	University
Computer Vision	2021	Andreas Geiger	University of Tübingen
Computer Vision	2021	Yogesh S Rawat / Mubarak Shah	University of Central Florida
Advanced Computer Vision	2021	Mubarak Shah	University of Central Florida
Deep Learning for Computer Vision	2020	Justin Johnson	University of Michigan
Advanced Deep Learning for Computer Vision	2020	Laura Leal-Taixé / Matthias Niessner	Technical University of Munich
Introduction to Digital Image Processing	2020	Ahmadreza Baghaie	New York Institute of Technology
Quantitative Imaging	2019	Kevin Mader	ETH Zurich
Convolutional Neural Networks for Visual Recognition	2017	Fei-Fei Li	Stanford University
Introduction to Digital Image Processing	2015	Rich Radke	Rensselaer Polytechnic Institute
Machine Learning for Robotics and Computer Vision	2014	Rudolph Triebel	Technical University of Munich
Multiple View Geometry	2013	Daniel Cremers	Technical University of Munich
Variational Methods for Computer Vision	2013	Daniel Cremers	Technical University of Munich
Computer Vision	2012	Mubarak Shah	University of Central Florida
Image and video processing	-	Guillermo Sapiro	Duke University

Python Libraries

Library	Description
OpenCV	Open Source Computer Vision Library
Pillow	The friendly PIL fork (Python Imaging Library)
scikit-image	collection of algorithms for image processing
SciPy	open-source software for mathematics, science, and engineering
mmcv	OpenMMLab foundational library for computer vision research
imutils	A series of convenience functions to make basic image processing operations
pgmagick	python based wrapper for GraphicsMagick/ImageMagick
Mahotas	library of fast computer vision algorithms (last updated: 2021)
SimpleCV	The Open Source Framework for Machine Vision (last updated: 2015)

MATLAB Libraries

Library	Description
PMT	Piotr's Computer Vision Matlab Toolbox
matlabfns	MATLAB and Octave Functions for Computer Vision and Image Processing, P. Kovesi, University of Western Australia
VLFeat	open source library implements popular computer vision algorithms, A. Vedaldi and B. Fulkerson
MLV	Mid-level Vision Toolbox (MLVToolbox), BWLab, University of Toronto
ElencoCode	Loris Nanni's CV functions, University of Padova

Repos

Tags: Object Classification `[ObjCls]`, Object Detection `[ObjDet]`, Object Segmentation `[ObjSeg]`, General Library `[GenLib]`, Text Reading / Object Character Recognition `[OCR]`, Action Recognition `[ActRec]`, Object Tracking `[ObjTrk]`, Data Augmentation `[DatAug]`, Simultaneous Localization and Mapping `[SLAM]`, Outlier/Anomaly/Novelty Detection `[NvlDet]`, Content-based Image Retrieval `[CBIR]`, Image Enhancement `[ImgEnh]`, Aesthetic Assessment `[AesAss]`, Explainable Artificial Intelligence `[XAI]`, Text-to-Image Generation `[TexImg]`, Pose Estimation `[PosEst]`, Video Matting `[VidMat]`, Eye Tracking `[EyeTrk]`

Repo	Tags	Description
computervision-recipes	`[GenLib]`	Microsoft, Best Practices, code samples, and documentation for Computer Vision
FastAI	`[GenLib]`	FastAI, Library over PyTorch used for learning and practicing machine learning and deep learning
pytorch-lightning	`[GenLib]`	PyTorchLightning, Lightweight PyTorch wrapper for high-performance AI research
ignite	`[GenLib]`	PyTorch, High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently
pytorch_geometric	`[GenLib]`	Graph Neural Network Library for PyTorch
kornia	`[GenLib]`	Open Source Differentiable Computer Vision Library
ncnn	`[GenLib]`	Tencent, High-performance neural network inference framework optimized for the mobile platform
MediaPipe	`[ObjDet]` `[ObjSeg]` `[ObjTrk]` `[GenLib]`	Google, iOS - Andriod - C++ - Python - Coral, Face Detection - Face Mesh - Iris - Hands - Pose - Holistic - Hair Segmentation - Object Detection - Box Tracking - Instant Motion Tracking - Objectron - KNIFT (Similar to SIFT)
PyTorch image models	`[ObjCls]`	rwightman, PyTorch image classification models, scripts, pretrained weights
mmclassification	`[ObjCls]`	OpenMMLab, Image Classification Toolbox and Benchmark
vit-pytorch	`[ObjCls]`	SOTA for vision transformers
face_classification	`[ObjCls]` `[ObjDet]`	Real-time face detection and emotion/gender classification
mmdetection	`[ObjDet]`	OpenMMLab, Image Detection Toolbox and Benchmark
detectron2	`[ObjDet]` `[ObjSeg]`	Facebook, FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks
detr	`[ObjDet]`	Facebook, End-to-End Object Detection with Transformers
libfacedetection	`[ObjDet]`	An open source library for face detection in images, speed: ~1000FPS
FaceDetection-DSFD	`[ObjDet]`	Tencent, SOTA face detector
object-Detection-Metrics	`[ObjDet]`	Most popular metrics used to evaluate object detection algorithms
SAHI	`[ObjDet]` `[ObjSeg]`	A lightweight vision library for performing large scale object detection/ instance segmentation
yolov5	`[ObjDet]`	ultralytics
AlexeyAB/darknet pjreddie/darknet	`[ObjDet]`	YOLOv4 / Scaled-YOLOv4 / YOLOv3 / YOLOv2
U-2-Net	`[ObjDet]`	ultralytics U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection
segmentation_models.pytorch	`[ObjSeg]`	qubvel, PyTorch segmentation models with pretrained backbones
mmsegmentation	`[ObjSeg]`	OpenMMLab, Semantic Segmentation Toolbox and Benchmark
mmocr	`[OCR]`	OpenMMLab, Text Detection, Recognition and Understanding Toolbox
pytesseract	`[OCR]`	A Python wrapper for Google Tesseract
EasyOCR	`[OCR]`	Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc
PaddleOCR	`[OCR]`	Practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices
mmtracking	`[ObjTrk]`	OpenMMLab, Video Perception Toolbox for object detection and tracking
mmaction	`[ActRec]`	OpenMMLab, An open-source toolbox for action understanding based on PyTorch
albumentations	`[DatAug]`	Fast image augmentation library and an easy-to-use wrapper around other libraries
ORB_SLAM2	`[SLAM]`	Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities
pyod	`[NvlDet]`	Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
imagededup	`[CBIR]`	Image retrieval, CBIR, Find duplicate images made easy!
image-match	`[CBIR]`	Image retrieval, CBIR, Quickly search over billions of images
Bringing-Old-Photos-Back-to-Life	`[ImgEnh]`	Microsoft, Bringing Old Photo Back to Life (CVPR 2020 oral)
image-quality-assessment	`[AesAss]`	Idealo, Image Aesthetic, NIMA model to predict the aesthetic and technical quality of images
aesthetics	`[AesAss]`	Image Aesthetics Toolkit using Fisher Vectors
pytorch-cnn-visualizations	`[XAI]`	Pytorch implementation of convolutional neural network visualization techniques
DALLE2-pytorch	`[TexImg]`	Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
imagen-pytorch	`[TexImg]`	Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
openpose	`[PosEst]`	OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
RobustVideoMatting	`[VidMat]`	Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
fastudp	`[NvlDet]` `[CBIR]`	An unsupervised and free tool for image and video dataset analysis
Random-Erasing	`[DatAug]`	Random Erasing Data Augmentation in PyTorch
CutMix-PyTorch	`[DatAug]`	Official Pytorch implementation of CutMix regularizer
keras-cv	`[GenLib]`	Library of modular computer vision oriented Keras components
PsychoPy	`[EyeTrk]`	Library for running psychology and neuroscience experiments

Dataset Collections

PyTorch - CV Datasets, Meta
Tensorflow - CV Datasets, Google
CVonline: Image Databases, Edinburgh University, Thanks to Robert Fisher!
Yet Another Computer Vision Index To Datasets (YACVID), Thanks to Hayko Riemenschneider!
Kaggle
PaperWithCode, Meta
RoboFlow
VisualData
CUHK Computer Vision
VGG - University of Oxford

Task Management Tools

MLflow, Platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry
Apache Airflow, Apache/AirBnB, Platform created by the community to programmatically author, schedule and monitor workflows
Ploomber, fastest way to build data pipelines.

Annotation Tools

VoTT, Microsoft, Visual Object Tagging Tool: An electron app for building end to end Object Detection Models from Images and Videos
labelme, Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation)
labelImg, Graphical image annotation tool and label object bounding boxes in images
VIA, VGG Oxford, HTML-based standalone manual annotation software for image, audio and video
FiftyOne, open-source tool for building high-quality datasets and computer vision models

Awesome Lists

anomaly-detection-resources, Anomaly detection related books, papers, videos, and toolboxes
awesome-satellite-imagery-datasets List of satellite image training datasets with annotations for computer vision and deep learning
awesome-Face_Recognition, Computer vision papers about faces.
the-incredible-pytorch, Curated list of tutorials, papers, projects, communities and more relating to PyTorch

Misc

How to build a good poster - [Link1] [Link2] [Link3]
How to report a good report - [Link1] [link2]
The "Python Machine Learning (3rd edition)" book code repository
Multithreading with OpenCV-Python to improve video processing performance
Computer Vision Zone - Videos and implementations for computer vision projects
MadeWithML, Learn how to responsibly deliver value with ML
d2l-en, Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 200 universities
Writing Pet Peeves, writing guide for correctness, references, and style
Hitchhiker's Guide to Python, Python best practices guidebook, written for humans
python-fire, Google, a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
shotcut, a free, open source, cross-platform video editor.
PyTorch Computer Vision Cookbook, PyTorch Computer Vision Cookbook, Published by Packt.
Machine Learning Mastery - Blogs, Blogs written by Jason Brownlee about machine learning.
PyImageSearch - Blogs, Blogs written by Adrian Rosebrock about computer vision.
jetson-inference, guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

Conferences

CORE Rank A:
- ICCV: International Conference on Computer Vision (IEEE)
- CVPR: Conference on Computer Vision and Pattern Recognition (IEEE)
- ECCV: European Conference on Computer Vision (Springer)
- WACV: Winter Conference/Workshop on Applications of Computer Vision (IEEE)
- ICASSP: International Conference on Acoustics, Speech, and Signal Processing (IEEE)
- MICCAI: Conference on Medical Image Computing and Computer Assisted Intervention (Springer)
- IROS: International Conference on Intelligent Robots and Systems (IEEE)
- ACMMM: ACM International Conference on Multimedia (ACM)
CORE Rank B
- ACCV: Asian Conference on Computer Vision (Springer)
- VCIP: International Conference on Visual Communications and Image Processing (IEEE)
- ICIP: International Conference on Image Processing (IEEE)
- CAIP: International Conference on Computer Analysis of Images and Patterns (Springer)
- VISAPP: International Conference on Vision Theory and Applications (SCITEPRESS)
- ICPR: International Conference on Pattern Recognition (IEEE)
- ACIVS: Conference on Advanced Concepts for Intelligent Vision Systems (Springer)
- EUSIPCO: European Signal Processing Conference (IEEE)
- ICRA: International Conference on Robotics and Automation (IEEE)
- BMVC: British Machine Vision Conference (organized by BMVA: British Machine Vision Association and Society for Pattern Recognition)
CORE Rank C:
- ICISP: International Conference on Image and Signal Processing (Springer)
- ICIAR: International Conference on Image Analysis and Recognition (Springer)
- ICVS: International Conference on Computer Vision Systems (Springer)
Unranked but popular
- MIUA: Conference on Medical Image Understanding and Analysis (organized by BMVA: British Machine Vision Association and Society for Pattern Recognition)
- EUVIP: European Workshop on Visual Information Processing (IEEE, organized by EURASIP: European Association for Signal Processing)
- CIC: Color and Imaging Conference (organized by IS&T: Society for Imaging Science and Technology)
- CVCS: Colour and Visual Computing Symposium
- DSP: International Conference on Digital Signal Processing

Journals

Tier 1
- IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI)
- IEEE Transactions on Image Processing (IEEE TIP)
- IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT)
- Springer International Journal of Computer Vision (Springer IJCV)
- Elsevier Pattern Recognition (Elsevier PR)
- Elsevier Computer Vision and Image Understanding (Elsevier CVIU)
- Elsevier Expert Systems with Applications
- Elsevier Neurocomputing, Springer Neural Computing and Applications
Tier 2
- Elsevier Image and Vision Computing (Elsevier IVC)
- Elsevier Pattern Recognition Letters (Elsevier PR Letters)
- Elsevier Journal of Visual Communication and Image Representation
- Springer Journal of Mathematical Imaging and Vision
- SPIE Journal of Electronic Imaging
- IET Image Processing
- Springer Pattern Analysis and Applications (Springer PAA)
- Springer Machine Vision and Applications (Springer MVA)
- IET Computer Vision
Open Access
- IEEE Access
- MDPI Journal of Imaging

Summer Schools

International Computer Vision Summer School (IVCSS) [2007-Present], Sicily, Italy [2023]
Machine Intelligence and Visual Computing Summer School (VISUM) [2013-2022], Porto, Portugal [2022]
BMVA British Computer Vision Summer School (CVSS) [2013-2020,2023], UK [Website]

YouTube Channels

@AurelienGeron [Individual], Aurélien Géron: former lead of YouTube's video classification team, and author of the O'Reilly book Hands-On Machine Learning with Scikit-Learn and TensorFlow.
@howardjeremyp [Individual], Jeremy Howard: former president and chief scientist of Kaggle, and co-founder of fast.ai.
@PieterAbbeel [Individual], Pieter Abbeel: professor of electrical engineering and computer sciences, University of California, Berkeley.
@pascalpoupart3507 [Individual], Pascal Poupart: professor in the David R. Cheriton School of Computer Science at the University of Waterloo.
@MatthiasNiessner [Individual], Matthias Niessner: Professor at the Technical University of Munich and head of the Visual Computing Lab.
@MichaelBronsteinGDL [Individual], Michael Bronstein: DeepMind Professor of AI, University of Oxford / Head of Graph Learning Research, Twitter.
@DeepFindr [Individual], Videos about all kinds of Machine Learning / Data Science topics.
@deeplizard [Individual], Videos about building collective intelligence.
@YannicKilcher [Individual], Yannic Kilcher: make videos about machine learning research papers, programming, and issues of the AI community, and the broader impact of AI in society.
@sentdex [Individual], sentdex: provides Python programming tutorials in machine learning, finance, data analysis, robotics, web development, game development and more.
@bmvabritishmachinevisionas8529 [Conferences], BMVA: British Machine Vision Association.
@ComputerVisionFoundation [Conferences], Computer Vision Foundation (CVF): co-sponsored conferences on computer vision (e.g. CVPR and ICCV).
@cvprtum [University], Computer Vision Group at Technical University of Munich.
@UCFCRCV [University], Center for Research in Computer Vision at University of Central Florida.
@dynamicvisionandlearninggr1022 [University], Dynamic Vision and Learning research group channel! Technical University of Munich.
@TubingenML [University], Machine Learning groups at the University of Tübingen.
@computervisiontalks4659 [Talks], Computer Vision Talks.
@freecodecamp [Talks], Videos to learn how to code.
@LondonMachineLearningMeetup [Talks], Largest machine learning community in Europe.
@LesHouches-iu6nv [Talks], Summer school on Statistical Physics of Machine learning held in Les Houches, July 4 - 29, 2022.
@MachineLearningStreetTalk [Talks], top AI podcast on Spotify.
@WeightsBiases [Talks], Weights and Biases team's conversations with industry experts, and researchers.
@PreserveKnowledge [Talks], Canada higher education media organization that focuses on advances in mathematics, computer science, and artificial intelligence.
@TwoMinutePapers [Papers], Two Minute Papers: Explaining AI papers in few mins.
@TheAIEpiphany [Papers], Aleksa Gordić: x-Google DeepMind, x-Microsoft engineer explaining AI papers.
@bycloudAI [Papers], bycloud: covers the latest AI tech/research papers for fun.
Unorganized/Unsorted:
- https://www.youtube.com/@AAmini
- https://www.youtube.com/@WhatsAI
- https://www.youtube.com/@mrdbourke
- https://www.youtube.com/@marksaroufim
- https://www.youtube.com/@NicholasRenotte
- https://www.youtube.com/@abhishekkrthakur
- https://www.youtube.com/@AladdinPersson
- https://www.youtube.com/@CodeEmporium
- https://www.youtube.com/@arp_ai
- https://www.youtube.com/@CodeThisCodeThat
- https://www.youtube.com/@connorshorten6311
- https://www.youtube.com/@SmithaKolan
- https://www.youtube.com/@AICoffeeBreak
- https://www.youtube.com/@independentcode
- https://www.youtube.com/@alfcnz
- https://www.youtube.com/@KapilSachdeva
- https://www.youtube.com/@AICoding
- https://www.youtube.com/@mildlyoverfitted

Mailing Lists

Vision Science, announcements about industry/academic jobs in computer vision around the world (in English).
bull-i3, posts about job opportunities in computer vision in France (in French).

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Repository files navigation

Curated educational list for computer vision

Reference Books

Online Courses

Uni Courses

YouTube Playlists

Python Libraries

MATLAB Libraries

Repos

Dataset Collections

Task Management Tools

Annotation Tools

Awesome Lists

Misc

Conferences

Journals

Summer Schools

YouTube Channels

Mailing Lists

Thanks

About

Contributors 2

mawady/awesome-cv

Folders and files

Latest commit

History

README.md

README.md

Repository files navigation

Curated educational list for computer vision

Reference Books

Online Courses

Uni Courses

YouTube Playlists

Python Libraries

MATLAB Libraries

Repos

Dataset Collections

Task Management Tools

Annotation Tools

Awesome Lists

Misc

Conferences

Journals

Summer Schools

YouTube Channels

Mailing Lists

Thanks

About

Topics

Resources

Stars

Watchers

Forks

Contributors 2