Alexander Alekhin edited this page Jun 16, 2016 · 3 revisions
Clone this wiki locally

Open CV 3.0: Solving Problems

The third major release of OpenCV is aimed at building solid ground for computer vision development. C++ and Python will be covered here. OpenCV 3.0 enables:

  • more algorithms to be integrated (of which we will showcase the latest)
  • more supported languages (Matlab, Ruby, Haskell)
  • more optimizations (NEON, OpenVX)
  • a new and modular way of participating to the core development

While showcasing the aforementioned features, we will focus building end-to-end vision pipelines through several application walk-throughs. Code, instructions, and mobile applications will be available online before the tutorial.

Video from last summer of code


Morning Session: OpenCV 3.0 Walkthrough


  • Introduction (Gary Bradski)


  • source:OpenCVOverviewCVPR2014.pptx
  • Library Overview (Gary Bradski)
    • History, License (BSD)
    • Content Overview
    • Documentation
    • Interfaces: C++, Matlab, Java, Python
    • Platforms:
      • Windows
      • Mac
      • Linux
      • iOS
      • Android
    • Acceleration: Cuda, OpenCL, IPP


  • OpenCV 3.0 … OpenCV grows up and further opens up (Vincent Rabaud, Gary Bradski)
    • Modules
    • User Contrib, how to submit
    • How to develop for OpenCV
      • Development Cycle
      • Pull requests
    • Class factories




  • Past and Current OpenCV Google Summer of Code (Vincent Rabaud)
    • Projects
    • Videos


  • Open Question/Brainstorm /Feature Request Session (Vincent Rabaud, Grace Vesom)
    • we had a discussion at CVPR 2011 with great exchanges and lasting impact on OpenCV



Afternoon Session: Problem Solving Tutorials


  • The RGBD Module in OpenCV 3.0 (Vincent Rabaud)
    • Basics
    • Normals
    • Planes
    • Odometry
    • 3d Visualization
    • Future

Code at

  • RGBD Tutorial (Patrick O’Keefe)
    • Acquire 3D data, options. Occipital Sensor
    • Process for normals/planes
    • Register depth data to external, high-resolution RGB camera
    • Track motion with RGBD odometry




  • The art of stereo camera calibration (Grace Vesom)
    • Utilizing new fisheye calibration model in OpenCV 3.0
    • Best practices
    • Partial capture / using aruco boards as calibration boards
    • Calibrating wide angle stereo systems


  • Computational photography (Vincent Rabaud)
    • HDR
    • Cloning
    • Non-photorealistic rendering
    • Visualization


  • Appearance Based SLAM: OpenFABMap (Gary Bradski)
    • Overview of FABMap
    • Walk through how to use OpenFABMap in OpenCV
    • Demo


  • Concluding panel/Q&A (All)



Dr. Gary Rost Bradski

is VP of Perception and Core Software at Magic Leap. Gary founded OpenCV at Intel Research in 2000 and is currently CEO of non-profit He ran the vision team for Stanley, the autonomous vehicle that completed and won the $2M DARPA Grand Challenge robot race across the desert. Dr. Bradski helped start up NeuroScan (sold to Marmon), Video Surf (sold to Microsoft), and Willow Garage (absorbed into Suitable Tech). Most recently, he founded Industrial Perception (sold August 2013). Gary has more than 100 publications and more than 30 patents and is co-author of a best seller in its category Learning OpenCV: Computer Vision with the OpenCV Library, O’Reilly Press.

Patrick O’Keef

is a computer vision engineer at Occipital. He loves and works on everything related to RGBD and the Structure Sensor (from firmware to SLAM), and holds an MSEE from the University of Michigan in Signal Processing."

Vincent Rabaud

is the perception team manager at Aldebaran Robotics. He co-founded the non-profit with Gary Bradski in 2012 while a research engineer at Willow Garage. His research interests include 3D processing, object recognition and anything that involves underusing CPUs by feeding them fast algorithms. Dr. Rabaud completed his PhD at UCSD, advised by Serge Belongie. He also holds a MS in space mechanics and space imagery from SUPAERO and a MS in optimization from the Ecole Polytechnique.

Grace Vesom

is a computer vision engineer at Magic Leap. Previously, she was a research scientist at Lawrence Livermore National Laboratory working on computer vision and machine learning algorithms for global security applications. She completed her DPhil at the University of Oxford with Alison Noble and Mike Brady in 2010.