One of the key features that differentiate robots from simple machines is their ability to act based on observations of the environment. In this lesson we will explore two common tasks in robotic perception: detecting and tracking visual fiducial markers and, as a stretch goal, human faces.
Derived from the Latin word for trust, fiducials are physical markers that can be used as a point of reference. When the size and shape of visual fiducials are known, they can be localised in 3D with respect to a calibrated 2D camera.
April Tags are a popular set of visual fiducials developed at the University of Michigan. In this workshop we will use the April Tag family tag36h11 (download here, source here).
In this session, we will use inexpensive cameras to detect and track the relative position and orientation of April Tags. Start by installing the April Tags ROS package:
sudo apt install ros-$ROS_DISTRO-apriltag-ros
There are two ways to source camera data for this exercise: live data from a USB camera, or prerecorded data from a ROS bag file.
Either use your laptop's camera or borrow a USB camera from a mentor. To integrate the camera into ROS, you'll need to install the correct driver; fortunately, most USB cameras and built-in laptop cameras work with the usb_cam package. We'll also need the camera_calibration package to calibrate your camera. Install these packages:
sudo apt install ros-$ROS_DISTRO-usb-cam \
ros-$ROS_DISTRO-camera-calibration
- Create a launch file that loads the USB camera driver (a minimal sketch is given after this list).
- Check that the camera's images are visible with rviz.
- Locate a camera calibration checkerboard.
- Calibrate your camera with a checkerboard by following these instructions.
- Source one or more of the April Tags that have been distributed around the workshop.
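As a starting point, the launch file for the live-camera route might look something like the sketch below. This assumes ROS 1 and the usb_cam package's usb_cam_node; the device path, pixel format, and frame name are placeholders to adjust for your hardware:

<launch>
  <!-- USB camera driver; check the device path with: ls /dev/video* -->
  <node pkg="usb_cam" type="usb_cam_node" name="usb_cam" output="screen">
    <param name="video_device" value="/dev/video0" />
    <param name="pixel_format" value="yuyv" />
    <param name="camera_frame_id" value="usb_cam" />
  </node>
</launch>

For the calibration step, the camera_calibration package provides the cameracalibrator.py tool; the checkerboard size and square dimensions to pass to it are described in the linked instructions.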
Ask one of the mentors for a copy of the ROS bag file april_tag.bag.
- Create a launch file that plays the ROS bag file (a minimal sketch is given after this list).
- Set the bag file to loop repeatedly.
- Check that the camera's images are visible with rviz.
- Explore how the camera calibration is stored in the bag file.
Once you have sourced camera data from either a live camera or bag file:
- Add the April Tag node to your launch file and configure it (tip: you will need to configure the node to subscribe to your camera's topics and add your April Tag ID to a configuration file); a sketch of this step is given after the list.
- Add a static tf from the map frame to the camera frame, at the height your camera is above the ground.
- View the image topic showing the tag detection in rviz.
- View the tf tree in rviz; it should look like this:
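A hedged sketch of these additions, assuming ROS 1, the apriltag_ros continuous detection node, and image topics published under /usb_cam (the remappings, frame names, tag ID, tag size, and camera height are all assumptions to adapt):

<launch>
  <!-- April Tag detector; remap to the image and camera_info topics your camera publishes.
       The detector expects a rectified image, so using image_raw is an approximation. -->
  <node pkg="apriltag_ros" type="apriltag_ros_continuous_node" name="apriltag_ros" output="screen">
    <rosparam command="load" file="$(find apriltag_ros)/config/settings.yaml" />
    <rosparam command="load" file="$(find apriltag_ros)/config/tags.yaml" />
    <remap from="image_rect" to="/usb_cam/image_raw" />
    <remap from="camera_info" to="/usb_cam/camera_info" />
  </node>

  <!-- Static transform from the map frame to the camera frame; 1.0 m height is a placeholder -->
  <node pkg="tf2_ros" type="static_transform_publisher" name="map_to_camera"
        args="0 0 1.0 0 0 0 map usb_cam" />
</launch>

In the tags.yaml configuration, each printed tag is listed under standalone_tags with its ID and physical size in metres, for example {id: 0, size: 0.16, name: tag_0}.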
The stretch goal is to create a face detection and position estimation system as a Python node. It assumes you are familiar with:
- Python
- Subscribing and publishing to topics
- Playing ROS bags
Ask a mentor for the face_detection.bag file, which contains lidar and USB camera data. The lidar sensor provides range-bearing measurements over a 270 degree horizontal field of view.
- Create a launch file that plays the ROS bag (looping the bag makes this easier).
- View the outputs of the camera and lidar (or depth sensor) in rviz.
- Calculate the range of lidar bearings that overlap the camera's view.
- Write a Python node that subscribes to the image and lidar topics (a skeleton is sketched after this list).
- Use OpenCV to perform face detection on the image.
- For each face detected, calculate its range and bearing in the latest lidar scan based on the face centroid.
- Output the object type, range and bearing.
- Stretch: publish the tf of the face detection.
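If you get stuck, the skeleton below is one possible way to structure the node. It is a sketch only, assuming ROS 1 (rospy), cv_bridge, OpenCV's bundled Haar cascade, topic names /usb_cam/image_raw and /scan, and a camera horizontal field of view of roughly 60 degrees centred on the lidar's zero bearing; check all of these against the actual bag file.

#!/usr/bin/env python3
# Sketch of a face detection and range-bearing node (see the assumptions above).
import math

import cv2
import rospy
from cv_bridge import CvBridge
from sensor_msgs.msg import Image, LaserScan

CAMERA_HFOV = math.radians(60.0)  # assumed horizontal field of view of the camera


class FaceDetector:
    def __init__(self):
        self.bridge = CvBridge()
        # Haar cascade shipped with OpenCV; the path may differ on your install
        self.cascade = cv2.CascadeClassifier(
            cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
        self.latest_scan = None
        rospy.Subscriber("/scan", LaserScan, self.scan_cb, queue_size=1)
        rospy.Subscriber("/usb_cam/image_raw", Image, self.image_cb, queue_size=1)

    def scan_cb(self, msg):
        # Keep only the most recent lidar scan
        self.latest_scan = msg

    def image_cb(self, msg):
        if self.latest_scan is None:
            return
        frame = self.bridge.imgmsg_to_cv2(msg, desired_encoding="bgr8")
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        faces = self.cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
        width = float(frame.shape[1])
        scan = self.latest_scan
        for (x, y, w, h) in faces:
            # Map the face centroid's pixel column to a bearing, assuming the
            # camera and lidar share a forward axis (zero bearing at the image
            # centre, bearings increasing to the left).
            cx = x + w / 2.0
            bearing = (0.5 - cx / width) * CAMERA_HFOV
            # Look up the lidar range closest to that bearing
            index = int(round((bearing - scan.angle_min) / scan.angle_increment))
            if 0 <= index < len(scan.ranges):
                rospy.loginfo("face: range %.2f m, bearing %.1f deg",
                              scan.ranges[index], math.degrees(bearing))


if __name__ == "__main__":
    rospy.init_node("face_detector")
    FaceDetector()
    rospy.spin()

For the final stretch item, the same callback could publish a geometry_msgs/TransformStamped through a tf2_ros.TransformBroadcaster, converting the range and bearing into x and y in the camera frame.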