This is the Indoor Landmark Detection module for F1tenth racecar navigation which heavily adapted from YOLO: Real-Time Object Detection.
The trained YOLO model can classify indoor objects(currently doors only but can be generalized to more classes with more labeled data), and has an overall accuracy(validation) > 95%. With the tiny sized model, the detection frame rate can reach >15 fps onboard.
Pre-trained YOLO models are trained using datasets taken from human's perspective and outdoors, such as COCO. However in our case, the landmarks to be detected are indoor architectural objects, such as doors, windows, floors, chairs and tables, etc. Thus, we need to fine tuning some pretrained YOLO model with customized dataset to achieve high accuracy. This jupyter notebook is written for fine tuning and you may find this blog also very useful.
This software is built on the Robotic Operating System ([ROS]), which needs to be installed first. Additionally, YOLO for ROS depends on following software:
In order to install darknet_ros, clone the latest version using SSH (see how to set up an SSH key) from this repository into your catkin workspace and compile the package using ROS.
cd catkin_workspace/src
git clone --recursive git@github.com:leggedrobotics/darknet_ros.git
cd ../
To maximize performance, make sure to build in Release mode. You can specify the build type by setting
catkin_make -DCMAKE_BUILD_TYPE=Release
or using the Catkin Command Line Tools
catkin build darknet_ros -DCMAKE_BUILD_TYPE=Release
Darknet on the CPU is fast (approximately 1.5 seconds on an Intel Core i7-6700HQ CPU @ 2.60GHz × 8) but it's like 500 times faster on GPU! You'll have to have an Nvidia GPU and you'll have to install CUDA. The CMakeLists.txt file automatically detects if you have CUDA installed or not. CUDA is a parallel computing platform and application programming interface (API) model created by Nvidia. If you do not have CUDA on your System the build process will switch to the CPU version of YOLO. If you are compiling with CUDA, you might receive the following build error:
nvcc fatal : Unsupported gpu architecture 'compute_61'.
This means that you need to check the compute capability (version) of your GPU. You can find a list of supported GPUs in CUDA here: CUDA - WIKIPEDIA. Simply find the compute capability of your GPU and add it into darknet_ros/CMakeLists.txt. Simply add a similar line like
-O3 -gencode arch=compute_62,code=sm_62
Download the fine-tuned weights and copy it under directory:
catkin_workspace/src/darknet_ros/darknet_ros/yolo_network_config/weights/
The model configure is defined in
catkin_workspace/src/darknet_ros/darknet_ros/yolo_network_config/cfg/yolov3-tiny-door-detection.cfg
In addition, the config file for ROS defines the names of the detection objects, in our case is "door" only.
Run:
roslaunch darknet_ros darknet_ros.launch
This is the main YOLO ROS: Real-Time Object Detection for ROS node. It uses the camera measurements to detect pre-learned objects in the frames.
You can change the names and other parameters of the publishers, subscribers and actions inside darknet_ros/config/ros.yaml.
-
/camera_reading([sensor_msgs/Image])The camera measurements.
-
object_detector([std_msgs::Int8])Publishes the number of detected objects.
-
bounding_boxes([darknet_ros_msgs::BoundingBoxes])Publishes an array of bounding boxes that gives information of the position and size of the bounding box in pixel coordinates.
-
detection_image([sensor_msgs::Image])Publishes an image of the detection image including the bounding boxes.
-
camera_reading([sensor_msgs::Image])Sends an action with an image and the result is an array of bounding boxes.
You can change the parameters that are related to the detection by adding a new config file that looks similar to darknet_ros/config/yolo.yaml.
-
image_view/enable_opencv(bool)Enable or disable the open cv view of the detection image including the bounding boxes.
-
image_view/wait_key_delay(int)Wait key delay in ms of the open cv window.
-
yolo_model/config_file/name(string)Name of the cfg file of the network that is used for detection. The code searches for this name inside
darknet_ros/yolo_network_config/cfg/. -
yolo_model/weight_file/name(string)Name of the weights file of the network that is used for detection. The code searches for this name inside
darknet_ros/yolo_network_config/weights/. -
yolo_model/threshold/value(float)Threshold of the detection algorithm. It is defined between 0 and 1.
-
yolo_model/detection_classes/names(array of strings)Detection names of the network used by the cfg and weights file inside
darknet_ros/yolo_network_config/.
