
Thermal SuperPoint SLAM

Thermal SuperPoint SLAM is a project completed for ROB 530 at the University of Michigan in the winter 2021 semester. The project aimed to create an indirect SLAM algorithm that performs reliably on thermal imagery. Specifically, we trained a SuperPoint feature detection and description network on thermal imagery and integrated the network with ORB_SLAM2 in place of the ORB feature detector and descriptor. Our combined algorithm runs offline on precomputed keypoints and descriptors. See our video and report for the details of the process and our results. Although the project was completed with thermal imagery in mind, the steps described here can be followed with any set of images to produce a SuperPoint network and corresponding vocabulary and to use them within a modified version of ORB_SLAM2.

This project utilizes four existing codebases: pytorch-superpoint, DBoW2, ORB_SLAM2, and SuperPoint_SLAM. Each codebase required modifications, and the modified forks are included in the thirdparty folder as submodules.

1. Setup

This library was tested on Ubuntu 18.04. After downloading the submodules, the setup is divided into three task-specific sections: SuperPoint training, vocabulary creation, and SLAM. Each task can be done independently of the others. Finally, there are additional requirements for our preprocessing and evaluation scripts.

Downloading Submodules

This repository uses submodules. After cloning, download them by running:

cd Thermal_SuperPoint_SLAM
git submodule update --init --recursive

SuperPoint Training (pytorch-superpoint)

The pytorch-superpoint repository provides a requirements file for installing dependencies. An example of using it to create an environment with Anaconda is as follows:

cd thirdparty/pytorch-superpoint/
conda create --name py36-sp python=3.6
conda activate py36-sp
pip install -r requirements.txt
pip install -r requirements_torch.txt

Vocabulary Generation (DBoW2)

For training a SuperPoint vocabulary, DBoW2 requires only OpenCV (C++). Download and install instructions can be found at http://opencv.org.

After installing OpenCV, the DBoW2 code can be built by running the provided shell script:

chmod +x build_vocab_code.sh
./build_vocab_code.sh

SLAM (ORB_SLAM2 and SuperPoint_SLAM)

To run the modified versions of ORB_SLAM2 and SuperPoint_SLAM the standard ORB_SLAM2 prerequisites are required: a C++11 compiler, Pangolin, OpenCV, and Eigen3 (DBoW2 and g2o are included in the respective Thirdparty folders).

After installing all of the above, the ORB_SLAM2 and SuperPoint_SLAM code can be built by running the provided shell script:

chmod +x build_slam_code.sh
./build_slam_code.sh

Preprocessing and Evaluation (utils/ and evaluation/)

Any of these scripts that do not work with ROS bags can be run using the pytorch-superpoint environment (see the SuperPoint Training section above). The scripts that work with ROS bags require ROS to be installed (they were tested with ROS Melodic), and we have found that these scripts do not work within conda environments. The scripts that create GIFs additionally require PIL, which can be installed with pip install Pillow.

2. Image Directory Preprocessing

This section explains how to apply contrast limited adaptive histogram equalization (CLAHE) to an image directory containing 16-bit thermal images. This is a step we took when training our thermal SuperPoint network, but it is not necessary for training on RGB images. See our video and report for more details.

The script utils/image_directory_preprocessor.py is provided to apply CLAHE to an image directory and write the results to a new directory. See the script's help message for the full details.

An example of how to use this script is as follows:

python image_directory_preprocessor.py ../../datasets/FLIR_ADAS/train/Data/ Data_CLAHE

This will apply CLAHE to each image in the source directory and write the results to ../../datasets/FLIR_ADAS/train/Data_CLAHE/ as PNG images with the same filenames.
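
For reference, the core of this operation is only a few lines of OpenCV. The snippet below is a minimal sketch with illustrative parameter values; see the script itself for the options it actually exposes:

import cv2
import numpy as np

# Minimal sketch: equalize a 16-bit thermal image and rescale it to 8 bits.
# The clip limit and tile grid size are illustrative assumptions.
img16 = cv2.imread("thermal.png", cv2.IMREAD_ANYDEPTH)
clahe = cv2.createCLAHE(clipLimit=40.0, tileGridSize=(8, 8))
equalized = clahe.apply(img16)  # OpenCV's CLAHE accepts 8- and 16-bit images
img8 = cv2.normalize(equalized, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
cv2.imwrite("thermal_clahe.png", img8)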

3. ROS Bag Preprocessing

This section explains how to preprocess image messages in a ROS bag into a format suitable for ORB_SLAM2 and SuperPoint_SLAM. The end result is a text file of timestamps and a folder of images with filenames corresponding to the timestamps. This is the same format ORB_SLAM2 uses for the EuRoC dataset.

The script utils/rosbag_preprocessor.py is provided for this purpose. See the script's help message for the full details.

An example of how to use this script is as follows:

python rosbag_preprocessor.py ../../datasets/vivid/outdoor_robust_day1.bag /thermal/image_raw outdoor_thermal --apply-clahe

This will output all images under the topic /thermal/image_raw to the directory ../../datasets/vivid/outdoor_thermal/images_30hz_tstart_0_tstop_inf and a text file containing the timestamp of each image to ../../datasets/vivid/outdoor_thermal/timestamps/timestamps_30hz_tstart_0_tstop_inf.txt. Note that the framerate, start time, and stop time are encoded in the image folder name and the timestamp filename (in this example the original framerate is assumed to be 30 Hz). Note also the --apply-clahe flag used here: it indicates that the input messages are 16-bit images, that CLAHE should be applied, and that the result should be stored as an 8-bit image. This is unnecessary for RGB images.
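
The core of this conversion can be sketched with the rosbag and cv_bridge APIs. Everything below (paths, output layout) is illustrative; the actual script additionally handles the framerate divisor, the time window, and the optional CLAHE step:

import os
import cv2
import rosbag
from cv_bridge import CvBridge

# Illustrative sketch: dump a thermal image topic to EuRoC-style files.
bridge = CvBridge()
if not os.path.isdir("outdoor_thermal/images"):
    os.makedirs("outdoor_thermal/images")
with open("outdoor_thermal/timestamps.txt", "w") as ts_file:
    bag = rosbag.Bag("outdoor_robust_day1.bag")
    for _, msg, _ in bag.read_messages(topics=["/thermal/image_raw"]):
        stamp = msg.header.stamp.to_nsec()  # filename matches the timestamp
        image = bridge.imgmsg_to_cv2(msg)   # 16-bit for raw thermal topics
        cv2.imwrite("outdoor_thermal/images/%d.png" % stamp, image)
        ts_file.write("%d\n" % stamp)
    bag.close()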

4. SuperPoint Training

Training SuperPoint requires three steps: 1) train a MagicPoint network on synthetic shapes, 2) generate pseudo-groundtruth keypoint labels by applying the trained MagicPoint network with Homographic Adaptation, and 3) train a SuperPoint network on the pseudo-groundtruth keypoint labels.

We used an existing trained MagicPoint network included in the original pytorch-superpoint repository instead of training one ourselves; see that repository for details on training a MagicPoint network. The MagicPoint network we used is now located at trained_networks/magicpoint/magicpoint.pth.tar.

Pseudo-groundtruth interest points can be generated by making a few modifications (described below) and running:

python thirdparty/pytorch-superpoint/export.py export_detector_homoAdapt configs/training/magicpoint_flir_export.yaml magicpoint_synth

Specifically, DATA_PATH in settings.py and this line in FLIR_ADAS.py need to be modified for images to be imported. Note that in FLIR_ADAS.py the input task will be set to the export_folder given in the config file configs/training/magicpoint_flir_export.yaml. The parameter export_folder can be set to either 'train' or 'val', and the image dataset must be split into training and validation sets that are imported according to the corresponding export_folder setting. The above command must therefore be run twice: once with export_folder set to 'train' and once with it set to 'val'. The resulting pseudo-groundtruth keypoints are stored in logs/magicpoint_synth/predictions/train and logs/magicpoint_synth/predictions/val. The config file includes many other parameters that can optionally be changed. Note that the resized dimensions of the input images must be divisible by eight.
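
For the required split, a hypothetical helper like the following could divide a preprocessed image directory into the train and val sets the exporter expects (the 90/10 ratio and the paths are assumptions):

import os
import random
import shutil

# Hypothetical train/val split. Only the 'train'/'val' folder names are
# fixed by the export_folder settings; the ratio and paths are assumptions.
random.seed(0)
src = "../../datasets/FLIR_ADAS/train/Data_CLAHE"
names = sorted(os.listdir(src))
random.shuffle(names)
n_val = len(names) // 10
for split, subset in (("val", names[:n_val]), ("train", names[n_val:])):
    dst = os.path.join(os.path.dirname(src), split)
    if not os.path.isdir(dst):
        os.makedirs(dst)
    for name in subset:
        shutil.copy(os.path.join(src, name), os.path.join(dst, name))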

After generating the pseudo-groundtruth keypoints the SuperPoint network can be trained by running:

python thirdparty/pytorch-superpoint/train4.py train_joint configs/training/superpoint_flir_train_heatmap.yaml superpoint

The result is a series of checkpoints of the network saved to logs/superpoint/checkpoints/. Once again the config file includes many parameters that can optionally be changed.

trained_networks/superpoint_thermal/thermal.pth.tar is our thermal SuperPoint network trained on the FLIR ADAS dataset.
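
The checkpoint files are standard PyTorch archives and can be inspected with torch.load. A quick sketch (the dictionary keys inside depend on how pytorch-superpoint saves its checkpoints, so print them rather than assuming):

import torch

# Load a pytorch-superpoint checkpoint on the CPU and list its contents.
ckpt = torch.load("trained_networks/superpoint_thermal/thermal.pth.tar",
                  map_location="cpu")
print(list(ckpt.keys()))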

5. SuperPoint Keypoint and Descriptor Generation

Applying a SuperPoint network trained with pytorch-superpoint to an image in C++ would require porting a significant amount of Python code. Due to time constraints, we avoid this by precomputing the SuperPoint keypoints and descriptors in Python, storing the results, and importing them when needed in C++.

The script utils/generate_keypts_and_desc.py is provided to apply a SuperPoint network to an image directory and output the resulting keypoints and descriptors in sequentially named YAML files. See the script's help message for the full details.

An example of how to use this script with the thermal SuperPoint network we trained is as follows:

python generate_keypts_and_desc.py ../trained_networks/superpoint_thermal/thermal.pth.tar ../../datasets/vivid/outdoor_thermal/images_clahe_10hz_tstart_0_tstop_inf/ features

The result is a folder ../../datasets/vivid/outdoor_thermal/features/ with sequentially named YAML files containing the SuperPoint features.
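
Since the features are consumed by C++ code, a natural storage format is OpenCV's FileStorage YAML, which cv::FileStorage can read back directly. The sketch below illustrates the pattern; the node names and matrix shapes are assumptions, not the script's exact schema:

import cv2
import numpy as np

# Illustrative only: store one frame's keypoints and descriptors in an
# OpenCV YAML file. Node names and shapes are assumptions.
keypoints = np.zeros((100, 2), dtype=np.float32)      # one (x, y) per row
descriptors = np.zeros((100, 256), dtype=np.float32)  # one descriptor per row
fs = cv2.FileStorage("000000.yaml", cv2.FILE_STORAGE_WRITE)
fs.write("keypoints", keypoints)
fs.write("descriptors", descriptors)
fs.release()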

6. Vocabulary Generation

To generate a SuperPoint vocabulary using precomputed SuperPoint keypoints and descriptors run:

./thirdparty/DBoW2/build/build_superpt_vocab <PATH_TO_SUPERPOINT_FEATURES>

The result will be a file superpt_voc.yml.gz.

Note that a hardcoded k-means iteration limit of 100 was added here. Previously, DBoW2 only progressed to the next node once all descriptors had remained in the same clusters for two iterations. In our experience this frequently would not occur; instead, the percentage of descriptors switching clusters each iteration would oscillate. Note also that the completion percentage printed during training is only a loose approximation, as it uses an upper bound on the number of nodes to be processed.
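
To illustrate, a toy version of the capped clustering loop might look like the following (a standalone NumPy sketch, not DBoW2's actual code):

import numpy as np

# Toy k-means loop with the capped convergence test described above:
# stop once assignments are unchanged for two consecutive iterations,
# or after 100 iterations, whichever comes first.
def capped_kmeans(desc, k, max_iters=100):
    centers = desc[np.random.choice(len(desc), k, replace=False)].copy()
    prev, stable = None, 0
    for _ in range(max_iters):
        dists = ((desc[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        assign = dists.argmin(axis=1)
        if prev is not None and np.array_equal(assign, prev):
            stable += 1
            if stable >= 2:
                break
        else:
            stable = 0
        prev = assign
        for i in range(k):
            if np.any(assign == i):
                centers[i] = desc[assign == i].mean(axis=0)
    return centers, assign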

vocabularies/superpt_thermal.yml.gz is our thermal SuperPoint vocabulary trained on the FLIR ADAS dataset.

7. Running SuperPoint SLAM

Our modified version of SuperPoint SLAM runs offline on precomputed keypoints and descriptors. The original SuperPoint SLAM could run online, but it used the pretrained SuperPoint network provided by the original SuperPoint authors here. The third-party implementation we use for training (pytorch-superpoint) employs different layers in the network, so our trained networks are incompatible with the original SuperPoint SLAM. As with vocabulary training, our quick fix is to generate keypoints and descriptors offline and import them into SuperPoint SLAM at runtime.

Assuming the data is in the format described in the ROS Bag Preprocessing section, it can be imported using the EuRoC example. The executable takes the following arguments:

./thirdparty/SuperPoint_SLAM/Examples/Monocular/mono_euroc <PATH_TO_VOCABULARY> <PATH_TO_CONFIG> <PATH_TO_IMAGE_FOLDER> <PATH_TO_TIMESTAMP_FILE> <PATH_TO_SUPERPOINT_FEATURES>

For example:

./thirdparty/SuperPoint_SLAM/Examples/Monocular/mono_euroc vocabularies/superpt_thermal.yml.gz configs/ORB_SLAM2/ViViD_Thermal.yaml ../datasets/vivid/outdoor_thermal/images_clahe_10hz_tstart_0_tstop_inf/ ../datasets/vivid/outdoor_thermal/timestamps/timestamps_10hz_tstart_0_tstop_inf.txt ../datasets/vivid/outdoor_thermal/features/

8. Evaluation

This section gives the commands used to generate the results shown in our video and report.

Comparing Contrast Enhancement Techniques

Contrast enhancement comparison figure:

python compare_contrast_enhancement.py ../../datasets/fcav/cadata_sequence.bag /ubol/image_raw -f 50

CLAHE GIF:

python generate_clahe_gif.py ../../datasets/fcav/cadata_sequence.bag /ubol/image_raw clahe --frame-rate-divisor 10 --time-start 20 --time-stop 30

Feature Matching

Feature tracking GIF:

python generate_tracking_gif.py ../trained_networks/superpoint_thermal/thermal.pth.tar ../trained_networks/superpoint_rgb/rgb.pth.tar ../../datasets/fcav/uncooled_seq_1/images_clahe_10hz_tstart_90_tstop_110/ tracking 10

Vocabulary Image Similarity Scoring

Image similarity scores using thermal SuperPoint features and the thermal SuperPoint vocabulary:

./thirdparty/DBoW2/build/test_vocab thirdparty/DBoW2/test_vocab_data/SuperPoint_Thermal_Keypts_and_Desc/ vocabularies/superpt_thermal.yml.gz 

Image similarity scores using thermal SuperPoint features and the RGB SuperPoint vocabulary:

./thirdparty/DBoW2/build/test_vocab thirdparty/DBoW2/test_vocab_data/SuperPoint_Thermal_Keypts_and_Desc/ vocabularies/superpoint_rgb.yml.gz 

Image similarity scores using RGB SuperPoint features and the RGB SuperPoint vocabulary:

./thirdparty/DBoW2/build/test_vocab thirdparty/DBoW2/test_vocab_data/SuperPoint_RGB_Keypts_and_Desc/ vocabularies/superpoint_rgb.yml.gz 

Image similarity scores using RGB SuperPoint features and the thermal SuperPoint vocabulary:

./thirdparty/DBoW2/build/test_vocab thirdparty/DBoW2/test_vocab_data/SuperPoint_RGB_Keypts_and_Desc/ vocabularies/superpt_thermal.yml.gz

Image similarity scores using ORB features and the ORB vocabulary:

./thirdparty/ORB_SLAM2/Examples/Monocular/test_vocab thirdparty/DBoW2/test_vocab_data/ORB_Keypts_and_Desc/ vocabularies/ORBvoc.txt 
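
For context, DBoW2 scores two images by comparing their bag-of-words vectors; with its default L1 scoring, the score of two L1-normalized vectors v1 and v2 is 1 - 0.5 * ||v1 - v2||_1, so identical images score 1. A minimal sketch, assuming the vectors are plain arrays of word weights:

import numpy as np

# Sketch of DBoW2's default L1 similarity between two bag-of-words
# vectors; each vector holds one image's word weights.
def l1_score(v1, v2):
    v1 = v1 / np.abs(v1).sum()  # normalize to unit L1 norm
    v2 = v2 / np.abs(v2).sum()
    return 1.0 - 0.5 * np.abs(v1 - v2).sum()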

SLAM Recordings

RGB SuperPoint SLAM run on KITTI sequence 03:

./thirdparty/SuperPoint_SLAM/Examples/Monocular/mono_kitti vocabularies/superpoint_rgb.yml.gz thirdparty/SuperPoint_SLAM/Examples/Monocular/KITTI03.yaml ../datasets/kitti/data_odometry_gray/dataset/sequences/03/ ../datasets/kitti/data_odometry_gray/dataset/sequences/03/RGB_Feat_and_Descriptors/

ORB_SLAM2 run on KITTI sequence 03:

./thirdparty/ORB_SLAM2/Examples/Monocular/mono_kitti vocabularies/ORBvoc.txt thirdparty/ORB_SLAM2/Examples/Monocular/KITTI03.yaml ../datasets/kitti/data_odometry_gray/dataset/sequences/03/

Thermal SuperPoint SLAM run on thermal images:

./thirdparty/SuperPoint_SLAM/Examples/Monocular/mono_euroc vocabularies/superpt_thermal.yml.gz configs/ORB_SLAM2/X8500.yaml ../datasets/fcav/cooled/images_clahe_10hz_tstart_108_tstop_inf/ ../datasets/fcav/cooled/timestamps/timestamps_10hz_tstart_108_tstop_inf.txt ../datasets/fcav/cooled/features/

ORB_SLAM2 run on thermal images:

./thirdparty/ORB_SLAM2/Examples/Monocular/mono_euroc vocabularies/ORBvoc.txt configs/ORB_SLAM2/X8500.yaml ../datasets/fcav/cooled/images_clahe_10hz_tstart_108_tstop_inf/ ../datasets/fcav/cooled/timestamps/timestamps_10hz_tstart_108_tstop_inf.txt 
