Public release of our data splits used for the results in the paper:
@ARTICLE{caltagirone2021dataset,
author = {Caltagirone, Luca and Bellone, Mauro and Lennart, Svenson and Mattias, Wahde and Raivo, Sell},
title = {Lidar-Camera Semi-Supervised Learning for Semantic Segmentation},
journal = {MDPI Sensors},
volume = {21},
number = {14},
year = {2021},
url = {https://www.mdpi.com/1424-8220/21/14/4813},
doi = {10.3390/s21144813},
pages = {000--000}
}
In our paper, we used the Waymo dataset which consists of 1110 driving sequences collected under various weather and lighting conditions. The Waymo open dataset includes 1110 driving sequences recorded with multiple cameras and lidars across a large variety of locations, road types and weather and lighting conditions. Each driving sequence consists of a 20-s-long recording sampled at 10 Hz. Both 2D and 3D bounding boxes were manually generated for all frames and considering the following four categories of objects: vehicles; pedestrians; cyclists; and traffic signs. Additionally, we have partitioned the driving sequences into four broad subsets, namely day–fair; night–fair; day–rain; night–rain (see Table below for further details).
Day–Fair | Day–Rain | Night–Fair | Night–Rain |
---|---|---|---|
747 | 226 | 82 | 45 |
table: Number of 20-s-long driving sequences belonging to the four main categories considered in this work (10 Hz sample frequency).
The labels day and night indicate whether a sequence was collected during the day under good lighting conditions, or late in the day or at night under poor external illumination. The labels fair and rain instead refer to the weather conditions, with fair denoting good weather, and rain denoting active raining or wet environment following recent precipitation. Each sequence originally contains approximately 200 frames (i.e., 20 s sampled at 10 Hz) of which we kept every 10th frame. Out of the full dataset, for~the experiments described in the following sections, we generated N=10 random dataset splits. The randomization was carried out by considering driving sequences instead of individual frames in order to avoid any overlap between the subsets in any given set.
With the exception of the training sets, which only contain examples belonging to the coarse category day--fair, all other sets include examples belonging to all coarse categories. By only considering training data belonging to one category, it is possible to investigate the effectiveness of the proposed approach for carrying out domain adaptation. (See Table below for more details regarding the dataset splits S.)
Set | Day–Fair | Day–Rain | Night–Fair | Night–Rain |
---|---|---|---|---|
Training | 100 | 0 | 0 | 0 |
Validation | 10 | 10 | 10 | 5 |
Unlabelled | 40 | 40 | 36 | 20 |
Test | 40 | 40 | 36 | 20 |
table: Number of sequences assigned to training, validation, unlabelled, and test subsets for each dataset split Si according to the four main categories considered in this work.
The specific lists are provided in the folder ./data/splits/. as text files
./data/splits/
The list files contain rows in the following format:
labeled/day/not_rain/camera/segment-575209926587730008_3880_000_3900_000_with_camera_labels_0000000124.png
where "day", "not_rain" etc. refer to the specific split in the table above, while "segment-XXXXX-.png" refers to the specific image from the waymo dataset. File names follow the same standard and the same format.