All RELLIS-3D files and components can be found on the official GitHub repository:
🔗 RELLIS-3D GitHub
Evaluation benchmarks:
-
HRNet+OCR metrics: HRNet Benchmark
-
GSCNN metrics: GSCNN Benchmark
-
DALL·E 3 (used for generative segmentation): DALL·E 3 by OpenAI
-
mIoU benchmarks code are found within the HRNet & GSCNN repository
-
All images used for this testing are in the "images" folder.
⚠️ Note: The free version of DALL·E 3 has a limited number of image generations per day. For consistent and high-quality results, it is recommended to upgrade to GPT Plus.
-
Upload an image to DALL·E 3.
-
Visually identify the most dominant class in the image.
For RELLIS-3D images, the most common classes are typically sky or grass. -
Prompt DALL·E 3 to segment the dominant class.
Use this format:“Segment the {class name} in {color}.”
For example:
“Segment the sky in RGB(0, 255, 0).”Acceptable color formats include RGB, Hex, or named colors (e.g.,
cyan
,dark green
). RGB or Hex values are preferred for consistency. -
Inspect the output.
- If segmentation includes incorrect areas or merges multiple classes into one color, refine your prompt.
- If the result is poor, restart the segmentation process for better accuracy.
-
After obtaining a satisfactory segmentation, proceed to morphological refinement.
-
Apply a first round of morphological operations.
Choose a combination of:- Kernel Size (
K
) - Structuring Element Shape (
Circle
orSquare
) - Operation Type (
Open
,Close
,OpenClose
)
Example prompt:
“Apply a Closing operation using a circular structuring element with a kernel size of 5 to the segmented sky.”
- Kernel Size (
DALL-E will produce the results natively
-
Evaluate the resulting segmentations.
Select the image that produces the highest IoU. There should be a measurable improvement from the original—e.g., sky IoU increasing from0.8763
to0.8931
. -
Apply a second round of operations on the selected best image.
Available operation combinations:Closing & Flood Fill
Flood Fill & Closing
Flood Fill & Opening
Imfill
Opening & Closing
Opening
Closing & Flood Fill
Opening & Flood Fill
Kernel sizes typically range from 5 to 60.
Example prompt:“Perform a Flood Fill & Opening operation with kernel size 5 on the segmented sky region.”
Repeat this for all morphological types and kernel sizes (24 combinations total), then select the image with the highest IoU.
-
Repeat the process for each remaining class until the full image is segmented.
🔍 Important: Smaller visual classes (e.g., bushes, road signs) are more difficult to segment accurately. Extra care or multiple iterations may be needed.