segmentAI

Dataset and Resources

All RELLIS-3D files and components can be found on the official GitHub repository:
🔗 RELLIS-3D GitHub

Evaluation benchmarks:

HRNet+OCR metrics: HRNet Benchmark
GSCNN metrics: GSCNN Benchmark
DALL·E 3 (used for generative segmentation): DALL·E 3 by OpenAI
The mIoU benchmark code is located in the HRNet and GSCNN repositories.
All images used for this testing are in the "images" folder.

⚠️ Note: The free version of DALL·E 3 limits the number of image generations per day. For consistent, high-quality results, upgrading to ChatGPT Plus is recommended.

How to Segment an Image Using DALL·E 3

Upload an image to DALL·E 3.
Visually identify the most dominant class in the image.
For RELLIS-3D images, the most common classes are typically sky or grass.
Prompt DALL·E 3 to segment the dominant class.
Use this format:

“Segment the {class name} in {color}.”
For example:
“Segment the sky in RGB(0, 255, 0).”

Acceptable color formats include RGB, Hex, or named colors (e.g., cyan, dark green). RGB or Hex values are preferred for consistency.
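
To keep prompts consistent across classes, the prompt format above can be generated rather than typed by hand. The following is a minimal sketch, not part of this repository: the helper names `to_color_text` and `build_segment_prompt` are hypothetical, and converting Hex values to RGB is an assumption made here for uniformity.

```python
def to_color_text(color):
    """Return the color as it should appear in the prompt.

    Accepts an (r, g, b) tuple, a '#RRGGBB' hex string, or a named color.
    Hex values are converted to RGB(...) form (an assumption for consistency);
    named colors such as "cyan" are passed through unchanged.
    """
    if isinstance(color, tuple):
        r, g, b = color
    elif isinstance(color, str) and color.startswith("#") and len(color) == 7:
        r, g, b = (int(color[i:i + 2], 16) for i in (1, 3, 5))
    else:
        return color  # treated as a named color
    for channel in (r, g, b):
        if not 0 <= channel <= 255:
            raise ValueError(f"channel out of range: {channel}")
    return f"RGB({r}, {g}, {b})"


def build_segment_prompt(class_name, color):
    """Fill in the 'Segment the {class name} in {color}.' template."""
    return f"Segment the {class_name} in {to_color_text(color)}."


print(build_segment_prompt("sky", (0, 255, 0)))  # Segment the sky in RGB(0, 255, 0).
```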
Inspect the output.
- If segmentation includes incorrect areas or merges multiple classes into one color, refine your prompt.
- If the result is poor, restart the segmentation process for better accuracy.
After obtaining a satisfactory segmentation, proceed to morphological refinement.
Apply a first round of morphological operations.
Choose a combination of:
- Kernel Size (K)
- Structuring Element Shape (Circle or Square)
- Operation Type (Open, Close, OpenClose)
Example prompt:

“Apply a Closing operation using a circular structuring element with a kernel size of 5 to the segmented sky.”

DALL·E 3 will produce the results natively.
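
For readers who want to see what these operations do to a mask, or to sanity-check a result offline, here is a dependency-free sketch of binary opening and closing. It is an illustration under stated assumptions, not the procedure DALL·E 3 runs internally: masks are assumed to be lists of rows of 0/1 values, the circle is approximated as all offsets within radius `k // 2`, pixels outside the image are ignored, and all function names are hypothetical. In practice an image library such as OpenCV or scikit-image would normally be used instead.

```python
def structuring_element(k, shape="square"):
    """Return the (dy, dx) offsets of a k x k square or a circle of radius k // 2."""
    r = k // 2
    return [
        (dy, dx)
        for dy in range(-r, r + 1)
        for dx in range(-r, r + 1)
        if shape == "square" or dy * dy + dx * dx <= r * r
    ]


def _neighbours(mask, y, x, offsets):
    """Yield the in-bounds mask values under the structuring element at (y, x)."""
    h, w = len(mask), len(mask[0])
    for dy, dx in offsets:
        yy, xx = y + dy, x + dx
        if 0 <= yy < h and 0 <= xx < w:
            yield mask[yy][xx]


def dilate(mask, offsets):
    """A pixel becomes 1 if any in-bounds neighbour is 1."""
    return [[int(any(_neighbours(mask, y, x, offsets)))
             for x in range(len(mask[0]))] for y in range(len(mask))]


def erode(mask, offsets):
    """A pixel stays 1 only if every in-bounds neighbour is 1."""
    return [[int(all(_neighbours(mask, y, x, offsets)))
             for x in range(len(mask[0]))] for y in range(len(mask))]


def opening(mask, offsets):
    """Erosion then dilation: removes small isolated foreground specks."""
    return dilate(erode(mask, offsets), offsets)


def closing(mask, offsets):
    """Dilation then erosion: fills small holes and gaps in the foreground."""
    return erode(dilate(mask, offsets), offsets)


def open_close(mask, offsets):
    """Opening followed by closing (the 'OpenClose' option)."""
    return closing(opening(mask, offsets), offsets)


# A 5 x 5 "sky" mask with a one-pixel hole in the middle.
sky = [[1] * 5 for _ in range(5)]
sky[2][2] = 0
closed = closing(sky, structuring_element(5, "circle"))
print(closed[2][2])  # 1
```

Closing suits masks with small holes, while opening suits masks with stray specks, which is why the step above tries several combinations rather than fixing one.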

Evaluate the resulting segmentations.
Select the image with the highest IoU. It should show a measurable improvement over the original segmentation, for example a sky IoU increasing from 0.8763 to 0.8931.
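
As a reminder of what is being compared, IoU for one class is the number of pixels labeled as that class in both the prediction and the ground truth, divided by the number labeled as that class in either. The sketch below is illustrative only and is not the benchmark code from the HRNet or GSCNN repositories; it assumes binary masks stored as lists of rows of 0/1 values, and the names `iou` and `best_candidate` are hypothetical.

```python
def iou(pred, truth):
    """Intersection over union of two binary masks of equal size."""
    intersection = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Both masks empty: treated as a perfect match here (conventions vary).
    return intersection / union if union else 1.0


def best_candidate(candidates, truth):
    """Return the name of the candidate mask with the highest IoU."""
    return max(candidates, key=lambda name: iou(candidates[name], truth))


truth = [[1, 1], [0, 0]]
candidates = {
    "original": [[1, 0], [0, 0]],
    "closing_k5_circle": [[1, 1], [0, 0]],
}
print(iou(candidates["original"], truth))  # 0.5
print(best_candidate(candidates, truth))   # closing_k5_circle
```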
Apply a second round of operations on the selected best image.
Available operation combinations:
- Closing & Flood Fill
- Flood Fill & Closing
- Flood Fill & Opening
- Imfill
- Opening & Closing
- Opening
- Opening & Flood Fill
Kernel sizes typically range from 5 to 60.
Example prompt:

“Perform a Flood Fill & Opening operation with kernel size 5 on the segmented sky region.”

Repeat this for all morphological types and kernel sizes (24 combinations total), then select the image with the highest IoU.
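
The Flood Fill and Imfill options are not defined further in this README. One common reading, assumed here, is hole filling: flood-fill the background starting from the image border, then mark every background pixel that was not reached as foreground. The sketch below shows that interpretation with the standard library only; `fill_holes` is a hypothetical name, masks are lists of rows of 0/1 values, and 4-connectivity is an assumption.

```python
from collections import deque


def fill_holes(mask):
    """Fill background regions that are fully enclosed by foreground."""
    h, w = len(mask), len(mask[0])
    reached = [[False] * w for _ in range(h)]
    queue = deque()

    # Seed the flood fill with every background pixel on the image border.
    for y in range(h):
        for x in range(w):
            on_border = y in (0, h - 1) or x in (0, w - 1)
            if on_border and mask[y][x] == 0:
                reached[y][x] = True
                queue.append((y, x))

    # Breadth-first flood fill through 4-connected background pixels.
    while queue:
        y, x = queue.popleft()
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w:
                if not reached[ny][nx] and mask[ny][nx] == 0:
                    reached[ny][nx] = True
                    queue.append((ny, nx))

    # Anything the fill could not reach is foreground or an enclosed hole.
    return [[0 if reached[y][x] else 1 for x in range(w)] for y in range(h)]


ring = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
filled = fill_holes(ring)
print(filled[2][2])  # 1
```

Under this reading, a combined option such as Flood Fill & Opening would correspond to filling holes first and then applying a morphological opening to the result.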
Repeat the process for each remaining class until the full image is segmented.

🔍 Important: Smaller visual classes (e.g., bushes, road signs) are more difficult to segment accurately. Extra care or multiple iterations may be needed.
