AIDA U²-Net Project

A Python package for in-memory image segmentation using the U²-Net model. This package processes images entirely in memory—cropping or warping them based on detected contours—so that its output can be fed directly into downstream components (e.g. a CNN for determining if an image is AI‑generated).

Features

In-Memory Processing: No disk writes; images are processed and returned as NumPy arrays.
U²-Net Based: Uses the U²-Net model to generate saliency masks.
Perspective Transformation: Applies a perspective warp when a quadrilateral is detected.
Fallback Cropping: Falls back to bounding box cropping if a quadrilateral is not detected.
Cloud-Ready API: Designed to be integrated in cloud services where images are processed and passed directly to subsequent stages.

Package Structure

AIDA_U2_Network/
├── src/
    ├── __init__.py          # Exposes the main API class.
    ├── config.py            # Configuration constants (e.g. model weights path, image size).
    ├── segmenter.py         # Contains the U2NetSegmenter class with the segmentation API.
    ├── u2net.py             # Contains the U²-Net network definition.
    ├── utils.py             # Helper functions for image preprocessing and transforms.
    ├── weights/
    │   └── u2net.pth        # Pre-trained U²-Net model weights.
└── examples/
    └── demos.py         # Demo script to process all images in the demo_images folder.

Prerequisites

Python 3.7+
PyTorch and Torchvision
OpenCV (opencv-python)
NumPy
Matplotlib (used in demos/visualizations)

Installation

Clone the repository:

git clone <repository-url>
cd u2net_segmenter

Install the Required Packages:
```
pip install -r requirements.txt
```
Download the U^2 Net Weights:

Place your pre-trained U²-Net weights (e.g. u2net.pth) inside the weights/ folder. Ensure the path in config.py matches the location of your weights file

Usage

import cv2
from u2net_segmenter import U2NetSegmenter

Initialize the segmenter.

segmenter = U2NetSegmenter()

Load an image (ensure the image is in BGR format as used by OpenCV).

image = cv2.imread("path/to/your/image.jpg")
if image is None:
    raise ValueError("Unable to load image.")

Process the image.

result = segmenter.process_image(image)

The Segmented Output

'result' is a NumPy array containing the segmented (warped or cropped) image. It can now be passed directly into your downstream CNN.

cv2.imshow("Segmented Image", result)
cv2.waitKey(0)
cv2.destroyAllWindows()

The code above shows how to import and use the U2NetSegmenter class. Remember that your image variable must be a valid OpenCV image (BGR format).

Running `demos.py` from the Project Root

When you run a Python script directly (e.g., python examples/demos.py), Python sets the current working directory to examples/ and will not automatically recognize your src/ folder as a package.

A simple way to fix this is to run your script as a module from the project root:

    python -m examples.demos

With that command, Python will treat examples as a top-level package and can properly import from src. Inside your demos.py, you can then use:

    from src import U2NetSegmenter

Citation

@InProceedings{Qin_2020_PR,
title = {U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection},
author = {Qin, Xuebin and Zhang, Zichen and Huang, Chenyang and Dehghan, Masood and Zaiane, Osmar and Jagersand, Martin},
journal = {Pattern Recognition},
volume = {106},
pages = {107404},
year = {2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
examples		examples
src		src
.gitignore		.gitignore
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AIDA U²-Net Project

Features

Package Structure

Prerequisites

Installation

Usage

Initialize the segmenter.

Load an image (ensure the image is in BGR format as used by OpenCV).

Process the image.

The Segmented Output

Running `demos.py` from the Project Root

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AIDA U²-Net Project

Features

Package Structure

Prerequisites

Installation

Usage

Initialize the segmenter.

Load an image (ensure the image is in BGR format as used by OpenCV).

Process the image.

The Segmented Output

Running demos.py from the Project Root

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Running `demos.py` from the Project Root

Packages