convert mask binary image to polygon format #131

raninbowlalala · 2018-03-01T05:49:10Z

I want to create a new dataset same as coco format, and now I have converted mask binary image to RLE format by using encode function in mask.py. But I don't know how to convert mask binary image or RLE format to polygon format, someone can help me? Thanks in advance.

waspinator · 2018-03-07T19:12:21Z

You can try this. I haven't tested it to actually work as a valid COCO format though. Let me know if it works for you.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

produces the following output:

{
    "segmentation": [
        [
            7.0,
            5.5,
            6.0,
            5.5,
            5.0,
            5.5,
            4.5,
            5.0,
            4.5,
            4.0,
            4.5,
            3.0,
            4.5,
            2.0,
            5.0,
            1.5,
            6.0,
            1.5,
            7.0,
            1.5,
            7.5,
            2.0,
            7.5,
            3.0,
            7.5,
            4.0,
            7.5,
            5.0,
            7.0,
            5.5
        ],
        [
            0.0,
            5.5,
            0.5,
            6.0,
            0.0,
            6.5
        ]
    ],
    "area": 13,
    "iscrowd": 0,
    "image_id": 123,
    "bbox": [
        0.0,
        2.0,
        8.0,
        5.0
    ],
    "category_id": 1,
    "id": 1
}

raninbowlalala · 2018-03-09T02:35:53Z

@waspinator I modified your code to read a file path of mask binary images, and it worked for me. Thank you very much!

JerryLingjieMei · 2018-08-06T02:36:33Z

@waspinator I was wondering why you would use np.flip on a contour? Is it because of the difference between c-like and Fortran-like numpy arrays?

waspinator · 2018-08-07T17:08:09Z

@JerryLingjieMei I think it was just because [x, y] coordinates were in the wrong order.

I've since worked on packaging up a tool to create coco json files. Take a look at https://github.com/waspinator/pycococreator/

thecondofitz · 2018-08-07T18:57:23Z

First of all, thank you @waspinator for all the tools that you've created for us to use. You've saved me so much time along the way.
I recently ran into some issues trying to write Mask R-CNN predictions/results to json. More specifically, in the Crowd-AI mapping challenge. The coordinates come out as a long string of letters and characters. here's one for example:

"segmentation": {"size": [512, 512], "counts": "UjU5g2W=6L1O01N3TOdVg2"}, "bbox": [331.0, 321.0, 7.0, 97.0]}, {"image_id": "1Z514B22DA1B2XYZ.jpg", "category_id": 1, "score": 0.9703407883644104, "segmentation": {"size": [512, 512], "counts": "dgU5l1P=U1N1001Nh0cM_Zg2"}, "bbox": [331.0, 207.0, 6.0, 102.0]}, {"image_id": "1Z412D6B6A1B2XYZ.jpg", "category_id": 2, "score": 0.9999915361404419,

that's part of the output of the code in the predictions notebook

Gather all JPG files in the test set as small batches

files = glob.glob(os.path.join(IMAGE_DIR, "*.jpg"))
ALL_FILES=[]
_buffer = []
for _idx, _file in enumerate(files):
if len(_buffer) == config.IMAGES_PER_GPU * config.GPU_COUNT:
ALL_FILES.append(_buffer)
_buffer = []
else:
_buffer.append(_file)

if len(_buffer) > 0:
ALL_FILES.append(_buffer)

Iterate over all the batches and predict

_final_object = []
for files in tqdm.tqdm(ALL_FILES):
images = [skimage.io.imread(x) for x in files]
predictions = model.detect(images, verbose=0)
for _idx, r in enumerate(predictions):
_file = files[_idx]
image_id = os.path.basename(_file)
for _idx, class_id in enumerate(r["class_ids"]):
if class_id == 1:
mask = r["masks"].astype(np.uint8)[:, :, _idx]
bbox = np.around(r["rois"][_idx], 1)
bbox = [float(x) for x in bbox]
_result = {}
_result["image_id"] = image_id
_result["category_id"] = 1
_result["score"] = float(r["scores"][_idx])
_mask = maskUtils.encode(np.asfortranarray(mask))
_mask["counts"] = _mask["counts"].decode("UTF-8")
_result["segmentation"] = _mask
_result["bbox"] = [bbox[1], bbox[0], bbox[3] - bbox[1], bbox[2] - bbox[0]]
_final_object.append(_result)

fp = open("predictions.json", "w")
import json
print("Writing JSON...")
fp.write(json.dumps(_final_object))
fp.close()

Any idea what I'm doing wrong? I'd like to see the mask coordinates in the same format as in your pycococreator.

atlurip · 2019-05-28T12:12:35Z

Hi Now i am facing the same problem.Please let me know the solution if you have fixed

ghost · 2020-02-25T20:21:45Z

You can try this. I haven't tested it to actually work as a valid COCO format though. Let me know if it works for you.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

produces the following output:

{
    "segmentation": [
        [
            7.0,
            5.5,
            6.0,
            5.5,
            5.0,
            5.5,
            4.5,
            5.0,
            4.5,
            4.0,
            4.5,
            3.0,
            4.5,
            2.0,
            5.0,
            1.5,
            6.0,
            1.5,
            7.0,
            1.5,
            7.5,
            2.0,
            7.5,
            3.0,
            7.5,
            4.0,
            7.5,
            5.0,
            7.0,
            5.5
        ],
        [
            0.0,
            5.5,
            0.5,
            6.0,
            0.0,
            6.5
        ]
    ],
    "area": 13,
    "iscrowd": 0,
    "image_id": 123,
    "bbox": [
        0.0,
        2.0,
        8.0,
        5.0
    ],
    "category_id": 1,
    "id": 1
}

how to make this code work for multiple classes not only binary?

SevenMpp · 2020-06-09T11:52:47Z

@waspinator I modified your code to read a file path of mask binary images, and it worked for me. Thank you very much!
Could you please share your method with me

kimsangseob · 2020-09-04T08:28:45Z

You can try this. I haven't tested it to actually work as a valid COCO format though. Let me know if it works for you.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

produces the following output:

{
    "segmentation": [
        [
            7.0,
            5.5,
            6.0,
            5.5,
            5.0,
            5.5,
            4.5,
            5.0,
            4.5,
            4.0,
            4.5,
            3.0,
            4.5,
            2.0,
            5.0,
            1.5,
            6.0,
            1.5,
            7.0,
            1.5,
            7.5,
            2.0,
            7.5,
            3.0,
            7.5,
            4.0,
            7.5,
            5.0,
            7.0,
            5.5
        ],
        [
            0.0,
            5.5,
            0.5,
            6.0,
            0.0,
            6.5
        ]
    ],
    "area": 13,
    "iscrowd": 0,
    "image_id": 123,
    "bbox": [
        0.0,
        2.0,
        8.0,
        5.0
    ],
    "category_id": 1,
    "id": 1
}

Where can I put my input data. I mean I want to convert PNG files to json COCO dataset format, which has x,y cordinates. I want use your code but I don't know where can I put my images directory.
Can you help me...?

IgnacioAmat · 2020-09-14T13:10:04Z

Hi @kimsangseob, for converting an image you will just need to change the value of the variable ground_truth_binary_mask from :
ground_truth_binary_mask = np.array([[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [ 0, 0, 0, 0, 0, 1, 1, 1, 0, 0], [ 0, 0, 0, 0, 0, 1, 1, 1, 0, 0], [ 0, 0, 0, 0, 0, 1, 1, 1, 0, 0], [ 0, 0, 0, 0, 0, 1, 1, 1, 0, 0], [ 1, 0, 0, 0, 0, 0, 0, 0, 0, 0], [ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]], dtype=np.uint8)
to :
ground_truth_binary_mask = cv2.imread(image_location, 0)

Hope it helps !

narimanakh · 2022-01-18T23:50:13Z

@waspinator I modified your code to read a file path of mask binary images, and it worked for me. Thank you very much!
Could you please share your method with me

can u help me how to read binary images from path and convert it to json using the previous code

narimanakh · 2022-01-23T00:14:07Z

can anyone help me to convert polygon format to image to show it

narimanakh · 2022-01-23T00:15:19Z

You can try this. I haven't tested it to actually work as a valid COCO format though. Let me know if it works for you.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

produces the following output:

{
    "segmentation": [
        [
            7.0,
            5.5,
            6.0,
            5.5,
            5.0,
            5.5,
            4.5,
            5.0,
            4.5,
            4.0,
            4.5,
            3.0,
            4.5,
            2.0,
            5.0,
            1.5,
            6.0,
            1.5,
            7.0,
            1.5,
            7.5,
            2.0,
            7.5,
            3.0,
            7.5,
            4.0,
            7.5,
            5.0,
            7.0,
            5.5
        ],
        [
            0.0,
            5.5,
            0.5,
            6.0,
            0.0,
            6.5
        ]
    ],
    "area": 13,
    "iscrowd": 0,
    "image_id": 123,
    "bbox": [
        0.0,
        2.0,
        8.0,
        5.0
    ],
    "category_id": 1,
    "id": 1
}

Where can I put my input data. I mean I want to convert PNG files to json COCO dataset format, which has x,y cordinates. I want use your code but I don't know where can I put my images directory. Can you help me...?

how can i convert this format to mask image to show it

raninbowlalala closed this as completed Mar 9, 2018

dbolya mentioned this issue Jul 15, 2019

Json file dbolya/yolact#94

Closed

marcfielding1 mentioned this issue Sep 26, 2019

Generate Mask from detection matterport/Mask_RCNN#1764

Open

pvtien96 mentioned this issue Feb 3, 2020

Please help me on customizing training datasets facebookresearch/detectron2#795

Closed

qingqing01 mentioned this issue Nov 16, 2020

请问实例分割中MASK的标签是每个像素实例ID该怎么处理？ PaddlePaddle/PaddleDetection#1699

Closed

JavierClearImageAI mentioned this issue Mar 4, 2021

train Panoptic Segmentation model on custom dataset facebookresearch/detectron2#1691

Open

kumariko mentioned this issue Nov 26, 2021

How do I convert a mask to a polygon tensorflow/models#10397

Closed

levan92 mentioned this issue Mar 1, 2022

Details about BitmapMasks & PolygonMasks open-mmlab/mmdetection#7292

Closed

bhack mentioned this issue Mar 21, 2022

Corners bounding box format keras-team/keras-cv#172

Closed

ajaust mentioned this issue Nov 28, 2022

Calculate and return a polygon based on the vds trace mask equinor/vds-slice#51

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert mask binary image to polygon format #131

convert mask binary image to polygon format #131

raninbowlalala commented Mar 1, 2018

waspinator commented Mar 7, 2018 •

edited

raninbowlalala commented Mar 9, 2018

JerryLingjieMei commented Aug 6, 2018

waspinator commented Aug 7, 2018

thecondofitz commented Aug 7, 2018

atlurip commented May 28, 2019 •

edited

ghost commented Feb 25, 2020

SevenMpp commented Jun 9, 2020

kimsangseob commented Sep 4, 2020

IgnacioAmat commented Sep 14, 2020

narimanakh commented Jan 18, 2022

narimanakh commented Jan 23, 2022

narimanakh commented Jan 23, 2022

convert mask binary image to polygon format #131

convert mask binary image to polygon format #131

Comments

raninbowlalala commented Mar 1, 2018

waspinator commented Mar 7, 2018 • edited

raninbowlalala commented Mar 9, 2018

JerryLingjieMei commented Aug 6, 2018

waspinator commented Aug 7, 2018

thecondofitz commented Aug 7, 2018

Gather all JPG files in the test set as small batches

Iterate over all the batches and predict

atlurip commented May 28, 2019 • edited

ghost commented Feb 25, 2020

SevenMpp commented Jun 9, 2020

kimsangseob commented Sep 4, 2020

IgnacioAmat commented Sep 14, 2020

narimanakh commented Jan 18, 2022

narimanakh commented Jan 23, 2022

narimanakh commented Jan 23, 2022

waspinator commented Mar 7, 2018 •

edited

atlurip commented May 28, 2019 •

edited