This repository has been archived by the owner on Nov 21, 2023. It is now read-only.

The RLE or Polygon format of "segmentation" for extending to coco dataset #100

Open
topcomma opened this issue Feb 2, 2018 · 43 comments


@topcomma

topcomma commented Feb 2, 2018

Hi Detectron,

Recently I tried to add my custom coco data to run Detectron and encountered the following issues.
(1) "segmentation" in coco data like below,

{"segmentation": [[499.71, 397.28,......342.71, 172.31]], "area": 43466.12825, "iscrowd": 0, "image_id": 182155, "bbox": [338.89, 51.69, 205.82, 367.61], "category_id": 1, "id": 1248258},

{"segmentation": {"counts": [66916, 6, 587,..... 1, 114303], "size": [594, 640]}, "area": 6197, "iscrowd": 1, "image_id": 284445, "bbox": [112, 322, 335, 94], "category_id": 1, "id": 9.001002844e+11},
The first "segmentation" format is a polygon; the second is RLE, which needs to be encoded/decoded.

Both formats can run on Detectron.

(2) I added a new category and generated a new RLE-format "segmentation" field via the COCO API's encode()/decode() mask functions.
The generated data looks like this:

"segmentation": [{"counts": "mng=1fb02O1O1O001N2O001O1O0O2O1O1O001N2O001O1O0O2O1O001O1O1O010000O01000O010000O01000O01000O01000O01N2N2M2O2N2N1O2N2O001O10O?B000O10O1O001^OQ^O9Pb0EQ^O;Wb0OO01O1O1O001O1N2N`jT3","size": [600,1000]}]

I found that the bolded characters differ from the original COCO "segmentation" JSON format, although it can run on Matterport's Mask R-CNN implementation.

Also, I tried to modify some of Detectron's code to meet my requirement, but it is very difficult for me because a lot of code would need to change.

Could you give me some suggestions for running my custom data?

Thanks.

@amokeev

amokeev commented Feb 2, 2018 via email

@topcomma

topcomma commented Feb 2, 2018

@amokeev How do I convert the "RLE" format to "poly" for your workaround?

@topcomma

topcomma commented Feb 2, 2018

@amokeev Are you sure the numbers in segmentation are (x, y) vertexes? For example, "66916" is a large number! Also, although I set "iscrowd" to "1" for the RLE format, it still could not run on Detectron.

@amokeev

amokeev commented Feb 2, 2018 via email

@Sundrops

Sundrops commented Feb 4, 2018

@topcomma
Maybe you can try to convert mask to polys.

import cv2
import numpy as np

labels_info = []
for mask in mask_list:
    # OpenCV 3.2: findContours returns (image, contours, hierarchy)
    mask_new, contours, hierarchy = cv2.findContours(mask.astype(np.uint8), cv2.RETR_TREE,
                                                     cv2.CHAIN_APPROX_SIMPLE)
    # Before OpenCV 3.2 (and again in OpenCV 4.x), it returns (contours, hierarchy):
    # contours, hierarchy = cv2.findContours(mask.astype(np.uint8), cv2.RETR_TREE,
    #                                        cv2.CHAIN_APPROX_SIMPLE)
    segmentation = []

    for contour in contours:
        contour = contour.flatten().tolist()
        # segmentation.append(contour)
        if len(contour) > 4:
            segmentation.append(contour)
    if len(segmentation) == 0:
        continue
    # get area, bbox, category_id and so on
    labels_info.append(
        {
            "segmentation": segmentation,  # poly
            "area": area,  # segmentation area
            "iscrowd": 0,
            "image_id": index,
            "bbox": [x1, y1, bbox_w, bbox_h],
            "category_id": category_id,
            "id": label_id
        },
    )
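The `# get area, bbox, category_id and so on` placeholder above can be filled in with plain NumPy. A minimal sketch (assuming `mask` is a binary H×W array; `category_id`, `index`, and `label_id` still come from your own dataset):

```python
import numpy as np

def mask_area_bbox(mask):
    """Area (foreground pixel count) and COCO-style [x, y, w, h] bbox."""
    mask = np.asarray(mask, dtype=bool)
    area = int(mask.sum())
    ys, xs = np.nonzero(mask)
    x1, y1 = int(xs.min()), int(ys.min())
    bbox_w = int(xs.max()) - x1 + 1
    bbox_h = int(ys.max()) - y1 + 1
    return area, [x1, y1, bbox_w, bbox_h]
```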

@topcomma

topcomma commented Feb 5, 2018

@amokeev ,
@Sundrops

Thanks for your suggestions.
Will try.

@topcomma

topcomma commented Feb 5, 2018

@Sundrops Using your method, I can get the "poly" list. Thank you very much! But I still don't know why the coordinates in the COCO JSON file are such large/small numbers, e.g. "66916" or "1".

@Sundrops

Sundrops commented Feb 5, 2018

@topcomma COCO annotations have two types of segmentation annotations:

  1. polygon (object instances): [[499.71, 397.28,......342.71, 172.31]], a list of (x, y) vertexes
  2. uncompressed RLE (crowd): "segmentation": {"counts": [66916, 6, 587,..... 1, 114303], "size": [594, 640]}, where 66916 is the length of the initial run of 0-pixels, counted in column-major order.

Polygons and uncompressed RLE are converted to the compact RLE format with the MaskApi.
The compact RLE format:
"segmentation": [{"counts": "mng=1fb02O1O1O001N2O001O1O0O2O1O1O001N2O001O1O0O2O1O001O1O1O010000O01000O010000O01000O01000O01000O01N2N2M2O2N2N1O2N2O001O10O?B000O10O1O001^OQ^O9Pb0EQ^O;Wb0OO01O1O1O001O1N2N`jT3", "size": [600,1000]}]
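For reference, a pure-Python sketch of how those compact strings are packed, reconstructed from MaskApi's rleToString/rleFrString C code (illustrative only; in practice use pycocotools): counts are emitted 5 bits at a time into printable characters offset by 48, bit 0x20 flags a continuation chunk, bit 0x10 of the final chunk carries the sign, and every count from the 4th onward is stored as a delta against the count two positions back.

```python
def rle_string_encode(counts):
    """Pack a list of run lengths into a compact COCO RLE string."""
    chars = []
    for i, x in enumerate(counts):
        if i > 2:
            x -= counts[i - 2]          # delta coding from the 4th count on
        more = True
        while more:
            c = x & 0x1F                # low 5 bits
            x >>= 5                     # arithmetic shift, like a signed C long
            more = (x != -1) if (c & 0x10) else (x != 0)
            if more:
                c |= 0x20               # continuation flag
            chars.append(chr(c + 48))
    return "".join(chars)

def rle_string_decode(s):
    """Unpack a compact COCO RLE string back into run lengths."""
    counts, p = [], 0
    while p < len(s):
        x, k, more = 0, 0, True
        while more:
            c = ord(s[p]) - 48
            x |= (c & 0x1F) << (5 * k)
            more = bool(c & 0x20)
            p += 1
            k += 1
            if not more and (c & 0x10):
                x |= -1 << (5 * k)      # sign-extend the last chunk
        if len(counts) > 2:
            x += counts[-2]             # undo the delta coding
        counts.append(x)
    return counts
```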

@topcomma

topcomma commented Feb 5, 2018

@Sundrops ,
Thanks for your great help.
My custom coco-like data can be trained on Detectron now.

@lg12170226

@topcomma ,
I have the same problem as you.
Following Sundrops's method, I can't find the file that converts masks to polys.
Could you tell me which file? Thank you very much!

@topcomma

topcomma commented Feb 6, 2018

@lg12170226
You may refer to the COCO-Stuff (https://github.com/nightrome/cocostuff) Python code and implement it yourself.
There is no file for this conversion in the codebase.

@John1231983

@topcomma: I have a raw image and N label images; each label is stored in a single file, so I have N images of labels. I want to train Mask R-CNN on my own dataset, so I first need to convert it to the COCO format. Could you share the code for converting to COCO style? Thanks.

@realwecan

realwecan commented Feb 24, 2018

Just wondering if there is a way to convert compressed RLEs to polys/uncompressed RLEs?

@hazirbas

@realwecan after decoding the RLE with pycocotools.mask.decode, you can check my implementation for generating polygons with OpenCV:

coco-json-converter

@John1231983

@hazirbas: thanks for your code. Why not use DAVIS 2017, which contains instance segmentation? Could we use your code to convert DAVIS 2017 to the COCO format in order to use this Mask R-CNN implementation?

@hazirbas

hazirbas commented Feb 28, 2018

@John1231983 you need to modify the script accordingly for reading split files as well as db_info.yml file. For my own research, I needed it for DAVIS 2016.

@waspinator

waspinator commented Mar 15, 2018

Another solution for generating polygons, using skimage instead of OpenCV.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

@waspinator

How would you convert a binary mask or an encoded RLE into an uncompressed RLE for use in the "counts" field when "iscrowd": 1?
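One way to do this (a hedged sketch, not a pycocotools API): the uncompressed "counts" list is just the run lengths over the mask in column-major (Fortran) order, alternating 0-runs and 1-runs and always starting with a 0-run (which may have length 0):

```python
import numpy as np

def binary_mask_to_uncompressed_rle(mask):
    """Uncompressed COCO RLE: alternating 0/1 run lengths, column-major."""
    flat = np.asarray(mask, dtype=np.uint8).flatten(order="F")
    counts, prev, run = [], 0, 0
    for v in flat:
        if v == prev:
            run += 1
        else:
            counts.append(run)
            prev, run = v, 1
    counts.append(run)
    return {"counts": counts, "size": list(mask.shape)}
```

For an encoded (compact) RLE, decode it to a binary mask with pycocotools.mask.decode first, then apply the same counting.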

@Kongsea

Kongsea commented Apr 5, 2018

@waspinator The segmentations produced by your skimage code are different from those produced by @Sundrops's cv2 code.
Are both results correct, and can both be used by Detectron? Please give me some advice, thank you. @waspinator @Sundrops

Results using your code with skimage:

{"segmentation": [[0.0, 252.00196078431372, 1.0, 252.00196078431372, 2.0, 252.00196078431372, 3.0, 252.00196078431372, 4.0, 252.00196078431372, 5.0, 252.00196078431372, 6.0, 252.00196078431372, 7.0, 252.00196078431372, 8.0, 252.00196078431372, 9.0, 252.00196078431372, 10.0, 252.00196078431372, 11.0, 252.00196078431372, 12.0, 252.00196078431372, 13.0, 252.00196078431372, 14.0, 252.00196078431372, 15.0, 252.00196078431372, 16.0, 252.00196078431372, 17.0, 252.00196078431372, 18.0, 252.00196078431372, 19.0, 252.00196078431372, 20.0, 252.00196078431372, 21.0, 252.00196078431372, 22.0, 252.00196078431372, 23.0, 252.00196078431372, 24.0, 252.00196078431372, 25.0, 252.00196078431372, 26.0, 252.00196078431372, 27.0, 252.00196078431372, 28.0, 252.00196078431372, 29.0, 252.00196078431372, 30.0, 252.00196078431372, 31.0, 252.00196078431372, 32.0, 252.00196078431372, 33.0, 252.00196078431372, 34.0, 252.00196078431372, 35.0, 252.00196078431372, 36.0, 252.00196078431372, 37.0, 252.00196078431372, 38.0, 252.00196078431372, 39.0, 252.00196078431372, 40.0, 252.00196078431372, 41.0, 252.00196078431372, 42.0, 252.00196078431372, 43.0, 252.00196078431372, 44.0, 252.00196078431372, 45.0, 252.00196078431372, 46.0, 252.00196078431372, 47.0, 252.00196078431372, 48.0, 252.00196078431372, 49.0, 252.00196078431372, 50.0, 252.00196078431372, 51.0, 252.00196078431372, 52.0, 252.00196078431372, 53.0, 252.00196078431372, 54.0, 252.00196078431372, 55.0, 252.00196078431372, 56.0, 252.00196078431372, 57.0, 252.00196078431372, 58.0, 252.00196078431372, 59.0, 252.00196078431372, 60.0, 252.00196078431372, 61.0, 252.00196078431372, 62.0, 252.00196078431372, 63.0, 252.00196078431372, 64.0, 252.00196078431372, 65.0, 252.00196078431372, 66.0, 252.00196078431372, 67.0, 252.00196078431372, 68.0, 252.00196078431372, 69.0, 252.00196078431372, 70.0, 252.00196078431372, 71.0, 252.00196078431372, 72.0, 252.00196078431372, 73.0, 252.00196078431372, 74.0, 252.00196078431372, 75.0, 252.00196078431372, 76.0, 
252.00196078431372, 77.0, 252.00196078431372, 78.0, 252.00196078431372, 79.0, 252.00196078431372, 80.0, 252.00196078431372, 81.0, 252.00196078431372, 82.0, 252.00196078431372, 83.0, 252.00196078431372, 84.0, 252.00196078431372, 85.0, 252.00196078431372, 86.0, 252.00196078431372, 87.0, 252.00196078431372, 88.0, 252.00196078431372, 89.0, 252.00196078431372, 90.0, 252.00196078431372, 91.0, 252.00196078431372, 92.0, 252.00196078431372, 93.0, 252.00196078431372, 93.00196078431372, 252.0, 94.0, 251.00196078431372, 95.0, 251.00196078431372, 96.0...

Results with @Sundrops code using cv2:

[94, 252, 93, 253, 0, 253, 0, 286, 188, 286, 188, 269, 187, 268, 187, 252]

@waspinator

waspinator commented Apr 5, 2018

@Kongsea I haven't tested @Sundrops's cv2 implementation, but the basic idea should be the same. They will produce different results, since there is an infinite number of point sets that can describe a given shape, but otherwise both should work. I just didn't have cv2 installed, so I wrote something that doesn't require it.

@Sundrops

Sundrops commented Apr 5, 2018

@Kongsea @waspinator I have tested my code. It works.

@Kongsea

Kongsea commented Apr 6, 2018

Thank you @Sundrops @waspinator .
I will have a try.

@ambigus9

@waspinator Is there any way to convert a segmentation poly (vertex list) to RLE? My goal is to set iscrowd=1.

@Yuliang-Zou

@Sundrops Why did you comment this part in your code?

# if len(contour) > 4:
#     segmentation.append(contour)
# if len(segmentation) == 0:
#     continue

We indeed need to handle such a case, right?

@Sundrops

@Yuliang-Zou You should uncomment this part when the contours are used for Detectron, because Detectron will treat the segmentation as a rectangle when len(contour) == 4. I have updated my previous code.

@Yuliang-Zou

@Sundrops Thanks. But we still need to handle len(contour)==2, right?

@Sundrops

@Yuliang-Zou Yes, but the line if len(contour) > 4: already handles both len(contour) == 2 and len(contour) == 4.

@Yuliang-Zou

@Sundrops I see, thank you!

@waspinator

I wrote a library and article to help with creating COCO style datasets.

https://patrickwasp.com/create-your-own-coco-style-dataset/

@soumenms2015

@Sundrops & @topcomma I have a problem loading the annotated data in pycocotools, as my annotations include segmentation without masks. Any idea how to visualize annotations without masks in pycocotools?

@manketon

@Sundrops
ann: {
"segmentation": [
[312.29, 562.89, 402.25, 511.49, 400.96, 425.38, 398.39, 372.69, 388.11, 332.85, 318.71, 325.14, 295.58, 305.86, 269.88, 314.86, 258.31, 337.99, 217.19, 321.29, 182.49, 343.13, 141.37, 348.27, 132.37, 358.55, 159.36, 377.83, 116.95, 421.53, 167.07, 499.92, 232.61, 560.32, 300.72, 571.89]
],
"area": 54652.9556,
"iscrowd": 0,
"image_id": 480023,
"bbox": [116.95, 305.86, 285.3, 266.03],
"category_id": 58,
"id": 86
}
How can I calculate the area with mask.py in the coco-api? Thank you.
My code is as follows, but it raises an error:

segmentation = ann['segmentation']
bimask = np.array(segmentation, dtype = np.uint8, order = 'F')
print("bimask:", bimask)
rleObjs = mask.encode(bimask)
print("rleObjs:", rleObjs)
area = mask.area(rleObjs)
print("area:", area)

@Sundrops

@manketon
Maybe you can try cv2, but I'm not sure it's right. It's just an example from Removing contours from an image using Python and OpenCV.

import cv2
import numpy as np

def is_contour_bad(c):
    # approximate the contour
    peri = cv2.arcLength(c, True)
    approx = cv2.approxPolyDP(c, 0.02 * peri, True)
    # return True if it is not a rectangle
    return not len(approx) == 4

image = cv2.imread('xx.jpg')
# reshape each flat [x1, y1, x2, y2, ...] poly into an OpenCV contour
contours = [np.array(p, dtype=np.int32).reshape(-1, 1, 2)
            for p in ann['segmentation']]
mask = np.ones(image.shape[:2], dtype="uint8") * 255
# loop over the contours
for c in contours:
    # if the contour is not a rectangle, draw it on the mask
    if is_contour_bad(c):
        cv2.drawContours(mask, [c], -1, 0, -1)
area = (mask == 0).sum()
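A dependency-free alternative for @manketon's area question (a sketch, not the COCO API): compute the polygon's geometric area directly with the shoelace formula. Note this gives the exact polygon area, which can differ slightly from COCO's pixel-count area.

```python
def shoelace_area(poly):
    """Area of a simple polygon given as a flat [x1, y1, x2, y2, ...] list."""
    xs, ys = poly[0::2], poly[1::2]
    n = len(xs)
    s = 0.0
    for i in range(n):
        j = (i + 1) % n                      # wrap around to the first vertex
        s += xs[i] * ys[j] - xs[j] * ys[i]
    return abs(s) / 2.0
```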

@wangg12

wangg12 commented Sep 23, 2018

@Sundrops @waspinator I have a question. If my original object mask has big holes, how should I correctly convert it back after it has been converted to polygons? The decode and merge functions in the COCO API treat the holes as part of the object, so after converting back, the holes become part of the mask. What should I do in this case?

@waspinator

@wangg12 As far as I know COCO doesn't have a native way of encoding holes.

@BobZhangHT

for contour in contours:
        contour = contour.flatten().tolist()
        segmentation.append(contour)
        if len(contour) > 4:
            segmentation.append(contour)

Hi @Sundrops, sincere thanks for your code. I'm a beginner with Detectron. I'm confused about why the length of contour (which is a list) should be larger than 4; in other words, what happens to the model when the length is 4 or smaller? One of your replies said that Detectron will treat it as a rectangle. From my perspective, some objects may actually have a rectangular shape, so I think that's fine. Also, I wonder whether you append the contour twice in one iteration. I think the right code should be:

for contour in contours:
        contour = contour.flatten().tolist()
        if len(contour) > 4:
            segmentation.append(contour)

It will be highly appreciated if you can give me some suggestions. Thank you so much!

@Sundrops

Sundrops commented Jan 7, 2019

@BobZhangHT Yes, the contour should be appended only once per iteration.
For your first question: if len(ann['segmentation'][0]) == 4, the cocoapi will assume all entries are rectangles.

# cocoapi/PythonAPI/pycocotools/coco.py
def annToRLE(self, ann):
    t = self.imgs[ann['image_id']]
    h, w = t['height'], t['width']
    segm = ann['segmentation']
    if type(segm) == list:
        rles = maskUtils.frPyObjects(segm, h, w)
        rle = maskUtils.merge(rles)
    ...

# cocoapi/PythonAPI/pycocotools/_mask.pyx
def frPyObjects(pyobj, h, w):
    # encode rle from a list of python objects
    if type(pyobj) == np.ndarray:
        objs = frBbox(pyobj, h, w)
    elif type(pyobj) == list and len(pyobj[0]) == 4:
        objs = frBbox(pyobj, h, w)
    elif type(pyobj) == list and len(pyobj[0]) > 4:
        objs = frPoly(pyobj, h, w)
    ...
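That dispatch is the pitfall: a polygon whose first entry has exactly 4 numbers is silently reinterpreted as a bbox. A tiny helper mirroring the logic above (hypothetical, not part of cocoapi) can be used to sanity-check annotations before training:

```python
def segm_kind(segm):
    """Mimic frPyObjects' dispatch on a COCO 'segmentation' value."""
    if isinstance(segm, list) and len(segm[0]) == 4:
        return "bbox"        # treated as [x, y, w, h] boxes, not a polygon!
    if isinstance(segm, list) and len(segm[0]) > 4:
        return "polygon"
    return "rle"             # dict with 'counts' and 'size'
```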

@BobZhangHT

@Sundrops Thanks for your reply!

@enemni

enemni commented Apr 25, 2019

I interpret poly as a list of polygons defined by their vertexes, like [[x1,y1,x2,y2,…,xN,yN], …, [x1,y1,x2,y2,…,xN,yN]], where the coordinates are at the same scale as the image. Masks encoded this way are shown correctly by the CocoAPI [1]. But you may want to get an "official" answer. [1] https://github.com/cocodataset/cocoapi

On 2 Feb 2018, at 09:18, Hu Xingui wrote: @amokeev, are you sure the numbers in segmentation are (x, y) vertexes? For example, "66916" is a large number! Also, although I set "iscrowd" to "1" for the RLE format, it could not run on Detectron.

How can I transform RLE to polygons when "iscrowd" is "1", since matterport/Mask_RCNN works only with polygons? @amokeev I want to use the Matterport implementation of Mask R-CNN with a COCO dataset created using pycococreator.

@bhavanisambaturu

@topcomma COCO annotations have two types of segmentation annotations

  1. polygon (object instances): [[499.71, 397.28,......342.71, 172.31]], a list of (x, y) vertexes
  2. uncompressed RLE (crowd): "segmentation": {"counts": [66916, 6, 587,..... 1, 114303], "size": [594, 640]}, where 66916 is the length of the initial run of 0-pixels, counted in column-major order.

Polygons and uncompressed RLE are converted to the compact RLE format with the MaskApi.
The compact RLE format:
"segmentation": [{"counts": "mng=1fb02O1O1O001N2O001O1O0O2O1O1O001N2O001O1O0O2O1O001O1O1O010000O01000O010000O01000O01000O01000O01N2N2M2O2N2N1O2N2O001O10O?B000O10O1O001^OQ^O9Pb0EQ^O;Wb0OO01O1O1O001O1N2N`jT3", "size": [600,1000]}]

I still don't understand what 66916 and 114303 stand for.

@bhavanisambaturu


I understood: 66916 is the number of 0-pixels before the first mask pixel, and 114303 is the number of 0-pixels after the last mask pixel (counting in column-major order).

@balajihosur

Another solution to generating polygons, but using skimage instead of opencv.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

Another solution to generating polygons, but using skimage instead of opencv.

import json
import numpy as np
from pycocotools import mask
from skimage import measure

ground_truth_binary_mask = np.array([[  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  0,   0,   0,   0,   0,   1,   1,   1,   0,   0],
                                     [  1,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0],
                                     [  0,   0,   0,   0,   0,   0,   0,   0,   0,   0]], dtype=np.uint8)

fortran_ground_truth_binary_mask = np.asfortranarray(ground_truth_binary_mask)
encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
ground_truth_area = mask.area(encoded_ground_truth)
ground_truth_bounding_box = mask.toBbox(encoded_ground_truth)
contours = measure.find_contours(ground_truth_binary_mask, 0.5)

annotation = {
        "segmentation": [],
        "area": ground_truth_area.tolist(),
        "iscrowd": 0,
        "image_id": 123,
        "bbox": ground_truth_bounding_box.tolist(),
        "category_id": 1,
        "id": 1
    }

for contour in contours:
    contour = np.flip(contour, axis=1)
    segmentation = contour.ravel().tolist()
    annotation["segmentation"].append(segmentation)
    
print(json.dumps(annotation, indent=4))

--> 12 encoded_ground_truth = mask.encode(fortran_ground_truth_binary_mask)
TypeError: encode() argument 'encoding' must be str, not numpy.ndarray
I am getting this error. Please help.


@SajjadIqbal-git

@Sundrops How do I convert mask images into annotation files in the YOLOv8 format?
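A hedged sketch, assuming the Ultralytics YOLOv8 segmentation label format (one text line per object: a class id followed by normalized x y polygon pairs; verify this against the Ultralytics docs before relying on it). Given a polygon extracted from a mask as in the earlier comments, one line could be written out like this:

```python
def poly_to_yolo_seg_line(class_id, poly, img_w, img_h):
    """One YOLO-style segmentation label line: 'class x1 y1 x2 y2 ...',
    with coordinates normalized to [0, 1] by image width/height."""
    coords = []
    for i in range(0, len(poly), 2):
        coords.append(poly[i] / img_w)       # x normalized by width
        coords.append(poly[i + 1] / img_h)   # y normalized by height
    return " ".join([str(class_id)] + ["%.6f" % c for c in coords])
```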
