How to increase number of anchors? #1057


Closed
universewill opened this issue Sep 28, 2020 · 22 comments
Labels
question Further information is requested

Comments

@universewill

I trained YOLOv5 on a custom dataset. However, the results turn out badly on long, thin targets like 213x5 or 6x194.
I want to try increasing the number of anchors to improve the detection results.
How do I increase the number of anchors?

@universewill universewill added the question Further information is requested label Sep 28, 2020
@Aktcob

Aktcob commented Sep 30, 2020

Simply add anchors here: https://github.com/ultralytics/yolov5/blob/master/models/yolov5s.yaml#L8

Please try it before opening an issue.
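
As a rough illustration (not an official recipe), you can extend the anchors: list in the model yaml so every output layer gets an extra (width, height) pair; each layer must keep the same number of pairs. The sketch below assumes a local clone with models/yolov5s.yaml present and PyYAML installed, and the added anchor values are placeholders, not tuned values.

# Hedged sketch: add a 4th anchor pair to each output layer of yolov5s.yaml.
import yaml

with open('models/yolov5s.yaml') as f:
    cfg = yaml.safe_load(f)

print(cfg['anchors'])  # default: 3 (w, h) pairs per output layer
cfg['anchors'] = [
    [10, 13, 16, 30, 33, 23, 50, 10],        # P3/8  (placeholder 4th pair: 50x10)
    [30, 61, 62, 45, 59, 119, 100, 20],      # P4/16 (placeholder 4th pair: 100x20)
    [116, 90, 156, 198, 373, 326, 400, 30],  # P5/32 (placeholder 4th pair: 400x30)
]

with open('models/yolov5s_custom_anchors.yaml', 'w') as f:
    yaml.safe_dump(cfg, f, sort_keys=False)

You would then train with --cfg models/yolov5s_custom_anchors.yaml (the file name is just an example).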

@glenn-jocher
Member

glenn-jocher commented Oct 4, 2020

@universewill yes, just do what @Aktcob recommends. The only constraint is that each output layer must have the same number of anchors. You can also set the field to a single integer to use autoanchor, i.e. in your yaml you can just write:

anchors: 10

to force 10 auto-computed anchors per output layer (30 total).
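
If you prefer to compute anchors yourself rather than letting training do it, yolov5 ships a k-means utility in utils/autoanchor.py. The call below is a hedged sketch run from inside a yolov5 clone; the exact kmean_anchors() location and signature may differ between versions, so check the file first.

# Hedged sketch: compute 30 anchors (10 per output layer) from a dataset yaml.
from utils.autoanchor import kmean_anchors

anchors = kmean_anchors(dataset='data/coco128.yaml', n=30, img_size=640, thr=4.0, gen=1000)
print(anchors)  # (30, 2) array of (width, height) pairs in pixels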

@glenn-jocher
Member

One note is that extreme aspect ratios like your 200x5 example will probably not work very well, as the convolution kernels are for the most part squares rather than rectangles.

@HaolyShiit

One note is that extreme aspect ratios like your 200x5 example will probably not work very well, as the convolution kernels are for the most part squares rather than rectangles.

After training, where are the auto-computed anchors saved?

@glenn-jocher
Member

Anchors are stored as Detect() layer parameters.

m = model.model[-1]  # Detect()
m.anchors  # in stride units
m.anchor_grid  # in pixel units

print(m.anchor_grid.view(-1,2))
tensor([[ 10.,  13.],
        [ 16.,  30.],
        [ 33.,  23.],
        [ 30.,  61.],
        [ 62.,  45.],
        [ 59., 119.],
        [116.,  90.],
        [156., 198.],
        [373., 326.]])

@xinxin342

@glenn-jocher Could you tell me more about it? I set anchors: 5 in hyp.scratch.yaml, but it didn't create any new anchors.
Where should I add the code you wrote? Thank you!

@glenn-jocher
Member

@xinxin342

# anchors: 0 # anchors per output grid (0 to ignore)

@xinxin342

@glenn-jocher I mean, where should I add this code: m = model.model[-1]; print(m.anchor_grid.view(-1,2))

@glenn-jocher
Member

@xinxin342 you can use this code anywhere you want, only you can answer that question.

import torch

model = torch.load('yolov5s.pt')['model']  # load the checkpoint and extract the model

m = model.model[-1]  # Detect()
m.anchors  # in stride units
m.anchor_grid  # in pixel units

print(m.anchor_grid.view(-1,2))
tensor([[ 10.,  13.],
        [ 16.,  30.],
        [ 33.,  23.],
        [ 30.,  61.],
        [ 62.,  45.],
        [ 59., 119.],
        [116.,  90.],
        [156., 198.],
        [373., 326.]])
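
For reference, the two buffers should be related by the layer stride: m.anchors is in grid (stride) units and m.anchor_grid holds the same anchors in pixels. A hedged sketch reusing m from above (shapes and buffer names can vary between yolov5 versions):

for i, s in enumerate(m.stride):     # typically tensor([ 8., 16., 32.])
    print(int(s), m.anchors[i] * s)  # per-layer anchors converted from grid units to pixels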

@HaolyShiit

Anchors are stored as Detect() layer parameters.


Thank you!

@alicera

alicera commented Oct 20, 2020

Hi,
Does detect.py use the anchors?
If it does, where can I find them?
I can't find the code; I only know about yolov5s.yaml.

@glenn-jocher
Member

@alicera this is answered above: #1057 (comment)

@Edwardmark

@glenn-jocher what does the anchor pixel value mean? Does it mean the width and height of the anchor?

@glenn-jocher
Member

Yes

@aidevmin

aidevmin commented Nov 6, 2023

Anchors are stored as Detect() layer parameters.


@glenn-jocher
We define anchor boxes in this file.

anchors:
  - [10,13, 16,30, 33,23]  # P3/8
  - [30,61, 62,45, 59,119]  # P4/16
  - [116,90, 156,198, 373,326]  # P5/32

I know that at the neck we have 3 feature maps:

80x80x256   # P3/8
40x40x512   # P4/16
20x20x1024  # P5/32

How do I map the anchor boxes to the neck feature maps? I mean, where are the anchor boxes located on each feature map? Thanks.

@glenn-jocher
Member

@aidevmin the mapping of anchor boxes to the feature map is determined by the anchor stride and anchor scale. In YOLOv5, the anchor stride is defined in the architecture file (yolov5l.yaml in this case) and specifies the stride or downsampling factor of each feature map. The anchor scale is defined in the same file and specifies the width and height of each anchor box in pixel units.

To calculate the position of anchor boxes on the feature map, you can divide the spatial dimensions of the feature map by the anchor stride. This will give you the grid size of the anchor boxes on the feature map. The x and y coordinates of an anchor box within this grid can then be multiplied by the anchor stride to obtain the pixel coordinates on the original image.

For example, for the P3/8 feature map (80x80x256) with an anchor stride of 8, the anchor grid size would be 10x10. Each anchor box in this feature map would span a 10x10 grid cell. To find the pixel coordinates of an anchor box within this grid, you can multiply the x and y coordinates by the anchor stride (8).

I hope this clarifies the mapping of anchor boxes to the feature map. Let me know if you have any further questions.

@aidevmin

aidevmin commented Nov 6, 2023

@glenn-jocher
Thanks for the quick response. I have read your comment, but I don't fully understand it yet. I need more time to investigate and will follow up later.
I have one more question.
We have anchor boxes

anchors:
  - [10,13, 16,30, 33,23]  # P3/8
  - [30,61, 62,45, 59,119]  # P4/16
  - [116,90, 156,198, 373,326]  # P5/32

We have 3 neck feature maps with input size = 640

80x80x256   # P3/8
40x40x512   # P4/16
20x20x1024  # P5/32

How can we specify the ground-truth masks of objects corresponding to the 3 feature maps above?
Let me give an example to clarify my question:

  • I have 1 image with size w = 400, h = 500. There are 2 objects in the image:
    • Object 1: x_left = 200, y_left = 300, w = 20, h = 30
    • Object 2: x_left = 220, y_left = 270, w = 34, h = 65

How can I specify the locations of object 1 and object 2 on the 3 feature maps above based on the anchor boxes (i.e. map each object's location from the original image to its location on the neck feature maps)? I am also confused about whether one feature map is responsible for one object, or whether all 3 feature maps can be responsible for the same object. The reason I want ground-truth masks on the neck feature maps is to apply them for KD (knowledge distillation).

Sorry for my English.

@glenn-jocher
Member

@aidevmin the anchor boxes in YOLOv5 are used to detect objects at different scales and aspect ratios. The anchor boxes are assigned to the grid cells of the corresponding feature maps based on their spatial location and size. Each feature map is responsible for detecting objects of certain scales.

To specify the ground-truth masks of objects on the feature maps, you need to map the location and size of each object from the original image to the corresponding grid cells on the feature maps.

In your example, you have two objects in an image with specific locations and sizes. To map these objects to the feature maps, you can follow these steps:

  1. Compute the center coordinates (cx, cy) of each object by adding half of its width and height to its top-left coordinates.
  2. Calculate the relative coordinates (rx, ry) of the object centers within the grid cell of each feature map. Divide the absolute coordinates (cx, cy) by the width and height of the respective feature map cells (e.g., 80x80 for P3/8).
  3. Assign each object to the grid cell on the feature map that contains its center coordinates, using the relative coordinates obtained in the previous step.
  4. Calculate the relative width and height (rw, rh) of each object by dividing its width and height by the total width and height of the feature map cells.

By performing these steps, you can assign the ground-truth masks of the objects to the corresponding grid cells on the feature maps. Each feature map will be responsible for detecting objects of certain scales and aspect ratios.

Please note that in YOLO models, each grid cell can be responsible for detecting multiple objects. Therefore, the three feature maps (P3/8, P4/16, P5/32) can collectively be responsible for detecting all the objects present in the image.

I hope this clarifies the process of specifying ground-truth masks on the feature maps. If you have further questions or need more clarification, please let me know.

@aidevmin

aidevmin commented Nov 6, 2023

@glenn-jocher
Thanks.
Let's assume I have an image with size (640, 640). There is one object in the image: x_topleft = 160, y_topleft = 200, w = 24, h = 40. Following your instructions:

1. Compute the center coordinates (cx, cy) of each object by adding half of its width and height to its top-left coordinates.
   I get (cx, cy) = (172, 220).
2. Calculate the relative coordinates (rx, ry) of the object centers within the grid cell of each feature map. Divide the absolute coordinates (cx, cy) by the width and height of the respective feature map cells (e.g., 80x80 for P3/8). You mean rx = cx/80 = 2.15; ry = cy/80 = 2.75.
3. Assign each object to the grid cell on the feature map that contains its center coordinates, using the relative coordinates obtained in the previous step. With rx = 2.15 and ry = 2.75, the center of the object would be at grid cell (2, 2).
4. Calculate the relative width and height (rw, rh) of each object by dividing its width and height by the total width and height of the feature map cells: rw = 24/80 = 0.3, rh = 40/80 = 0.5.

I think your calculation is not correct. This is my reasoning:

1. Compute the center coordinates (cx, cy) of each object by adding half of its width and height to its top-left coordinates.
   I get (cx, cy) = (172, 220).
2. Calculate the relative coordinates (rx, ry) of the object centers within the grid cell of each feature map. Divide the absolute coordinates (cx, cy) by the anchor stride (e.g., for P3/8 the feature map is 80x80 and the stride is 8): rx = cx/8 = 172/8 = 21.5; ry = cy/8 = 220/8 = 27.5.
3. Assign each object to the grid cell on the feature map that contains its center coordinates, using the relative coordinates obtained in the previous step. With rx = 21.5 and ry = 27.5, the center of the object will be at grid cell (21, 27).
4. Calculate the relative width and height (rw, rh) of each object by dividing its width and height by the anchor stride: rw = 24/8 = 3, rh = 40/8 = 5.

So the mask of the object on the 80x80 feature map has its center at (21, 27) with width = 3 and height = 5. What do you think about my reasoning, @glenn-jocher?

I have one more question. The anchor boxes defined in the config under anchors: are for an input size of 640. Is that right?

@glenn-jocher
Member

@aidevmin hi,

Regarding the calculation of the object's location on the feature map, your understanding is correct. The relative coordinates should be computed by dividing the object's center coordinates by the anchor stride, not by the size of the feature map cells. So, your calculations of rx=21.5 and ry=27.5 as the object's center location on the feature map are accurate.

For the relative width and height, dividing by the anchor stride (8, in this case) is also correct. So, rw=3 and rh=5 would represent the object's dimensions on the feature map.

Regarding your second question, the defined anchor boxes in the YAML configuration file you mentioned (yolov5m.yaml) are designed for an input image size of 640x640. These anchor boxes are specific to the YOLOv5m model and are chosen based on the target object scales and aspect ratios. Keep in mind that these anchor boxes work best for images of size 640x640. If you use a different input image size, the anchor boxes may not be optimal. Adjusting the anchor boxes according to your specific input size may be necessary for better performance.

I hope this clarifies your doubts. If you have any further questions or need additional information, feel free to ask.
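
To make the agreed arithmetic concrete, here is a tiny plain-Python sketch (no yolov5 code; the helper name is just for illustration) using the example object above and the P3/8 stride of 8:

def map_box_to_feature_map(x_topleft, y_topleft, w, h, stride):
    # Map a box from image pixels to grid-cell coordinates on a feature map.
    cx, cy = x_topleft + w / 2, y_topleft + h / 2  # box center in pixels
    rx, ry = cx / stride, cy / stride              # center in grid units
    cell = (int(rx), int(ry))                      # grid cell containing the center
    rw, rh = w / stride, h / stride                # box size in grid units
    return cell, (rw, rh)

# Example from the discussion: P3/8 feature map (80x80), stride = 8
print(map_box_to_feature_map(160, 200, 24, 40, stride=8))
# -> ((21, 27), (3.0, 5.0))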

@aidevmin

aidevmin commented Nov 8, 2023

@glenn-jocher
Thank you so much.

@glenn-jocher
Member

@aidevmin thank you for reaching out. We appreciate your interest in YOLOv5. The anchor boxes specified in the configuration file (yolov5m.yaml) are indeed optimized for an input size of 640x640. These anchor boxes are carefully chosen to best capture the scales and aspect ratios of target objects for this specific input size. However, the performance of the anchor boxes may vary for different input image sizes. It is recommended to adjust the anchor boxes according to your specific input size for optimal results.

If you have any further questions or need additional assistance, feel free to ask. We are here to help.
