Loss graph during training #614

abeyang00 · 2018-04-01T03:34:16Z

Is there a way to show loss graph during training like tensorflow?

springkim · 2018-04-01T06:04:24Z

Hi @abeyang00
Here! https://github.com/AlexeyAB/darknet.
He made loss plot for training.

abeyang00 · 2018-04-01T07:38:12Z

@springkim can you tell me where the plot is located in his folder? is it in .c file in src??

ahsan856jalal · 2018-04-03T11:44:26Z

AlexeyAB#504 (comment)
your answer is at the bottom

Caroline1994 · 2018-04-05T14:59:01Z

can someone tell me how to show loss graph during training when i use pjreddie's darknet

Sikandarkhan · 2018-08-31T15:04:09Z

Any update on this thread?

rbarman · 2019-03-22T17:58:13Z

I found one solution here : https://github.com/Jumabek/darknet_scripts/#how-to-plot-yolo-loss

You basically need to save the output of ./darknet detector train <> into a log file and then python plot_yolo_log.py log_file.log

Note that the plot does not show in a jupyter-notebook even with %matplotlib inline. A work around is to copy all plot related code from https://github.com/Jumabek/darknet_scripts/blob/master/plot_yolo_log.py into a new function.

AlexeyAB · 2019-03-22T18:08:45Z

You can use repo https://github.com/AlexeyAB/darknet that shows Loss & mAP chart during Training:

groszste · 2019-04-18T15:14:32Z

@AlexeyAB is this plot of the training loss or validation loss? If training loss, do you have a way of viewing the validation loss?

AlexeyAB · 2019-04-18T19:29:41Z

@groszste
It is Training loss and Validation mAP.
For me it isn't necessary to see Validation loss, it is much better to see Validation mAP.

JakupGuven · 2019-04-20T16:56:56Z

@AlexeyAB
What commands do you use to display validation mAP during training?

kschwethelm · 2019-04-27T12:20:08Z

@JakupGuven

From README: https://github.com/AlexeyAB/darknet/blob/master/README.md

"Or just train with -map flag:

darknet.exe detector train data/obj.data yolo-obj.cfg darknet53.conv.74 -map

So you will see mAP-chart (red-line) in the Loss-chart Window. mAP will be calculated for each 4 Epochs using valid=valid.txt file that is specified in obj.data file (1 Epoch = images_in_train_txt / batch iterations)"

yjdeveloper · 2019-06-20T01:06:17Z

I have followed the steps given by Mr. @AlexeyAB and got the red line but my problem is how to plot a mAP after every 100 iteration. In your documentation until 1000 iteration, but i want in every 100 iteration.

devphilno · 2019-07-21T16:55:49Z

@yjdeveloper have you figured out how to downscale the mAP calculation to a shorter interval?

fcakyon · 2019-09-25T20:52:20Z

@yjdeveloper @snphnolt use this version with -map 0.02 for map calculation at every 0.02 epoch (starts after warmup iterations)

https://github.com/fcakyon/darknet

neso613 · 2020-02-10T07:35:49Z

mAP-chart

Where this map graph has seen?

neso613 · 2020-02-10T08:37:32Z

I have followed the steps given by Mr. @AlexeyAB and got the red line but my problem is how to plot a mAP after every 100 iteration. In your documentation until 1000 iteration, but i want in every 100 iteration.

How do you got the red line?

ghost · 2020-02-25T10:27:01Z

@AlexeyAB

I am using your repo to detect a custom objects using yolov3. however I have get in to trouble. The predictions.jpg image do not draw the confidence score but it draws the class id.

i traced the image.c code and I have found that in the function definition

void draw_detections_v3(image im, detection *dets, int num, float thresh, char **names, image **alphabet, int classes, int ext_output)

how to resolev e the issue?

ghost · 2020-02-26T06:56:09Z

please, anyone, help. which function I have to use in AlexeyAB yolo repository in order to get confidence score drawings on the predictions.jpg image file???? I have get only class Id using this

!./darknet detector test data/trainer.data cfg/yolov3.cfg backup/yolov3_last.weights -thresh 0.1 -iou_thresh 0.3 data/img/tb500.jpg

Leprechault · 2020-03-21T13:15:47Z

You can use repo https://github.com/AlexeyAB/darknet that shows Loss & mAP chart during Training:

The command ./darknet detector demo ... -json_port 8070 -mjpeg_port 8090 works very well, but is there any way to save the image in vectorial format like eg. *pdf, *svg, *ps?

Leprechault · 2020-03-24T13:31:29Z

I found one solution here : https://github.com/Jumabek/darknet_scripts/#how-to-plot-yolo-loss

You basically need to save the output of ./darknet detector train <> into a log file and then python plot_yolo_log.py log_file.log

Note that the plot does not show in a jupyter-notebook even with %matplotlib inline. A work around is to copy all plot related code from https://github.com/Jumabek/darknet_scripts/blob/master/plot_yolo_log.py into a new function.

@rbarman in the log.txt output what's the information about mAP?

ak3509311 · 2020-05-08T23:43:44Z

How to save the loss graph on drive because i run the code on colab .

harshkc03 · 2020-06-16T08:18:13Z

I'm training Yolov3-tiny on colab using the following command-
!./darknet detector train /content/obj.data /content/yolov3-tiny-obj.cfg backup/yolov3-tiny-obj_last.weights -dont_show -mjpeg_port 8090 -map

It shows MJPEG-stream sent in the output after every iteration and i know we have to use http://ip-address:8090 format to access the chart, but I'm unable to find the ip-address of my colab notebook. I tried using addresses from !ifconfig and !curl ipecho.net/plain but still no result.
Any help would be appreciated.

himewel · 2020-07-05T06:15:12Z

@harshkc03 I found this quote in StackOverflow. I still not found a way to propagate the json and graph at same time, but you can try something like this to train and see your graph updating. It prints a url that you can access your loss graph with the follow commands:

!wget https://bin.equinox.io/c/4VmDzA7iaHb/ngrok-stable-linux-amd64.zip
!unzip ngrok-stable-linux-amd64.zip

get_ipython().system_raw('./ngrok http 8090 &')

!curl -s http://localhost:4040/api/tunnels | python3 -c \
 "import sys, json; print(json.load(sys.stdin)['tunnels'][0]['public_url'])"

After this, start your training:

!./darknet detector train /content/obj.data /content/yolov3-tiny-obj.cfg backup/yolov3-tiny-obj_last.weights -dont_show -mjpeg_port 8090 -map

francismontalbo · 2020-07-18T11:11:46Z

Is there a way to produce the loss curve and mAP from an existing weight?

harshkc03 · 2020-07-18T11:44:02Z

@francismontalbo you can obtain mAP of the existing weight using the command-
./darknet detector map data/obj.data yolo-obj.cfg backup\yolo-obj_last.weights
but you cannot generate the loss curve of an existing weight. Loss curve generates only during training.

francismontalbo · 2020-07-18T14:28:16Z

@francismontalbo you can obtain mAP of the existing weight using the command-

./darknet detector map data/obj.data yolo-obj.cfg backup\yolo-obj_last.weights

but you cannot generate the loss curve of an existing weight. Loss curve generates only during training.

Yes, I've been using that. I see, thank you for the response good sir.

wc1997 · 2020-08-04T04:00:48Z

You can use pyngrok python package to display loss graph

!pip install pyngrok
from pyngrok import ngrok# Open a HTTP tunnel on port 8090
public_url = ngrok.connect(port = '8090')

public_url

Then run your training with flags

-mjpeg_port 8090 -map

harshkc03 · 2020-08-25T04:43:09Z

You can use pyngrok python package to display loss graph
!pip install pyngrok
from pyngrok import ngrok# Open a HTTP tunnel on port 8090
public_url = ngrok.connect(port = '8090')

public_url
Then run your training with flags
-mjpeg_port 8090 -map

Thankyou sir, it works as expected.

himewel · 2020-08-25T05:50:38Z

You can use pyngrok python package to display loss graph
!pip install pyngrok
from pyngrok import ngrok# Open a HTTP tunnel on port 8090
public_url = ngrok.connect(port = '8090')

public_url
Then run your training with flags
-mjpeg_port 8090 -map

Seems much more elegant than my response, ty auhdsuahsduahs

vishnuvardhan58 · 2020-11-02T16:31:33Z

Hello , I am getting the following error while using the command "!./darknet detector train data/obj.data cfg/yolov3_custom.cfg darknet53.conv.74 -dont_show -mjpeg_port 8090 -map".I am using google colab.

The connection to http://d80c91c46410.ngrok.io was successfully tunneled to your ngrok client, but the client failed to establish a connection to the local address localhost:8090.

Make sure that a web service is running on localhost:8090 and that it is a valid address.

The error encountered was: dial tcp 127.0.0.1:8090: connect: connection refused

sercangokturk · 2020-11-08T09:20:14Z

I have followed the steps given by Mr. @AlexeyAB and got the red line but my problem is how to plot a mAP after every 100 iteration. In your documentation until 1000 iteration, but i want in every 100 iteration.

Did you find a solution? Thanks in advance.

shawntyshawny · 2021-01-08T12:23:10Z

You can use repo https://github.com/AlexeyAB/darknet that shows Loss & mAP chart during Training:

I've followed this tutorial but my output mAP seems to have started its line from 68% and there is a broken line between 0 - 68%, how do I resolve this?

Doomleet · 2021-01-25T16:16:54Z

You can use repo https://github.com/AlexeyAB/darknet that shows Loss & mAP chart during Training:

Hello! It is work only on Windows? I dont know how use it on Linux. I use tag -map

Doomleet · 2021-01-25T16:45:20Z

You can use repo https://github.com/AlexeyAB/darknet that shows Loss & mAP chart during Training:

Hello! It is work only on Windows? I dont know how use it on Linux. I use tag -map

Ok, i do make without OPENCV=1, now i do make with OPENCV=1 and its work :)

mhdayub · 2021-02-03T17:14:13Z

how to show red line (percentage) and run on what file?

oo92 · 2021-02-22T15:33:21Z

What is the y-axis? What do those numbers on the y-axis represent?

khinmaunghtay4ah · 2021-09-16T09:03:14Z

how to show red line (percentage) and run on what file?

@mhdayub You just have to add -map flag at the end of the command used for training and you will see accuracy-mAP during training.

For e.g, darknet.exe detector train data/obj.data cfg/yolov4-obj.cfg backup/yolov4-obj_last.weights -map

hainguyen201 · 2022-01-03T15:36:06Z

go to darket folder, you will see the chart image file 'chart.png'

DikshitV · 2022-04-28T06:47:54Z

Hi, I am new to yolo. Where can I find the training loss values stored in the darknet? Is the training loss values stored or they are just directly plotted?

ZaynAlk · 2022-07-06T16:02:31Z

Hi, I am new to yolo. Where can I find the training loss values stored in the darknet? Is the training loss values stored or they are just directly plotted?

Go to darknet folder and you can find it there

yeonmnim · 2023-04-02T16:33:57Z

@AlexeyAB
Hi all, i am using the colab to run yolov4 on my custom data, but now it seems like the mAP does not coming out for me. I previously can get the loss graph with a mAP line graph but that was long time ago. So, i have write a txt file to store the log, but it seems like when reaching 1000th iterations, the mAP cannot be calculated. It outputs me the error as shown below :

cuDNN status Error in: file: ./src/convolutional_kernels.cu : () : line: 543 : build time: Apr 2 2023 - 12:39:35

cuDNN Error: CUDNN_STATUS_BAD_PARAM
Darknet error location: ./src/dark_cuda.c, cudnn_check_error, line #204
cuDNN Error: CUDNN_STATUS_BAD_PARAM: Success

and, is there any method to avoid the runtime to get stopped? i have 6000th iterations to run , but it will automatically stops when it reaches 3000 iterations.

khaldonminaga mentioned this issue Apr 13, 2020

Is there a way to measure validation loss? AlexeyAB/darknet#5225

Open

This was referenced Nov 23, 2020

How to graph accuracy/precision and loss function? AlexeyAB/darknet#7016

Open

How to graph accuracy/precision and loss function? #2344

Open

Loss graph during training #614

Loss graph during training #614

Comments

abeyang00 commented Apr 1, 2018

springkim commented Apr 1, 2018

abeyang00 commented Apr 1, 2018

ahsan856jalal commented Apr 3, 2018

Caroline1994 commented Apr 5, 2018

Sikandarkhan commented Aug 31, 2018

rbarman commented Mar 22, 2019

AlexeyAB commented Mar 22, 2019

groszste commented Apr 18, 2019

AlexeyAB commented Apr 18, 2019

JakupGuven commented Apr 20, 2019

kschwethelm commented Apr 27, 2019

yjdeveloper commented Jun 20, 2019 • edited Loading

devphilno commented Jul 21, 2019

fcakyon commented Sep 25, 2019 • edited Loading

neso613 commented Feb 10, 2020

neso613 commented Feb 10, 2020

ghost commented Feb 25, 2020

ghost commented Feb 26, 2020

Leprechault commented Mar 21, 2020

Leprechault commented Mar 24, 2020

ak3509311 commented May 8, 2020

harshkc03 commented Jun 16, 2020 • edited Loading

himewel commented Jul 5, 2020 • edited Loading

francismontalbo commented Jul 18, 2020

harshkc03 commented Jul 18, 2020

francismontalbo commented Jul 18, 2020

wc1997 commented Aug 4, 2020

harshkc03 commented Aug 25, 2020

himewel commented Aug 25, 2020

vishnuvardhan58 commented Nov 2, 2020 • edited Loading

sercangokturk commented Nov 8, 2020

shawntyshawny commented Jan 8, 2021 • edited Loading

Doomleet commented Jan 25, 2021

Doomleet commented Jan 25, 2021

mhdayub commented Feb 3, 2021

oo92 commented Feb 22, 2021

khinmaunghtay4ah commented Sep 16, 2021 • edited Loading

hainguyen201 commented Jan 3, 2022

DikshitV commented Apr 28, 2022

ZaynAlk commented Jul 6, 2022

yeonmnim commented Apr 2, 2023 • edited Loading

yjdeveloper commented Jun 20, 2019 •

edited

Loading

fcakyon commented Sep 25, 2019 •

edited

Loading

harshkc03 commented Jun 16, 2020 •

edited

Loading

himewel commented Jul 5, 2020 •

edited

Loading

vishnuvardhan58 commented Nov 2, 2020 •

edited

Loading

shawntyshawny commented Jan 8, 2021 •

edited

Loading

khinmaunghtay4ah commented Sep 16, 2021 •

edited

Loading

yeonmnim commented Apr 2, 2023 •

edited

Loading