redid the YOLO object detection algorithm #176

hiddentn · 2018-07-09T23:13:11Z

redid the YOLO object detection algorithm
more simple & understandable
no more memory leaks
performance boost (i think )

a normal usage example should look like this

let options = { url: '...', }
let yolo = new ml5.YOLO(options); 
let loaded = await yolo.loadModel()
if(loaded){
   let results = await yolo.detect(image ||  video || canvas)
}

here's a working example(image) ~~https://codepen.io/hiddentn/pen/NBjPKR~~
another one with a video element ~~https://codepen.io/hiddentn/pen/jpmbRy~~

mw108 · 2018-07-22T10:47:19Z

./YOLO/index.js Line 12:

import { iou } from './utils';

Isn't this supposed to be import iou from './utils';, because iou is a default export? Otherwise iou is unknown: Uncaught (in promise) TypeError: (0 , p.iou) is not a function

./YOLO/index.js Line 175:

const boxClassProbMask = tf.greaterEqual(boxScores, ...);

Isn't this supposed to be tf.greaterEqual(boxScores1, ...);?

mw108 · 2018-07-22T10:57:21Z

p5 images and video are not supported anymore?

hiddentn · 2018-07-23T17:01:51Z

@mw108 thank you , i guess i missed those mistakes :)

concerning p5 images & video i am not really sure how to handle them , anyone is welcome to join in and help me with a pr

hiddentn · 2018-07-23T18:20:15Z

here's a working example(image) ~~https://codepen.io/hiddentn/pen/NBjPKR~~
another one with a video element ~~https://codepen.io/hiddentn/pen/jpmbRy~~
another one with a webcam ~~https://codepen.io/hiddentn/pen/OwmMPM~~

mw108 · 2018-07-24T07:11:47Z

@TheHidden1 Thanks for the quick fixes. :)

I fiddled around a bit with p5 and it turned out that supporting p5 images and webcam is actually pretty easy.

For images, you need to pass img.canvas to the yolo.detect() function.
Example: https://codepen.io/mw-108/pen/gjWGWX

And for webcam, you need to pass video.elt to the yolo.detect() function.
Example: https://codepen.io/mw-108/pen/XBRaQG

cvalenzuela · 2018-07-24T15:10:39Z

Thanks @TheHidden1 and @mw108! this is great work!
We will need to update way the method is created. This has changed since you forked the library. See: https://github.com/ml5js/ml5-library/blob/master/src/YOLO/index.js#L146

cvalenzuela · 2018-08-06T15:21:51Z

Thanks for the hard work @TheHidden1! There are issue with the tests. Let's fix them and move to get this merge!

shiffman · 2018-08-08T14:07:35Z

For integration with p5 receiving the bounding box as x,y,w,h is probably ideal since those are the defaults with rect(). Thanks for your contributions!

cvalenzuela · 2018-08-08T14:32:46Z

Following @shiffman, x,y,w,h makes sense!
For managing different YOLO versions, I think we can support two cases:

const yolo = new ml5.yolo('v2')

and also this:

const options = { 
  version: '3'
 // other options
} 
const yolo = new ml5.yolo(options)

shiffman · 2018-08-08T14:39:16Z

Aligning with our API though we would wrap in a function (avoiding new) like so? And YOLO all caps?

const yolo = ml5.YOLO('v2')

cvalenzuela · 2018-08-08T16:00:56Z

ups! yes, missed that!

hiddentn · 2018-08-15T09:43:20Z

i think yolov2 is ready for a merge , it output [x y w h] now.(x,y being to coordinates for the center point not the top-left point )
tiny-yolov2:

tiny-yolov3:

EDIT : this is on :

 { IOUThreshold: 0.4, filterBoxesThreshold: 0.01, classProbThreshold: 0.4,}

someone help with the test 😭 😭 😭 😭 😭 🤣

cvalenzuela · 2018-08-16T18:25:52Z

src/YOLO/index_test.js

@@ -25,14 +25,15 @@ describe('YOLO', () => {



The test error is:

TypeError: Cannot read property 'filterBoxesThreshold' of undefined

can you try defining the variable here first. Like this: https://github.com/ml5js/ml5-library/blob/master/src/ImageClassifier/index_test.js#L20

joeyklee · 2019-01-24T18:03:52Z

Hello @TheHidden1
Thanks for all your work on this PR. As the ml5 crew going back through the existing PRs after the busy last months, we're curious to know whether or not you're still interested in integrating refactor of the YOLO for ml5. If so, would you be able to update your PR and check for any breaking changes or issues?

I've tried to run your re-implementation and I get the following error at the .predict() function in YOLO. It seems that there's an issue with the incoming image data and for whatever reason the .predict() function isn't returning anything.

(in firefox)

(in chrome)

If I run the YOLO detection on a simple example as it is currently implemented in ml5.js v0.1.3, then this is what I see:

Let us know if you're interested to explore this further. Many thanks!

hiddentn · 2019-01-27T18:43:43Z

i will check it out right away

hiddentn · 2019-01-27T22:22:34Z

it seems to be working for me (i think you missed yolo.load() in your example )
here is a working codepen : ~~https://codepen.io/hiddentn/full/gqrBeO~~

( i am not sure about the firefox error though)

shiffman · 2019-01-28T02:28:17Z

Thanks for jumping back into this @TheHidden1! 🎉🎉🎉

@joeyklee happy to look at this together sometime this week if that would be helpful, perhaps merging pull requests could be a good activity for our Friday sessions! 🌈

joeyklee · 2019-01-28T15:09:51Z

@TheHidden1 - Thanks so much for the updates! @shiffman and I will have a look this week and let you know if/when we merge! Many thanks!

added support for yolo v3 models & made some changed to the post proccessing alogorithm now wen can chang the model input size on the fly

hiddentn · 2019-01-30T17:40:07Z

@shiffman @joeyklee i cleaned up some stuff and added support for yolo v3 models :

one major thing is that i added is the ability to change the model input size witch gives a good trade off between accuracy and speed . please see redid the yolo model ml5-data-and-models#31

I restructured the post processing algorithm to be more clear & efficient (i think) , there is also a V2
that implements tf.image.nonMaxSuppression() but it seems to be slowing things down a bit (10 - 20 ms)
this how the new config shoud be :

     // this an example for the tiny yolov2 model
 let config = {
     version: 'v2', /* 'v2' || 'v3' */

     /*
       128 || 144 || 224 || 256 || 320 || 416 (or any multiple of 32 rly)
       we can change this on the fly  now wohoo!
      */
     modelSize: 416 ,   

     URL: '',

     /* inference parameters */
     IOUThreshold: 0.5,
     classProbThreshold: 0.4,

     /* this mask defines  witch anchors go to witch layer 
         eg : for tiny yolo v3 :  ( hast 2 output layers at different scales ) 
                masks: [ [3, 4, 5], [0, 1, 2] ],
                anchors: [[10, 14], [23, 27], [37, 58], [81, 82], [135, 169], [344, 319] ],
     */
     masks: [ [0, 1, 2, 3, 4] ],
     anchors: [ [0.57273, 0.677385], [1.87446, 2.06253], [3.33843, 5.47434], [7.88282, 3.52778], [9.77052, 9.16828]],

     /* class names array  */
     classes: CLASS_NAMES_COCO,
}

i did some benchmarks & here are some of the results :
- Chrome : 71.0.3578.98 (Official Build) (64-bit)
- CPU : Intel® Core™ i7-7700HQ Processor
- GPU0 : Intel® HD Graphics 630
- GPU1 : NVIDIA GeForce GTX 1050 Ti
- Backend : webgl

ml5.js + tfjs 0.13
	CPU*			GPU**
	Min	Avg	Max	Min	Avg	Max
416x416	522.99	538.13	556.40	98.89	101.68	114.79
320x320	325.30	332.64	351.39	58.69	61.40	71.79
224x224	186.10	191.98	207.30	33.90	36.58	89.29
128	83.55	78.39	104.00	15.60	16.95	23.90
tfjs 0.14.2
	CPU*			GPU**
	Min	Avg	Max	Min	Avg	Max
416x416	464.39	476.80	525.49	83.30	87.24	126.70
320x320	275.59	283.80	296.49	54.99	56.86	67.00
224x224	140.79	144.96	150.40	33.89	36.27	45.40
128x128	76.89	79.98	92.90	20.20	22.40	32.00

* Intel® HD Graphics 630

** NVIDIA GeForce GTX 1050 Ti

here is an updated demo where you can try out all of this stuff : https://codepen.io/hiddentn/full/gqrBeO

note : there is a big time difference when using detect() vs detectAsync() (+100ms in inference time) ,so please can anyone test things out ( there is a detect & a detect Sync buttons in the DEMO) and report their findings thank you

joeyklee · 2019-02-01T21:52:01Z

Hi @TheHidden1 Wow! This is wild. I didn't manage to have a deep look at your contributions yet, but I hope to do some checks with @shiffman soon. I just wanted to check in to let you know it's still very much on our minds! Thanks!

joeyklee · 2019-02-22T20:47:39Z

Hi @TheHidden1 - apologies for the radio silence. @yining1023 and I had a deeper look into your code and we have the feeling that your current proposal diverges a bit too far from the current API structure. We recognize that we don't have good (or any?) documentation yet about the API conventions for ml5.js but that is high on our list.

We would like to propose the following:

Please submit a simple working example of your refactor in action, preferably in p5.js and make a PR to the ml5-examples repo.
Submit some documentation on your new structure so we can better understand how this works.
If you're still interested to work on adapting your proposal closer to the current API structure of YOLO, you're welcome to do this and we can revisit the review.

Thanks again for all your work on this! cc/ @shiffman

hiddentn · 2019-02-24T10:07:46Z

i'll try & do my best

added docs

hiddentn · 2019-03-23T16:38:59Z

i added a small demo ml5js/ml5-examples#107

hiddentn · 2019-03-23T17:08:09Z

you know what, let me start this again. this has become too messy

hiddentn added 6 commits July 9, 2018 21:39

redid the YOLO object detection algorithm

3a158a2

woops

68f6f72

woops2.0

b0f1f06

fices some of the travis-ci errors

f863868

fixed travis-cli errors

6ffc23e

.

4d83cc9

fixed some errors

8547cdb

hiddentn and others added 3 commits July 28, 2018 13:00

Merge branch 'master' into master

f7695a5

updated the preproccessing/postproccessing functions

1b767a4

wops

1d4faf8

hiddentn mentioned this pull request Jul 29, 2018

redid the yolo model ml5js/ml5-data-and-models#31

Open

hiddentn and others added 3 commits July 29, 2018 13:50

build

f265b53

fixed test mistake

312e8a6

Merge branch 'master' into master

ea23c7a

hiddentn and others added 3 commits August 13, 2018 21:28

Merge branch 'master' into master

b20f514

yolov2 ready + fixed tests (i think)

95d31e6

wops

b2a0a94

cvalenzuela reviewed Aug 16, 2018

View reviewed changes

hiddentn added 2 commits January 27, 2019 20:07

updated

bc60baa

this build is ok i think

1650669

hiddentn force-pushed the master branch from 30b82cc to 1650669 Compare January 29, 2019 22:55

added yolo v3 & variable model size

6ccdbf8

added support for yolo v3 models & made some changed to the post proccessing alogorithm now wen can chang the model input size on the fly

hiddentn added 4 commits January 30, 2019 18:43

fixes test case

f16ec42

Oops i did it again...

8b345a2

this test..... 👽

03806fb

👏 👏 👏

a9360c4

Merge branch 'master' into master

3ba702b

hiddentn and others added 7 commits March 4, 2019 10:31

started on doc

3ec982f

merge

dc8aac8

Merge branch 'master' into docs-and-v2

5931b27

yolo ?

76812af

added docs

build

4fe7b72

fixed eslint toutching the wrong files

dce7b08

Merge branch 'master' of https://github.com/TheHidden1/ml5-library

888cf31

hiddentn mentioned this pull request Mar 23, 2019

added yolo example ml5js/ml5-examples#107

Open

fixed the test

1abe707

hiddentn closed this Mar 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

redid the YOLO object detection algorithm #176

redid the YOLO object detection algorithm #176

hiddentn commented Jul 9, 2018 •

edited

mw108 commented Jul 22, 2018 •

edited

mw108 commented Jul 22, 2018

hiddentn commented Jul 23, 2018 •

edited

hiddentn commented Jul 23, 2018 •

edited

mw108 commented Jul 24, 2018 •

edited

cvalenzuela commented Jul 24, 2018

cvalenzuela commented Aug 6, 2018

shiffman commented Aug 8, 2018

cvalenzuela commented Aug 8, 2018

shiffman commented Aug 8, 2018 •

edited

cvalenzuela commented Aug 8, 2018

hiddentn commented Aug 15, 2018 •

edited

cvalenzuela Aug 16, 2018

joeyklee commented Jan 24, 2019 •

edited

hiddentn commented Jan 27, 2019

hiddentn commented Jan 27, 2019 •

edited

shiffman commented Jan 28, 2019

joeyklee commented Jan 28, 2019

hiddentn commented Jan 30, 2019

joeyklee commented Feb 1, 2019

joeyklee commented Feb 22, 2019 •

edited

hiddentn commented Feb 24, 2019 •

edited

hiddentn commented Mar 23, 2019 •

edited

hiddentn commented Mar 23, 2019 •

edited

redid the YOLO object detection algorithm #176

redid the YOLO object detection algorithm #176

Conversation

hiddentn commented Jul 9, 2018 • edited

mw108 commented Jul 22, 2018 • edited

mw108 commented Jul 22, 2018

hiddentn commented Jul 23, 2018 • edited

hiddentn commented Jul 23, 2018 • edited

mw108 commented Jul 24, 2018 • edited

cvalenzuela commented Jul 24, 2018

cvalenzuela commented Aug 6, 2018

shiffman commented Aug 8, 2018

cvalenzuela commented Aug 8, 2018

shiffman commented Aug 8, 2018 • edited

cvalenzuela commented Aug 8, 2018

hiddentn commented Aug 15, 2018 • edited

cvalenzuela Aug 16, 2018

Choose a reason for hiding this comment

joeyklee commented Jan 24, 2019 • edited

hiddentn commented Jan 27, 2019

hiddentn commented Jan 27, 2019 • edited

shiffman commented Jan 28, 2019

joeyklee commented Jan 28, 2019

hiddentn commented Jan 30, 2019

* Intel® HD Graphics 630

** NVIDIA GeForce GTX 1050 Ti

joeyklee commented Feb 1, 2019

joeyklee commented Feb 22, 2019 • edited

hiddentn commented Feb 24, 2019 • edited

hiddentn commented Mar 23, 2019 • edited

hiddentn commented Mar 23, 2019 • edited

hiddentn commented Jul 9, 2018 •

edited

mw108 commented Jul 22, 2018 •

edited

hiddentn commented Jul 23, 2018 •

edited

hiddentn commented Jul 23, 2018 •

edited

mw108 commented Jul 24, 2018 •

edited

shiffman commented Aug 8, 2018 •

edited

hiddentn commented Aug 15, 2018 •

edited

joeyklee commented Jan 24, 2019 •

edited

hiddentn commented Jan 27, 2019 •

edited

joeyklee commented Feb 22, 2019 •

edited

hiddentn commented Feb 24, 2019 •

edited

hiddentn commented Mar 23, 2019 •

edited

hiddentn commented Mar 23, 2019 •

edited