Questions on inference latency/cost #72
Comments
Great questions!
Hello, "one-at-a-time" means we cannot use batch size > 1, say 50, get the time spent on that batch, and then divide it by 50, right? @codyaustun Thanks! |
Yes, that is correct. For latency, you must use a batch size of 1.
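To make this concrete, a minimal sketch of such a batch-size-1 timing loop might look like the following. This assumes a PyTorch-style setup; `model` and `val_loader` are hypothetical stand-ins (a classifier with ≥ 93% top-5 accuracy and a loader over the 50,000 ImageNet validation images with `batch_size=1`), not part of any official DAWNBench harness:

```python
import time
import torch

def mean_latency_ms(model: torch.nn.Module, val_loader) -> float:
    """Average per-image latency over a loader that yields batches of size 1."""
    model.eval()
    total_seconds = 0.0
    num_images = 0
    with torch.no_grad():
        for image, _ in val_loader:            # exactly one image per iteration (batch size 1)
            start = time.perf_counter()
            _ = model(image)                   # classify a single image
            if image.is_cuda:
                torch.cuda.synchronize()       # finish any GPU work before stopping the clock
            total_seconds += time.perf_counter() - start
            num_images += 1
    return 1000.0 * total_seconds / num_images  # total time divided by 50,000 for the full set
```

Averaging over all 50,000 images, as the rule specifies, smooths out per-image timing noise without ever batching more than one image at a time.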
Thanks @codyaustun for your kind explanation. |
No problem
Original issue:
Hello,
I am trying to understand the latency rule in DAWNBench:
• Latency: Use a model that has a top-5 validation accuracy of 93% or greater. Measure the total time needed to classify all 50,000 images in the ImageNet validation set one-at-a-time, and then divide by 50,000.
I am not sure how to interpret "one-at-a-time" here, so I raised some questions (see the comments above) and would appreciate your confirmation.
Thanks.