Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When the inference benchmark release will be released? #24

Closed
JustinhoCHN opened this issue Dec 17, 2019 · 3 comments
Closed

When the inference benchmark release will be released? #24

JustinhoCHN opened this issue Dec 17, 2019 · 3 comments

Comments

@JustinhoCHN
Copy link

We are interested to see the inference benchmark report, and comparision between T4, including inference time cost, and thoughput. It would be great if you guys release it.

@AWSGH
Copy link
Contributor

AWSGH commented Dec 17, 2019

Hi JustinhoCHN,

Thanks for your note. Please take a look at the Inf1 session from re:invent 2019, it has comparison slides showing BERTbase results compared to EC2 G4, as well as detailed report from Alexa TTS team on their Inf1 migration results compared to G4 and P3. But the best would be of course to try out your application on Inf1, would be great to get your feedback.

re:Invent CMP324 session: https://www.youtube.com/watch?v=17r1EapAxpk

Best regards,
Gadi

@aws-taylor
Copy link
Contributor

Hello JustinhoCHN,

Is there any further assistance I can offer at this time?

Regards,
Taylor

@micwade-aws
Copy link
Contributor

Since we haven’t heard back in a while we are closing this issue, please reopen it if more support from us is needed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants