When will the inference benchmark be released? #24
Comments
Hi JustinhoCHN, thanks for your note. Please take a look at the Inf1 session from re:Invent 2019. It has slides comparing BERT-base results on Inf1 with EC2 G4, as well as a detailed report from the Alexa TTS team on their Inf1 migration results compared to G4 and P3. The best option, of course, would be to try out your application on Inf1; it would be great to get your feedback. re:Invent CMP324 session: https://www.youtube.com/watch?v=17r1EapAxpk Best regards,
Hello JustinhoCHN, is there any further assistance I can offer at this time? Regards,
Since we haven’t heard back in a while, we are closing this issue. Please reopen it if you need more support from us.
We are interested in seeing the inference benchmark report and a comparison with T4, including inference time cost and throughput. It would be great if you could release it.