-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor/change runtime speed into a test #605
Refactor/change runtime speed into a test #605
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 🤓
Do you mind adding a few unittests please?
Also, could you add a few lines to describe the goal of this PR?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please just add a short description and feel free to merge 😄
I updated the description of the PR |
This PR aims to provide a capability for testing responses to models in terms of tokens per second. The performance category includes the speed test type. In the future, any test type, such as memory tests,..., may be implemented under performance categories.
# Checklist:
pydantic
for typing when/where necessary.