Tabular Accuracy Verification Test & Instructions #4
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Motivation
To verify that our ROCm enabled
triton-inference-serverandpython_backendwork correctly for our desired use case, we needed to set up a test that runs a transformer-based tabular classification model on ROCm via thetriton-inference-serverand validate the outputs of the model against a dataset for accuracy.Technical Details
Pipeline Overview
To accomplish this, we do the following:
FTTransformmodel on CPU & store the weights it was initialized withFTTransformmodel on GPU via the ROCm enabledtriton-inference-server, reusing the same weights as the CPU modeltriton-inference-serverArtifacts
examples/tab_transform_pytorchdirectory:triton-inference-serverREADME.mdfile in a newValidate Tabular Accuracy for ROCmsectionTest Result
Local runs showed that we were able to match the outputs between the CPU and GPU models, within tolerance for all samples:
Results: Max absolute difference: 4.17e-07 Mean absolute difference: 7.86e-08 Tolerance: 1.00e-05 Samples exceeding tolerance: 0/10000 ============================================================ PASS: All 10000 samples within tolerance (1e-05) GPU implementation matches CPU reference! ============================================================Submission Checklist