Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix issue in sd handler with data type #387

Merged
merged 1 commit into from
Dec 11, 2022
Merged

fix issue in sd handler with data type #387

merged 1 commit into from
Dec 11, 2022

Conversation

siddvenk
Copy link
Contributor

@siddvenk siddvenk commented Dec 11, 2022

Description

It seems like the type check used for self.data_type to determine kwargs was incorrect, leading to pipeline being initialized as one data type, and ds inference trying to use another data type (always defaults to fp32). This should fix that issue and use the data type the user provides.

Before this change, model would start fine, but during inference there is an issue with invoking a triton op due to mismatched data type. Tested this change on g5 and it should work as expected.

@siddvenk siddvenk merged commit 4c4da32 into master Dec 11, 2022
@siddvenk siddvenk deleted the sd-test branch December 11, 2022 19:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants