Do not load context for model transform in llm inference#11751

Merged
hemildesai merged 3 commits into main from hemil/peft-inference-fix
Jan 14, 2025

@hemildesai
Collaborator

No description provided.

shanmugamr1992
shanmugamr1992 previously approved these changes Jan 3, 2025
@hemildesai hemildesai marked this pull request as ready for review January 6, 2025 04:11
Signed-off-by: Hemil Desai <hemild@nvidia.com>
@hemildesai hemildesai force-pushed the hemil/peft-inference-fix branch from 1da1fc6 to ec604bd Compare January 6, 2025 04:34
@github-actions
Contributor

github-actions bot commented Jan 6, 2025

[🤖]: Hi @hemildesai 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully

So it might be time to merge this PR or get some approvals

I'm just a bot so I'll leave it to you what to do next.

//cc @pablo-garay @ko3n1g

Collaborator

@cuichenx cuichenx left a comment

Thanks for the optimization! Suggestion below on updating the type annotation

Signed-off-by: Hemil Desai <hemild@nvidia.com>
@github-actions
Contributor

[🤖]: Hi @hemildesai 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully

So it might be time to merge this PR or get some approvals

I'm just a bot so I'll leave it to you what to do next.

//cc @pablo-garay @ko3n1g

Collaborator

@cuichenx cuichenx left a comment

Thanks, LGTM

@hemildesai hemildesai merged commit cdaf7b1 into main Jan 14, 2025
@hemildesai hemildesai deleted the hemil/peft-inference-fix branch January 14, 2025 17:20
abhinavg4 pushed a commit that referenced this pull request Jan 30, 2025
Signed-off-by: Abhinav Garg <abhgarg@nvidia.com>
youngeunkwon0405 pushed a commit to youngeunkwon0405/NeMo that referenced this pull request Feb 10, 2025
3 participants