CANN: Resolve soft_max precision issue #15730

hipudding · 2025-09-02T06:32:04Z

Previously, the slope tensor was stored as fp16 to improve efficiency. This worked correctly in flash attention (FA), but it caused precision issues in soft_max. This change uses a different data type for each operator to balance accuracy and performance.
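To illustrate the kind of error involved, here is a minimal sketch in plain Python, not the CANN backend code. It assumes the slope scales an additive mask before the softmax, i.e. `softmax(logits + slope * mask)`. The slope value, the mask layout, and the helper names (`to_fp16`, `biased_softmax`) are all hypothetical. It models only the rounding of the slope to half precision; the remaining arithmetic stays in double precision.

```python
import math
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def biased_softmax(logits, mask, slope):
    """Assumed form of the operator: softmax(logits + slope * mask)."""
    return softmax([l + slope * m for l, m in zip(logits, mask)])


# Hypothetical per-head slope that is not exactly representable in fp16.
slope = 2.0 ** -2.5
slope_fp16 = to_fp16(slope)

# Hypothetical mask: negative distance to the last position, zero logits.
n = 64
mask = [-(n - 1 - i) for i in range(n)]
logits = [0.0] * n

ref = biased_softmax(logits, mask, slope)          # full-precision slope
low = biased_softmax(logits, mask, slope_fp16)     # slope rounded to fp16

slope_error = abs(slope - slope_fp16)
max_prob_diff = max(abs(a - b) for a, b in zip(ref, low))

print(f"slope rounding error: {slope_error:.3e}")
print(f"max softmax difference: {max_prob_diff:.3e}")
```

Because the rounded slope multiplies every mask entry, its error grows with the magnitude of the mask value, which is one plausible reason a half-precision slope could be acceptable in one operator and not in another.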



noemotiovon

LGTM!


hipudding requested a review from noemotiovon September 2, 2025 06:43

noemotiovon approved these changes Sep 2, 2025


github-actions bot added the labels "ggml" (changes relating to the ggml tensor library for machine learning) and "Ascend NPU" (issues specific to Ascend NPUs) Sep 2, 2025

noemotiovon merged commit 9961d24 into ggml-org:master Sep 2, 2025
48 checks passed
