Fix and disable padding to a multiple of 16 for INT8 #332

guillaumekln · 2020-11-23T10:43:33Z

The previous logic was incorrect, so the padding was never applied. The code checked the layer's output type, which can only be float32 or float16, so a check for INT8 could never succeed. Instead, it should check the global compute type.

After fixing this issue, it appears padding to a multiple of 16 does not help. So let's disable it for now and gather more data.
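For illustration only, here is a minimal Python sketch of the bug and the fix described above. It is not the actual CTranslate2 code: the function names, the string type labels, and the `enable_padding` flag are all hypothetical. It only shows why a check against the output type can never trigger, and how rounding a size up to a multiple of 16 would work.

```python
# Hypothetical sketch; names and type labels are assumptions, not the project's API.

OUTPUT_TYPES = ("float32", "float16")  # per the description, the only possible output types


def pad_to_multiple(size: int, multiple: int = 16) -> int:
    """Round size up to the next multiple (returned unchanged if already aligned)."""
    remainder = size % multiple
    return size if remainder == 0 else size + (multiple - remainder)


def buggy_should_pad(output_type: str) -> bool:
    # Previous logic: compares the layer's output type, which is never "int8",
    # so this always returns False and padding is never applied.
    return output_type == "int8"


def fixed_should_pad(compute_type: str) -> bool:
    # Corrected logic: compares the global compute type instead.
    return compute_type == "int8"


def padded_length(length: int, compute_type: str, enable_padding: bool = False) -> int:
    # enable_padding defaults to False to mirror "disable it for now".
    if enable_padding and fixed_should_pad(compute_type):
        return pad_to_multiple(length)
    return length
```

With padding disabled, `padded_length(17, "int8")` returns the length unchanged. Passing `enable_padding=True` rounds it up to 32.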

guillaumekln merged commit 7c54f53 into OpenNMT:master Nov 23, 2020

guillaumekln deleted the fix-and-disable-int8-padding branch November 23, 2020 11:07

Purfview mentioned this pull request Jan 18, 2026

Enable multiple of 16 padding for INT8 Tensor Cores #1982

Open
