GPU utilization becomes lower with 16-bit floating point

When I tried to implement the paper "mixed precision training" in the tensor2tensor open source code, I found that I could not achieve the training speed that I expected. After checking, I found that the GPU utilization of my modified code was significantly reduced. However, I did not modify the underlying code that allocates GPU resources. I suspect that the problem occurs where the GPU and CPU interact.

Has anyone encountered the same/similar problems? Many thanks in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GPU utilization becomes lower with 16-bit floating point #1620

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

GPU utilization becomes lower with 16-bit floating point #1620

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions