Support entropy loss #583

@SumanthRH

Description

Summary

Add support for an entropy loss term. This requires adding an entropy loss coefficient to the trainer.algorithm configuration. On the loss side, it involves computing entropy with gradients enabled in ModelWrapper and propagating the result to PolicyWorkerBase, where it can be combined with the policy loss.
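
For reference, here is a minimal, framework-agnostic sketch of the arithmetic an entropy term typically involves. It is not the repository's implementation: the names `token_entropy`, `masked_mean`, `total_loss`, and `entropy_coef` are hypothetical, and subtracting the entropy bonus from the policy loss is an assumed (though common) sign convention. In the trainer itself, the same computation would presumably run on logit tensors with autograd enabled, so that the entropy term contributes gradients.

```python
import math


def token_entropy(logits):
    """Entropy (in nats) of the softmax distribution over one token's logits."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    log_z = math.log(z)
    # H = -sum_i p_i * log p_i, where log p_i = (x_i - m) - log z
    return -sum((e / z) * ((x - m) - log_z) for e, x in zip(exps, logits))


def masked_mean(values, mask):
    """Average values over positions where mask is 1 (e.g. response tokens)."""
    count = sum(mask)
    if count == 0:
        return 0.0
    return sum(v * m for v, m in zip(values, mask)) / count


def total_loss(policy_loss, entropies, mask, entropy_coef):
    """Assumed convention: minimizing the loss should increase entropy."""
    return policy_loss - entropy_coef * masked_mean(entropies, mask)


# Illustrative values only: two tokens over a vocabulary of four entries.
logits_per_token = [[0.0, 0.0, 0.0, 0.0], [10.0, 0.0, 0.0, 0.0]]
entropies = [token_entropy(row) for row in logits_per_token]
loss = total_loss(policy_loss=0.5, entropies=entropies, mask=[1, 1], entropy_coef=0.01)
```

With `entropy_coef` set to 0, `total_loss` returns the policy loss unchanged, which suggests one way the new configuration field could default to the current behavior.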
