Implement NCE loss to speed up Softmax with large amount of labels

For word based LSTM, the Softmax layer receive large amount of labels which make training very slow. Two methods can solve this problem: hierarchical softmax (HS) and NCE.

I think NCE is better than HS, so I want to implement NCE Loss similar to this repos: https://github.com/yandex/faster-rnnlm

The API may looks like

NCEOutput(data = input, label = target, noise = [negative samples])

Do you have more suggestions on this?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement NCE loss to speed up Softmax with large amount of labels #2704

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Implement NCE loss to speed up Softmax with large amount of labels #2704

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions