Skip to content
This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

Vocab Limited Pretrained Embedding [2/5] #1248

Closed
wants to merge 1 commit into from

Conversation

geof90
Copy link
Contributor

@geof90 geof90 commented Feb 13, 2020

Summary: In local bento experiments, often nearest neighbors / items nearby in the embedding space tended to be misspellings of the original word. This isn't really useful for spoken language since there won't be many misspellings, so instead this diff adds a subclass of PretrainedEmbeddings that restricts the embedding space to only contain known vocab words. From local experiments, the results here seem much more consistent with what is expected from kNN in the embedding space.

Reviewed By: geof90

Differential Revision: D19818803

Summary: In local bento experiments, often nearest neighbors / items nearby in the embedding space tended to be misspellings of the original word. This isn't really useful for spoken language since there won't be many misspellings, so instead this diff adds a subclass of `PretrainedEmbeddings` that restricts the embedding space to only contain known vocab words. From local experiments, the results here seem much more consistent with what is expected from kNN in the embedding space.

Reviewed By: geof90

Differential Revision: D19818803

fbshipit-source-id: bfac18887990f7a816e30000f8fbbfad37788fd3
@facebook-github-bot facebook-github-bot added CLA Signed Do not delete this pull request or issue due to inactivity. fb-exported labels Feb 13, 2020
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D19818803

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in f907783.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
CLA Signed Do not delete this pull request or issue due to inactivity. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants