Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added bAbI dataset #286

Merged
merged 3 commits into from
Apr 27, 2018
Merged

added bAbI dataset #286

merged 3 commits into from
Apr 27, 2018

Conversation

zshihang
Copy link
Contributor

@zshihang zshihang commented Apr 13, 2018

there are three things in this PR:

  1. enhanced build_vocab on three dimensional data input, for example, the story in shape of [batch_size, memory_size, seq_len] of bAbI dataset. build_vocab on multi-dimension data? #264

  2. added Facebook bAbI dataset and its test file

  3. changed urllib to requests for downloading bAbI dataset.

@zshihang zshihang force-pushed the master branch 2 times, most recently from a87976a to f6799a9 Compare April 18, 2018 04:19
because the opener built by urllib.request.build_opener() is unable to
change the header of http packet by using opener.addheaders on
python2.7, decide to use requests instead.
@jekbradbury
Copy link
Contributor

Is this bAbI-1k or bAbI-10k?

@zshihang
Copy link
Contributor Author

both. there is a bool argument called tenK.

@shtoshni
Copy link

Hi there,

I think you forgot to reverse the story for bAbI. See here where the story is reversed for restricting to the latest sentences within memory_size but are never reversed back.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants