This repository has been archived by the owner on Feb 8, 2018. It is now read-only.

cedias / HAN-pytorch

(Deprecated) Hierarchical Attention Networks for Document Classification (https://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf) - in PyTorch

Deprecated code

A faster, up-to-date implementation is in my other repo.

HAN-pytorch

Batched implementation of the paper Hierarchical Attention Networks for Document Classification.
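
The central operation in the paper is attention pooling: every hidden vector is scored against a learned context vector, the scores are softmax-normalised, and the weighted sum becomes the sentence (or document) vector. Batching sequences of different lengths usually means padding them and masking the padded positions. The following is a minimal pure-Python sketch of that idea and is not taken from Nets.py. The name `attention_pool` is hypothetical, and the paper's tanh projection before scoring is left out for brevity.

```python
import math


def attention_pool(hidden, context, mask):
    """Masked attention pooling over one padded sequence.

    hidden:  list of T vectors (lists of floats), padded to length T
    context: context vector with the same dimension as each hidden vector
    mask:    list of T booleans, False at padded positions (at least one True)
    """
    # Dot-product score of each position against the context vector.
    # Simplification: the paper first applies a tanh projection to each vector.
    scores = [sum(h_d * c_d for h_d, c_d in zip(h, context)) for h in hidden]
    # Softmax over real positions only; padded positions get weight 0.
    top = max(s for s, keep in zip(scores, mask) if keep)
    exps = [math.exp(s - top) if keep else 0.0 for s, keep in zip(scores, mask)]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum of the hidden vectors.
    dim = len(context)
    pooled = [sum(w * h[d] for w, h in zip(weights, hidden)) for d in range(dim)]
    return pooled, weights
```

Applied once over the words of each sentence and once over the resulting sentence vectors, this gives the two-level hierarchy the paper describes. A tensor library would perform the same masking on a whole padded batch at once.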

Requirements

PyTorch (>= 0.2)
spaCy (for tokenizing)
Gensim (for building word vectors)
tqdm (for progress bars)

Scripts:

prepare_data.py transforms gzip files, as found on Julian McAuley's Amazon product data page, into lists of (user, item, review, rating) tuples, and builds word vectors if the --create-emb option is specified.
main.py trains a hierarchical model.
Data.py holds the data-management objects.
Nets.py holds the networks.
beer2json.py is a helper script if you happen to have the ratebeer/beeradvocate datasets.
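
As an illustration of the conversion that prepare_data.py is described as doing, here is a minimal sketch that reads a gzipped file into (user, item, review, rating) tuples. It is not the script's actual code. It assumes one strict-JSON object per line, and the field names `reviewerID`, `asin`, `reviewText` and `overall` are assumptions about the input format. The function name `read_reviews` is hypothetical.

```python
import gzip
import json


def read_reviews(path):
    """Yield (user, item, review, rating) tuples from a gzipped JSON-lines file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            # Field names below are assumptions about the input format.
            yield (
                record["reviewerID"],
                record["asin"],
                record["reviewText"],
                float(record["overall"]),
            )
```

Calling `list(read_reviews(path))` materialises the whole file in memory; iterating over the generator directly keeps memory use flat for large files.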

Note:

Word embeddings are built from the whole dataset, including any reviews later held out for evaluation, which can be an issue if you need a strictly held-out test set.
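
One way around this, sketched under assumptions, is to split the review tuples first and build the embedding corpus from the training portion only. The helpers `split_reviews` and `embedding_corpus` are hypothetical and not part of this repository, and whitespace tokenisation stands in for the spaCy tokenizer.

```python
import random


def split_reviews(reviews, test_fraction=0.2, seed=0):
    """Shuffle and split review tuples into (train, test) lists."""
    shuffled = list(reviews)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def embedding_corpus(train_reviews):
    """Return lowercased, whitespace-tokenised texts from training reviews only."""
    return [review.lower().split() for _user, _item, review, _rating in train_reviews]
```

The result is a list of token lists, which is the shape of input that Gensim's Word2Vec accepts as its sentences.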
