Skip to content

Commit

Permalink
more minor fixes (giving up on mybinder running classification)
Browse files Browse the repository at this point in the history
  • Loading branch information
chrislit committed Oct 15, 2018
1 parent 52ab0a7 commit 19e1915
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 1 deletion.
2 changes: 2 additions & 0 deletions binder/Text Classification of Drug Reviews.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,8 @@
"The text classification task below uses customer review text to predict the condition for which the drug in question was prescribed. No other data (the drug name, for example) is used in this task.\n",
"\n",
"### Caveats\n",
"Unfortunately, this notebook crashes near the end when run on mybinder.org. But it runs fine on Google Colab, though you'll need to add a cell at the beginning to call `!pip install abydos`.\n",
"\n",
"This is a toy problem. I have taken a dataset that was already divided into training & test sets and used the test set for validation, not as a genuine test set. On the other hand, I haven't done much hyperparameter tuning. Indeed, all of the classifiers used below have identical parameters: `LinearSVC(loss='hinge', C=1, max_iter=2000, random_state=1337)`.\n",
"\n",
"However, Abydos was used in a [winning submission](https://www.kaggle.com/c/anlp-2015-classification-assignment/leaderboard) to a Kaggle (InClass) competition in UC Berkeley's 2015 Applied NLP course. The same [notebook](https://gist.github.com/chrislit/3852eed7cce4b3544db2) (but with its Pseudo-SSK classifier disabled due to memory requirements) was applied to [the following year's competition](https://www.kaggle.com/c/anlp-2016-classification-assignment/leaderboard), after the competition deadline, and beat that year's leader (0.89535 to 0.89369) without any tuning. So... Abydos can be useful in generalizing text classification tasks.\n",
Expand Down
2 changes: 1 addition & 1 deletion binder/requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,6 @@ abydos
numpy
pandas
scikit-learn
tensorflow
keras
nltk

0 comments on commit 19e1915

Please sign in to comment.