The folder
- session-1 : contains code for session on Intrusion Detection, divided into R and python code folders.
- session-2 : contains code for session on Text classification, divided into R and python code folders.
-
Anaconda distribution for python : Go to https://www.continuum.io/downloads and download the latest Anaconda distribution. Please use Python 2.7 installation.
-
Run
conda install -c anaconda seaborn
-
Run
conda install -c glemaitre imbalanced-learn
-
Install the libraries listed below using pip.
Steps to install a library in python.
- Go to terminal/command-prompt.
- Run
pip install <library name>
- For instance, to install numpy, you’d run
pip install numpy
List of libraries used in the hands-on session
Session 1 : Intrusion detection
- numpy
- pandas
- matplotlib
- seaborn
- sklearn
- imblearn
- xgboost
Session 2 : News articles recommender
- numpy
- pandas
- sklearn
- nltk 3.2.4
- Install nltk corpus and model:
> import nltk > nltk.download('stopwords') > nltk.download('punkt') > nltk.download('maxent_ne_chunker') > nltk.download('averaged_perceptron_tagger') > nltk.download('words')
- gensim 0.12.4
conda install -c anaconda gensim
- Set up R : Go to https://cran.rstudio.com/ and download R for your OS. Please download R version >=3.4.1
- Set up R Studio : Go to https://www.rstudio.com/products/rstudio/ and download open source version of RStudio Desktop.
- Install the libraries listed below.
Steps to install a library in RStudio
- Open RStudio.
- In the console, run
install.packages(“<library name>”)
- For instance, to install ggplot2, you’d run
install.packages(“ggplot2”)
List of libraries used in the hands-on session
Session 1 : Intrusion detection
- ggplot2
- randomForest
- caret
- rpart
- plyr
- gbm
- rpart.plot
- reshape2
- naivebayes
- corrplot
- e1071
Session 2 : News articles recommender
- tm
- topicmodels
- lda
- MASS
- devtools
- NLP
- R.utils
- stringdist
- dplyr
- openNLP
- rjava
- NLP
- openNLP
- RWeka
- qdap
- magrittr
- openNLPmodels.en
- data.table
- text2vec
Note If any issues with Rjava, make sure you have JDK and JRE installed on your system.
For Windows: http://docs.oracle.com/javase/7/docs/webnotes/install/windows/jdk-installation-windows.html
For Linux: https://github.com/hannarud/r-best-practices/wiki/Installing-RJava-(Ubuntu)
If you are not able to setup your machine, please send an email to sourabh@tatrasdata.com