Applied Text Mining 1: Methods

Vrije Universiteit Amsterdam

Group 1: Laura Alvarez, Ravi Meijer and Martijn Wesselius

Assignments

In this course, five assignments build up to an automatic negation detector.

Assignment 1 The GitHub repository was created.

Assignment 2 Every member independently annotated ten documents for negation cues. The “saved” directories from the eHost annotation task were stored for every group member under the name saved-groupNumber-annotatorName. These directories are available in the IAA folder.
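The README does not state which agreement measure was computed on these annotations. As a minimal sketch, assuming token-level labels and Cohen's kappa as the measure, agreement between two annotators could be calculated as follows. The label names ("NEG", "O") and the example annotations are hypothetical and are not taken from the IAA folder.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same tokens."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("Both annotators must label the same non-empty token list")
    n = len(labels_a)
    # Observed agreement: fraction of tokens with identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, estimated from each annotator's label distribution.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical token-level labels: "NEG" marks a negation cue, "O" anything else.
annotator_1 = ["O", "NEG", "O", "O", "NEG", "O", "O", "O"]
annotator_2 = ["O", "NEG", "O", "O", "O", "O", "NEG", "O"]
kappa = cohen_kappa(annotator_1, annotator_2)
print(round(kappa, 3))
```

Because most tokens are not negation cues, raw agreement tends to look high; kappa corrects for the agreement expected by chance.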

Assignment 3 During this phase, preprocessing and feature extraction were performed (run the process_extract_final.py file). The code for preprocessing and feature extraction can be found in the preprocessing_extraction folder. In addition, extra code developed for the class, but not included in the final version, can be found in the folder named extra.
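To give an idea of what token-level feature extraction for negation cue detection can look like, here is a minimal sketch. It is not the content of process_extract_final.py: the function name token_features, the chosen features (affixes and neighbouring tokens), and the boundary markers are assumptions made for illustration.

```python
def token_features(tokens, i):
    """Build a feature dictionary for the token at position i (hypothetical feature set)."""
    lower = tokens[i].lower()
    return {
        "token": lower,
        # Affixes can help with morphological cues such as "un-" or "-less".
        "prefix_2": lower[:2],
        "suffix_3": lower[-3:],
        # Neighbouring tokens give a small context window.
        "prev_token": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next_token": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


sentence = "He was not unhappy".split()
features = [token_features(sentence, i) for i in range(len(sentence))]
print(features[3])
```

Dictionaries of this shape can be passed per token to most sequence labelling or classification libraries after vectorization.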

Assignment 4 For Assignment 4, the models were created. We created a baseline model and experimented with SVMs, Naive Bayes, and CRF. The code for this can be found in the folder named models. In addition, a hyperparameter search was performed on the CRF model; this implementation can be found in the hyperparameter_optimization folder.
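The README does not describe how the baseline works. One common choice for negation cue detection is a lexicon lookup, sketched below under that assumption. The cue list NEGATION_CUES and the function baseline_predict are hypothetical and do not come from the models folder.

```python
# Hypothetical cue lexicon; in practice it would be collected from the training data.
NEGATION_CUES = {"not", "no", "never", "n't", "without", "neither", "nor"}


def baseline_predict(tokens):
    """Label a token "NEG" if it appears in the cue lexicon, otherwise "O"."""
    return ["NEG" if token.lower() in NEGATION_CUES else "O" for token in tokens]


tokens = ["She", "did", "n't", "go", "and", "never", "called"]
predictions = baseline_predict(tokens)
print(list(zip(tokens, predictions)))
```

A lookup like this misses affixal cues (for example "unhappy") and cannot tell when a listed word is not used as a negation, which is the kind of gap the learned models are meant to close.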

Assignment 5 For Assignment 5, we performed an error analysis to evaluate the models created during Assignment 4. The code for this is available in the error_analysis folder.
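As an illustration of what a token-level error analysis can involve, the sketch below compares gold and predicted labels, counts which tokens are most often missed or wrongly flagged, and reports precision and recall. It is a minimal example under assumed label names ("NEG", "O") and made-up data, not the analysis in the error_analysis folder.

```python
from collections import Counter


def analyse_errors(tokens, gold, predicted, positive="NEG"):
    """Collect false positives/negatives per token and compute precision and recall."""
    false_positives, false_negatives = Counter(), Counter()
    true_positives = 0
    for token, g, p in zip(tokens, gold, predicted):
        if g == positive and p == positive:
            true_positives += 1
        elif p == positive:
            false_positives[token.lower()] += 1  # predicted a cue that is not one
        elif g == positive:
            false_negatives[token.lower()] += 1  # missed a gold cue
    fp = sum(false_positives.values())
    fn = sum(false_negatives.values())
    precision = true_positives / (true_positives + fp) if true_positives + fp else 0.0
    recall = true_positives / (true_positives + fn) if true_positives + fn else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "false_positives": false_positives,
        "false_negatives": false_negatives,
    }


# Hypothetical example: "impossible" is a missed affixal cue, "no" is wrongly flagged.
tokens = ["It", "is", "not", "impossible", "no", "doubt"]
gold = ["O", "O", "NEG", "NEG", "O", "O"]
predicted = ["O", "O", "NEG", "O", "NEG", "O"]
report = analyse_errors(tokens, gold, predicted)
print(report)
```

Grouping errors by token makes recurring problems visible, such as affixal cues that a model never predicts.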

Data

The data used for these experiments can be found in the data folder.

Requirements

We also provide a requirements.txt file that can be used to create an environment for testing the code.

Using pip:

$ pip install -r requirements.txt

Using conda:

$ conda create --name <env_name> --file requirements.txt

rmr282/ATM
