Skip to content

ygcinar/nlp-concentration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

nlp-concentration

Source code for the Word Representations Concentrate and This is Good News! CONNL 2020

to print arguments for training and evaluation:

python train_evaluate.py -h

to plot histogram of difference and statistics of 1100 samples of the 20 newsgroups talk.religion.misc (550 samples) vs. soc.religion.christian (550 samples) categories using glove embeddings:

python train_evaluate.py -e dc-tfidf-glv -c talk.religion.misc soc.religion.christian -n 1100 -d n20 -x

to plot histogram of difference and statistics of 2000 samples from two class gaussian mixture p=400 (feature dimension):

python train_evaluate.py -n 2000 -d gaussian -p 400 -x

to train (and evaluate) lssvm with RBF kernel (sigma2=2) on 500 samples of yahoo answers dataset "Society & Culture" vs. "Education & Reference" categories using glove embeddings:
python train_evaluate.py -f lssvm -n 500 -d yqa -e dc-tfidf-glv -c "Society & Culture" "Education & Reference" -k rbf -r 2.0 -s 2.0

to train (and evaluate) lssvm with RBF kernel (sigma2=2) on 2000 samples of yahoo answers dataset "Society & Culture" vs. "Education & Reference" categories using tf-idf features:
python train_evaluate.py -f lssvm -n 2000 -d yqa -e dc-tfidf -c "Society & Culture" "Education & Reference" -k rbf -r 2.0 -s 2.0

to train (and evaluate) lssvm with polynomial kernel (f(t)=10 f'(t)=-5 f''(t)=1) on 2000 samples of yahoo answers dataset "Society & Culture" vs. "Education & Reference" categories using word2vec embeddings:
python train_evaluate.py -f lssvm -n 2000 -d yqa -e dc-tfidf-w2v -c "Society & Culture" "Education & Reference" -k poly -t 10 -5 1

Scripts for some figures are given under scripts/
example of running scripts for Figure 2:

cd scripts
./run_fig2.txt

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published