nlp-concentration

Source code for the Word Representations Concentrate and This is Good News! CONNL 2020

to print arguments for training and evaluation:

python train_evaluate.py -h

to plot histogram of difference and statistics of 1100 samples of the 20 newsgroups talk.religion.misc (550 samples) vs. soc.religion.christian (550 samples) categories using glove embeddings:

python train_evaluate.py -e dc-tfidf-glv -c talk.religion.misc soc.religion.christian -n 1100 -d n20 -x

to plot histogram of difference and statistics of 2000 samples from two class gaussian mixture p=400 (feature dimension):

python train_evaluate.py -n 2000 -d gaussian -p 400 -x

to train (and evaluate) lssvm with RBF kernel (sigma2=2) on 500 samples of yahoo answers dataset "Society & Culture" vs. "Education & Reference" categories using glove embeddings:
python train_evaluate.py -f lssvm -n 500 -d yqa -e dc-tfidf-glv -c "Society & Culture" "Education & Reference" -k rbf -r 2.0 -s 2.0

to train (and evaluate) lssvm with RBF kernel (sigma2=2) on 2000 samples of yahoo answers dataset "Society & Culture" vs. "Education & Reference" categories using tf-idf features:
python train_evaluate.py -f lssvm -n 2000 -d yqa -e dc-tfidf -c "Society & Culture" "Education & Reference" -k rbf -r 2.0 -s 2.0

to train (and evaluate) lssvm with polynomial kernel (f(t)=10 f'(t)=-5 f''(t)=1) on 2000 samples of yahoo answers dataset "Society & Culture" vs. "Education & Reference" categories using word2vec embeddings:
python train_evaluate.py -f lssvm -n 2000 -d yqa -e dc-tfidf-w2v -c "Society & Culture" "Education & Reference" -k poly -t 10 -5 1

Scripts for some figures are given under scripts/
example of running scripts for Figure 2:

cd scripts
./run_fig2.txt

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
scripts		scripts
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nlp-concentration

Source code for the Word Representations Concentrate and This is Good News! CONNL 2020

About

Releases

Packages

Languages

ygcinar/nlp-concentration

Folders and files

Latest commit

History

Repository files navigation

nlp-concentration

Source code for the Word Representations Concentrate and This is Good News! CONNL 2020

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages