fhamborg/domain-adapted-atsc: code for our 2019 paper "Adapt or Get Left Behind: Domain Adaptation through BERT Language Model Finetuning for Aspect-Target Sentiment Classification"

This repository is strongly based on https://github.com/deepopinion/domain-adapted-atsc

Installation

First, clone the repository, then open a terminal and cd into the repository directory:

conda create --yes -n ada python=3.7
source activate ada
pip install -r requirements.txt 
conda install --yes -c anaconda scipy
conda install --yes scikit-learn
conda install --yes pytorch torchvision cudatoolkit=10.1 -c pytorch
python -m spacy download en_core_web_sm
pip install torch-transformers
mkdir -p data/raw/semeval2014  # creates directories for data
mkdir -p data/transformed
mkdir -p data/models
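
After the installation steps above, a quick sanity check can confirm that the packages are importable in the `ada` environment. The following is a minimal sketch, not part of the repository: the helper `find_missing` and the list `REQUIRED_MODULES` are hypothetical, and the import names are assumptions derived from the install commands (for example, scikit-learn is imported as `sklearn`). The import name for `torch-transformers` is left out because the source does not state it.

```python
import importlib.util

# Assumed import names for the packages installed above (not stated in the README).
REQUIRED_MODULES = ["torch", "torchvision", "scipy", "sklearn", "spacy", "en_core_web_sm"]


def find_missing(module_names):
    """Return the names from module_names that cannot be found in this environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    print("missing modules:", ", ".join(missing) if missing else "none")
```

The check only looks up each module without importing it, so it does not verify CUDA support or package versions.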

Run the code

# count the non-empty lines in the finetuning corpus
cat data/transformed/copewe10m.txt | sed '/^\s*$/d' | wc -l
# expected: roughly 10M (e.g. 10000002)
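
The line count above can also be done in Python, which may be convenient on systems without `sed`. This is a minimal sketch that roughly mirrors the shell pipeline; the function name `count_nonblank` is hypothetical, and the file is assumed to be UTF-8 encoded.

```python
import os


def count_nonblank(lines):
    """Count the lines that contain at least one non-whitespace character."""
    return sum(1 for line in lines if line.strip())


if __name__ == "__main__":
    corpus_path = "data/transformed/copewe10m.txt"  # path taken from the command above
    if os.path.exists(corpus_path):
        # Assumption: the corpus is UTF-8 encoded.
        with open(corpus_path, encoding="utf-8") as handle:
            print(count_nonblank(handle))
    else:
        print("corpus not found:", corpus_path)
```

The file is streamed line by line, so the corpus does not have to fit into memory.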

cd finetuning_and_classification/

# activate the environment (if not yet done)
module load anaconda
module load cuda
source activate ada

# open a screen session, because this will take a while
screen -S lmfine

# prepare the finetuning corpus
python pregenerate_training_data.py --train_corpus ../data/transformed/copewe10m.txt --bert_model bert-base-uncased --do_lower_case --output_dir copewe10m_prepared/ --epochs_to_generate 3 --max_seq_len 256

# run the finetuning
python finetune_on_pregenerated.py --pregenerated_data copewe10m_prepared --bert_model bert-base-uncased --do_lower_case --output_dir copewe10m_finetuned/ --epochs 3 --train_batch_size 16
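
The two commands above share several values: the prepared-data directory, the BERT model name, and the number of epochs. As a minimal sketch, a small wrapper could build both command lines from one set of parameters so these values stay consistent. The functions `build_commands` and `run_pipeline` are hypothetical and not part of the repository; the script names and flags are copied from the commands above, and the wrapper is assumed to be started from `finetuning_and_classification/`.

```python
import shlex
import subprocess


def build_commands(corpus="../data/transformed/copewe10m.txt",
                   prepared_dir="copewe10m_prepared",
                   finetuned_dir="copewe10m_finetuned/",
                   bert_model="bert-base-uncased",
                   epochs=3, max_seq_len=256, train_batch_size=16):
    """Build the pregeneration and finetuning commands as argument lists."""
    pregenerate = [
        "python", "pregenerate_training_data.py",
        "--train_corpus", corpus,
        "--bert_model", bert_model, "--do_lower_case",
        "--output_dir", prepared_dir,
        "--epochs_to_generate", str(epochs),
        "--max_seq_len", str(max_seq_len),
    ]
    finetune = [
        "python", "finetune_on_pregenerated.py",
        "--pregenerated_data", prepared_dir,
        "--bert_model", bert_model, "--do_lower_case",
        "--output_dir", finetuned_dir,
        "--epochs", str(epochs),
        "--train_batch_size", str(train_batch_size),
    ]
    return [pregenerate, finetune]


def run_pipeline(commands, execute=False):
    """Print each command; run it only if execute is True."""
    for command in commands:
        print(" ".join(shlex.quote(part) for part in command))
        if execute:
            # check=True stops the pipeline if a step fails
            subprocess.run(command, check=True)


if __name__ == "__main__":
    run_pipeline(build_commands(), execute=False)  # dry run: print only
```

With `execute=False` the sketch only prints the commands, which allows them to be reviewed before starting a long-running job inside the screen session.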
