This repository is the official implementation of the CIKM 2023 paper "FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular Data". Some code scripts are adapted from Tabular Transformers for Modeling Multivariate Time Series.
- Python3 >= 3.9
- torch==1.13.0
- transformers==4.26.1
- tqdm==4.64.1
- scikit-learn==1.2.0
- matplotlib==3.6.2
- numpy==1.24.2
- pandas==1.1.5

These packages can be installed directly by running:

```
pip install -r requirements.txt
```
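Since the pins above are exact, it can help to verify the environment before running the notebooks. A minimal stdlib sketch (the pinned versions are copied from the list above; the helper itself is not part of the repository):

```python
import sys
from importlib import metadata

# Exact versions pinned in requirements.txt (subset of the list above).
PINNED = {"torch": "1.13.0", "transformers": "4.26.1", "pandas": "1.1.5"}

def check_env():
    """Return a list of human-readable mismatches between the pinned
    requirements and the current environment (empty list = all good)."""
    issues = []
    if sys.version_info < (3, 9):
        issues.append(f"Python >= 3.9 required, found {sys.version.split()[0]}")
    for pkg, want in PINNED.items():
        try:
            have = metadata.version(pkg)
        except metadata.PackageNotFoundError:
            issues.append(f"{pkg} not installed (want {want})")
        else:
            if have != want:
                issues.append(f"{pkg}: have {have}, want {want}")
    return issues

for issue in check_env():
    print(issue)
```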
The synthetic transaction dataset is provided in the TabFormer GitHub repository.
The Amazon product reviews datasets are available here. In our paper, we used the "5-core" subsets.
- Run `preprocess_IBM_v2.ipynb` or `preprocess_amazon_liang.ipynb` to split the raw dataset files into train/val/test CSV files.
- Run `preload_dataset.ipynb` to execute the first-stage processing.
- Run either `process_IBM_dataset.ipynb` or `process_amazon_dataset.ipynb` to get the model-specific dataset.
- Run the notebooks named `run_main_....ipynb` to pretrain, fine-tune, train from scratch, or export embeddings from a model. (You can also run `main_ibm.py` or `main_amazon.py` directly.)
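The first step above splits the raw files into train/val/test CSVs. A minimal stdlib sketch of a chronological split (the actual split logic and ratios live in the preprocessing notebooks; the 80/10/10 fractions and toy columns here are assumptions for illustration):

```python
import csv
import io

def split_rows(rows, train_frac=0.8, val_frac=0.1):
    """Split already-ordered rows chronologically into train/val/test.
    The fractions are illustrative defaults, not the paper's settings."""
    n = len(rows)
    n_train = int(n * train_frac)
    n_val = int(n * val_frac)
    return rows[:n_train], rows[n_train:n_train + n_val], rows[n_train + n_val:]

# Tiny in-memory CSV standing in for a raw transaction file.
raw = "user,amount,timestamp\n" + "\n".join(f"u{i},{i * 10},{i}" for i in range(10))
rows = list(csv.DictReader(io.StringIO(raw)))
train, val, test = split_rows(rows)
print(len(train), len(val), len(test))  # → 8 1 1
```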
Linux bash scripts under the `sh_commands` directory can be used to run the Jupyter notebooks mentioned above with the Python module papermill (we used version 2.4.0). For model- or dataset-specific settings, please refer to these bash scripts.
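The bash scripts drive the notebooks through papermill's CLI; the same can be done from Python via papermill's `execute_notebook` API. A hedged sketch (notebook names come from this README; the output directory and the skip-if-missing behavior are assumptions, not the repository's logic):

```python
import os

def run_notebooks(notebooks, out_dir="executed_notebooks"):
    """Execute each notebook with papermill, skipping files that do not
    exist. Returns the list of notebooks actually executed."""
    executed = []
    for nb in notebooks:
        if not os.path.exists(nb):
            continue
        import papermill as pm  # requires `pip install papermill`
        os.makedirs(out_dir, exist_ok=True)
        pm.execute_notebook(nb, os.path.join(out_dir, nb))
        executed.append(nb)
    return executed

# Example: run the two shared preprocessing stages in order.
run_notebooks(["preload_dataset.ipynb", "process_IBM_dataset.ipynb"])
```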