EasyTPP

EasyTPP is an easy-to-use development and application toolkit for Temporal Point Process (TPP), with key features in configurability, compatibility and reproducibility. We hope this project could benefit both researchers and practitioners with the goal of easily customized development and open benchmarking in TPP.

Features [Back to Top]

Configurable and customizable: models are modularized and configurable，with abstract classes to support developing customized TPP models.
Compatible with both Tensorflow and PyTorch framework: EasyTPP implements two equivalent sets of models, which can be run under Tensorflow (both Tensorflow 1.13.1 and Tensorflow 2.0) and PyTorch 1.7.0+ respectively. While the PyTorch models are more popular among researchers, the compatibility with Tensorflow is important for industrial practitioners.
Reproducible: all the benchmarks can be easily reproduced.
Hyper-parameter optimization: a pipeline of optuna-based HPO is provided.

Model List [Back to Top]

We provide reference implementations of various state-of-the-art TPP papers:

No	Publication	Model	Paper	Implementation
1	KDD'16	RMTPP	Recurrent Marked Temporal Point Processes: Embedding Event History to Vector	Tensorflow Torch
2	NeurIPS'17	NHP	The Neural Hawkes Process: A Neurally Self-Modulating Multivariate Point Process	Tensorflow Torch
3	NeurIPS'19	FullyNN	Fully Neural Network based Model for General Temporal Point Processes	Tensorflow Torch
4	ICML'20	SAHP	Self-Attentive Hawkes process	Tensorflow Torch
5	ICML'20	THP	Transformer Hawkes process	Tensorflow Torch
6	ICLR'20	IntensityFree	Intensity-Free Learning of Temporal Point Processes	Tensorflow Torch
7	ICLR'21	ODETPP	Neural Spatio-Temporal Point Processes (simplified)	Tensorflow Torch
8	ICLR'22	AttNHP	Transformer Embeddings of Irregularly Spaced Events and Their Participants	Tensorflow Torch

Dataset [Back to Top]

We preprocessed one synthetic and five real world datasets from widely-cited works that contain diverse characteristics in terms of their application domains and temporal statistics:

Synthetic: a univariate Hawkes process simulated by Tick library.
Retweet (Zhou, 2013): timestamped user retweet events.
Taxi (Whong, 2014): timestamped taxi pick-up events.
StackOverflow (Leskovec, 2014): timestamped user badge reward events in StackOverflow.
Taobao (Xue et al, 2022): timestamped user online shopping behavior events in Taobao platform.
Amazon (Xue et al, 2022): timestamped user online shopping behavior events in Amazon platform.

In addition, we processed two non-anthropogenic datasets

Earthquake: timestamped earthquake events over the Conterminous U.S from 1996 to 2023, processed from USGS.
Volcano eruption: timestamped volcano eruption events over the world in recent hundreds of years, processed from The Smithsonian Institution.

All datasets are preprocess to the Gatech format dataset widely used for TPP researchers, and saved at Google Drive with a public access.

Quick Start [Back to Top]

We provide an end-to-end example for users to run a standard TPP model with EasyTPP.

Step 1. Installation

First of all, we can install the package from the source code on Github.

git clone https://github.com/Anonymous0006/EasyTPP.git
cd EasyTemporalPointProcess
python setup.py install

Step 2. Prepare datasets

We need to put the datasets in a local directory before running a model and the datasets should follow a certain format.

Suppose we use the taxi dataset in the example.

Step 3. Train the model

Before start training, we need to set up the config file for the pipeline. We provide a preset config file in Example Config

After the setup of data and config, the directory structure is as follows:

    data
     |______taxi
             |____ train.pkl
             |____ dev.pkl
             |____ test.pkl

    configs
     |______experiment_config.yaml

Then we start the training by simply running the script

import argparse
from easy_tpp.config_factory import Config
from easy_tpp.runner import Runner


def main():
    parser = argparse.ArgumentParser()

    parser.add_argument('--config_dir', type=str, required=False, default='configs/experiment_config.yaml',
                        help='Dir of configuration yaml to train and evaluate the model.')

    parser.add_argument('--experiment_id', type=str, required=False, default='NHP_train',
                        help='Experiment id in the config file.')

    args = parser.parse_args()

    config = Config.build_from_yaml_file(args.config_dir, experiment_id=args.experiment_id)

    model_runner = Runner.build_from_config(config)

    model_runner.run()


if __name__ == '__main__':
    main()

Documentation [Back to Top]

The classes and methods of EasyTPP have been well documented so that users can generate the documentation by:

cd doc
pip install -r requirements.txt
make html

NOTE:

The doc/requirements.txt is only for documentation by Sphinx, which can be automatically generated by Github actions .github/workflows/docs.yml. (Trigger by pull request.)

Benchmark [Back to Top]

In the examples folder, we provide a script to benchmark the TPPs, with Taxi dataset as the input.

To run the script, one should download the Taxi data following the above instructions. The config file is readily setup up. Then run

cd examples
python benchmark_script.py

License [Back to Top]

This project is licensed under the Apache License (Version 2.0). This toolkit also contains some code modified from other repos under other open-source licenses. See the NOTICE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
docs		docs
easy_tpp		easy_tpp
examples		examples
LICENCE		LICENCE
MANIFEST.in		MANIFEST.in
NOTICE		NOTICE
README.md		README.md
requirements-doc.txt		requirements-doc.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
version.py		version.py

License

Anonymous0006/EasyTPP

Folders and files

Latest commit

History

Repository files navigation

EasyTPP

Features [Back to Top]

Model List [Back to Top]

Dataset [Back to Top]

Quick Start [Back to Top]

Step 1. Installation

Step 2. Prepare datasets

Step 3. Train the model

Documentation [Back to Top]

Benchmark [Back to Top]

License [Back to Top]

About

Resources

License

Stars

Watchers

Forks

Languages