Code of paper Challenges and opportunities in applying Neural Point Processes to large scale industry data

We present in this repository code to train, evaluate Neural Temporal Point Processes (TPPs), as described in our work: "Challenges and opportunities in applying Neural Point Processes to large scale industry data"

Contact

Dominykas Šeputis dom.seputis@gmail.com

Data

We use three datasets:

Simulated Hawkes process
Stack Overflow users' activity
Vinted platform members' actions. If needed to access Vinted data, request for the data via email provided in the contact section

Setup instructions

We use poetry as python dependency manager. To setup python virtual environment, first, follow the instructions to install poetry on your machine.
Run $ poetry install to setup virtual environment
Run $ source ./venv/bin/activate to activate the newly initiated virtual environment
Authenticate with wandb tool to track experiments
To prepare datasets for training and evaluation steps, process them by running $ sh ./runs/data_preparation.sh
To replicate experiments, run one of the experiments' scripts inside ./runs/.
Alternatively, run specific experiment by running $ python -m scripts.train --experiment <EXPERIMENT_NAME> --model-name <MODEL_NAME> --split-num <SPLITS_COUNT>

Structure of the repository

├── config <- Config files used for data processing and experimentation
│   ├── data
│   └── experiments <- Subdirectories of different experiments based on the dataset
│       ├── hawkes
│       ├── stack_overflow
│       └── vinted
├── data <- Place where raw and processed/generated data is stored
│   └── raw
│       └── stack_overflow
├── runs <- .sh files that run multiple python scripts
├── scripts <- Place where training and data processing python scripts are held
│   └── data
└── src <- Source files
    ├── datasets <- Datasets' implementations
    ├── models <- Models' implementations
    └── utils <- Various utility functions

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
config		config
data		data
runs		runs
scripts		scripts
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
.python-version		.python-version
README.md		README.md
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

data

data

runs

runs

scripts

scripts

src

src

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

.pylintrc

.pylintrc

.python-version

.python-version

README.md

README.md

poetry.lock

poetry.lock

poetry.toml

poetry.toml

pyproject.toml

pyproject.toml

Repository files navigation

Code of paper Challenges and opportunities in applying Neural Point Processes to large scale industry data

Contact

Data

Setup instructions

Structure of the repository

About

Releases

Packages

Languages

dqmis/ntpps

Folders and files

Latest commit

History

Repository files navigation

Code of paper Challenges and opportunities in applying Neural Point Processes to large scale industry data

Contact

Data

Setup instructions

Structure of the repository

About

Resources

Stars

Watchers

Forks

Languages