NeoCube: The Graph-Based Implementation of the M³ Data Model

MSc Thesis | Summer 2022 | MSc Computer Science | IT University of Copenhagen

The M³ model is a data model for media exploration that builds on and combines aspects of multi-dimensional analysis and faceted search to get the best of both worlds. The data model has previously only been represented as a relational model. This project aims to translate the M³ data model to graph, evaluating it using large-scale datasets.

The repository current contains the following:

Scripts for populating Neo4j graph data model with M³ CSV data.
Python benchmarking suite with CLI & graph output
Prototype Node.js GraphQL server

Neo4j

install

sudo apt-get install neo4j

Load NeoCube data

CSV files needed:

cubeobjects.csv
tags.csv
alphanumerical_tags.csv
numerical_tags
date_tags.csv
time_tags.csv
timestamp_tags.csv
objecttagrelations.csv
tagsets.csv
nodes.csv
hierarchies.csv

Place M³ csv data in the neo4j import folder.
```
 <neo4j-home>/import
```
Run the neocube_populate.cypher script to load the data. This script requires the Neo4j apoc library for timestamp tag name formatting.
```
 cypher-shell -u neo4j -d neo4j -f neocube_populate.cypher
```

Neo4j & PostgreSQL Benchmarking suite

Located in the benchmarking directory

Dependencies

numpy, click, seaborn, neo4j, psycopg, python-dotenv

pip install -r requirements.txt

Environment variables

First place .env file in the server folder with the following properties:

# Neo4j - uses default database (neo4j)
NEO4J_URL=bolt://localhost:7687
NEO4J_USER=<username>
NEO4J_PASSWORD=<password>
# PostgreSQL
PSQL_HOST=127.0.0.1
PSQL_PORT=5432
PSQL_USER=<username>
PSQL_PASSWORD=<password>
PSQL_DB=<database name>
# LSC dataset 
MAX_TAG_ID=193189
MAX_TAGSET_ID=21
MAX_HIERARCHY_ID=3
MAX_NODE_ID=8842
MAX_OBJECT_ID=183386

Run benchmarks

python3 M3Benchmarker.py --help

python3 M3Benchmarker.py complete --r 5

GraphQL node.js server

https://neo4j.com/product/graphql-library/

Located in the server directory.

Dependencies

@neo4j/graphql @neo4j/graphql-ogm neo4j-driver graphql apollo-server dotenv

Install dependencies:

npm install

Environment variables

First place .env file in the server folder with the following properties:

NEO4J_USER=<username> 
NEO4J_PASSWORD=<password>
NEO4J_URI=bolt://localhost:7687

default user and password are neo4j and neo4j.

Run server

node index.js

The server can be visited at http://localhost:4000. GraphQL queries can be built here through Apollo Studio.

M³ state generators

Navigate to the generators directory and run the following commands:

python3 postgresql_state_generator_V7.py < 3d.txt
python3 neo4j_state_generator_V1.py < 3d.txt

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
benchmarking		benchmarking
client		client
generators		generators
results		results
scripts		scripts
server		server
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NeoCube: The Graph-Based Implementation of the M³ Data Model

Neo4j

install

Load NeoCube data

Neo4j & PostgreSQL Benchmarking suite

Dependencies

Environment variables

Run benchmarks

GraphQL node.js server

Dependencies

Environment variables

Run server

M³ state generators

Benchmarking Results

VBS dataset

LSC dataset

About

Releases

Packages

Languages

nimertz/NeoCube

Folders and files

Latest commit

History

Repository files navigation

NeoCube: The Graph-Based Implementation of the M3 Data Model

Neo4j

install

Load NeoCube data

Neo4j & PostgreSQL Benchmarking suite

Dependencies

Environment variables

Run benchmarks

GraphQL node.js server

Dependencies

Environment variables

Run server

M3 state generators

Benchmarking Results

VBS dataset

LSC dataset

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

NeoCube: The Graph-Based Implementation of the M³ Data Model

M³ state generators

Packages