SANSA-POI

This repository contains code to process the RDF datasets of the SLIPO project, which describe Points of Interest (POI). It also provides clustering analysis of that data.

Clustering Algorithms

We mainly use the Power Iteration and K-Means clustering algorithms to cluster POIs by their categories. The clustering algorithms come from the standard Spark ML library. Because they expect numerical input, we developed several methods to encode the categorical data as numerical data:

OneHot Encoding
Word2Vec Encoding (from Spark ML)
Multidimensional Scaling Encoding (third-party library, see pom.xml)
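
To illustrate the idea behind category-based clustering, here is a minimal sketch in plain Python rather than Spark. It one-hot encodes each POI's category set and groups the resulting vectors with a simple K-Means loop. The POI names, the categories, and the helper names one_hot and k_means are hypothetical and not taken from this repository; the project itself relies on the Spark ML implementations.

```python
from math import dist


def one_hot(categories, vocabulary):
    """Encode a set of category labels as a 0/1 vector over a fixed vocabulary."""
    return [1.0 if label in categories else 0.0 for label in vocabulary]


def k_means(points, k, iterations=20):
    """Plain Lloyd's algorithm; the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: dist(p, centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignment


# Hypothetical POIs and categories, for illustration only.
pois = {
    "poi_a": {"restaurant", "bar"},
    "poi_b": {"museum", "gallery"},
    "poi_c": {"restaurant"},
    "poi_d": {"museum"},
}
vocabulary = sorted(set().union(*pois.values()))
vectors = [one_hot(cats, vocabulary) for cats in pois.values()]
labels = dict(zip(pois, k_means(vectors, k=2)))
```

In this toy input the two food-related POIs end up in one cluster and the two culture-related POIs in the other. Seeding the centroids with the first k points keeps the sketch deterministic; a real run would normally use a randomized or k-means++ style initialization.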

Staypoint Algorithm

The Staypoint algorithm determines stay points from spatiotemporal data. A stay point is a place where a user stays for a while. It is bounded by two parameters, T_min and D_max: T_min is the minimum time a user must remain in the same place for it to count as a stay point, whereas D_max is the maximum distance between two consecutive places. By combining stay points with Yelp data, one can determine the interesting venues in a region.

Input: spatiotemporal RDF data, T_min and D_max
Output: stay points in a region
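
The role of T_min and D_max can be sketched in plain Python as follows. This is not the repository's implementation: it assumes the input has already been extracted from RDF into a time-ordered list of (lat, lon, timestamp) tuples, it uses the haversine distance in metres, and it follows the common formulation in which distance is measured from the first point of a candidate group rather than strictly between consecutive points. The names haversine_m and stay_points and the sample track are hypothetical.

```python
from math import asin, cos, radians, sin, sqrt


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon, ...) points in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (a[0], a[1], b[0], b[1]))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371000.0 * asin(sqrt(h))


def stay_points(track, t_min, d_max):
    """Return a list of (lat, lon, arrival, departure) stay points.

    track: time-ordered list of (lat, lon, timestamp_in_seconds).
    t_min: minimum stay duration in seconds.
    d_max: maximum distance in metres from the first point of the group.
    """
    result = []
    i, n = 0, len(track)
    while i < n:
        # Extend j while the points stay within d_max of the anchor point i.
        j = i + 1
        while j < n and haversine_m(track[i], track[j]) <= d_max:
            j += 1
        # Points i .. j-1 form the candidate group.
        if track[j - 1][2] - track[i][2] >= t_min:
            group = track[i:j]
            lat = sum(p[0] for p in group) / len(group)
            lon = sum(p[1] for p in group) / len(group)
            result.append((lat, lon, track[i][2], track[j - 1][2]))
            i = j  # continue after the detected stay
        else:
            i += 1  # no stay anchored here, slide the anchor forward
    return result


# Hypothetical track: three fixes close together, then two far away.
track = [
    (52.52000, 13.40500, 0),
    (52.52010, 13.40510, 600),
    (52.52005, 13.40495, 1200),
    (52.53000, 13.42000, 1500),
    (52.54000, 13.44000, 1800),
]
stays = stay_points(track, t_min=900, d_max=200)
```

With T_min set to 15 minutes and D_max to 200 metres, the first three fixes form a single stay point located at their mean coordinates, while the last two fixes are too far apart to form one.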

Project Management

The project is managed with Maven so that it runs easily on different platforms. We provide a run.sh shell script that packages and runs the project with the Maven profile dev; it should be easy to adapt to other setups.
