SANSA-Stack

This project comprises the whole Semantic Analytics Stack (SANSA). At a glance, it features the following functionality:

Ingesting RDF and OWL data in various formats into RDDs
Operators for working with RDDs and data frames of RDF data at various levels (triples, bindings, graphs, etc)
Transformation of RDDs to data frames and partitioning of RDDs into R2RML-mapped data frames
Distributed SPARQL querying over R2RML-mapped data frame partitions using RDB2RDF engines (Sparqlify & Ontop)
Enrichment of RDDs with inferences
Application of machine learning algorithms

For a detailed description of SANSA, please visit http://sansa-stack.net.

Layers

The SANSA project is structured in the following five layers developed in their respective sub-folders:

Release Cycle

A SANSA stack release is done every six months and consists of the latest stable versions of each layer at this point. This repository is used for organising those joint releases.

Usage

Spark

Requirements

We currently require a Spark 3.x.x with Scala 2.12 setup. A Spark 2.x version can be built from source based on the spark2 branch.

Release Version

Some of our dependencies are not in Maven central (yet), so you need to add following Maven repository to your project POM file repositories section:

<repository>
   <id>maven.aksw.internal</id>
   <name>AKSW Release Repository</name>
   <url>http://maven.aksw.org/archiva/repository/internal</url>
   <releases>
      <enabled>true</enabled>
   </releases>
   <snapshots>
      <enabled>false</enabled>
   </snapshots>
</repository>

If you want to import the full SANSA Stack, please add the following Maven dependency to your project POM file:

<!-- SANSA Stack -->
<dependency>
   <groupId>net.sansa-stack</groupId>
   <artifactId>sansa-stack-spark_2.12</artifactId>
   <version>$LATEST_RELEASE_VERSION$</version>
</dependency>

If you only want to use particular layers, just replace $LAYER_NAME$ with the corresponding name of the layer

<!-- SANSA $LAYER_NAME$ layer -->
<dependency>
   <groupId>net.sansa-stack</groupId>
   <artifactId>sansa-$LAYER_NAME$-spark_2.12</artifactId>
   <version>$LATEST_RELEASE_VERSION$</version>
</dependency>

SNAPSHOT Version

While the release versions are available on Maven Central, latest SNAPSHOT versions have to be installed from source code:

git clone https://github.com/SANSA-Stack/SANSA-Stack.git
cd SANSA-Stack

Then to build and install the full SANSA Spark stack you can do

./dev/mvn_install_stack_spark.sh

or for a single layer $LAYER_NAME$ you can do

mvn -am -DskipTests -pl :sansa-$LAYER_NAME$-spark_2.12 clean install

Alternatively, you can use the following Maven repository and add it to your project POM file repositories section:

<repository>
   <id>maven.aksw.snapshots</id>
   <name>AKSW Snapshot Repository</name>
   <url>http://maven.aksw.org/archiva/repository/snapshots</url>
   <releases>
      <enabled>false</enabled>
   </releases>
   <snapshots>
      <enabled>true</enabled>
   </snapshots>
</repository>

Then do the same as for the release version and add the dependency:

<!-- SANSA Stack -->
<dependency>
   <groupId>net.sansa-stack</groupId>
   <artifactId>sansa-stack-spark_2.12</artifactId>
   <version>$LATEST_SNAPSHOT_VERSION$</version>
</dependency>

How to Contribute

We always welcome new contributors to the project! Please see our contribution guide for more details on how to get started contributing to SANSA.

Name		Name	Last commit message	Last commit date
Latest commit History 5,327 Commits
.github/workflows		.github/workflows
dev		dev
docs		docs
project		project
sansa-bench-spark		sansa-bench-spark
sansa-bom		sansa-bom
sansa-cmds-picocli		sansa-cmds-picocli
sansa-datalake		sansa-datalake
sansa-examples		sansa-examples
sansa-hadoop-jena		sansa-hadoop-jena
sansa-inference		sansa-inference
sansa-integration-tests		sansa-integration-tests
sansa-ml		sansa-ml
sansa-notebooks		sansa-notebooks
sansa-owl		sansa-owl
sansa-pkg-parent		sansa-pkg-parent
sansa-query		sansa-query
sansa-rdf		sansa-rdf
sansa-resource-metadata		sansa-resource-metadata
sansa-resource-testdata		sansa-resource-testdata
sansa-sabine		sansa-sabine
sansa-spark-cli		sansa-spark-cli
sansa-spark-jakarta		sansa-spark-jakarta
sansa-spark-jena-java		sansa-spark-jena-java
sansa-spark-jena-scala		sansa-spark-jena-scala
sansa-stack		sansa-stack
sansa-test-resources		sansa-test-resources
trash/sansa-debian-spark-cli		trash/sansa-debian-spark-cli
.gitignore		.gitignore
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pom.xml		pom.xml
scalastyle-config.xml		scalastyle-config.xml
test.tarql		test.tarql

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SANSA-Stack

Layers

Release Cycle

Usage

Spark

Requirements

Release Version

SNAPSHOT Version

How to Contribute

About

Releases 18

Packages

Contributors 38

Languages

License

SANSA-Stack/SANSA-Stack

Folders and files

Latest commit

History

Repository files navigation

SANSA-Stack

Layers

Release Cycle

Usage

Spark

Requirements

Release Version

SNAPSHOT Version

How to Contribute

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 18

Packages 0

Contributors 38

Languages

Packages