TMDb to Neo4j

This program reads raw JSON data from the TMDb collection and inserts it into Neo4j in the form we need. It is written in Scala and uses [spray-json](https://github.com/spray/spray-json) to deserialize the JSON and [neotypes](https://neotypes.github.io/neotypes/), a Scala driver for Neo4j built on the official Java driver. The program runs in six main steps:

1. Create constraints on the node types.
2. Deserialize the movies and actors from JSON into Scala case classes.
3. Build Scala Map and Set collections from the movies and actors to pre-process the relationships between the different nodes (some of the data is inserted in parallel).
4. Add all nodes to Neo4j, without relationships.
5. Add the relationships between the relevant nodes, using the collections created in step 3.
6. Run the chosen algorithms on the whole inserted data set.
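
To make the pre-processing step more concrete, here is a minimal sketch of how such Map and Set collections could be built with the Scala standard library alone. It is not the project's actual code: the `Movie` case class and its `cast` field are simplified assumptions, `PlayInMovie` only mirrors the fields used in the example methods further down, and the `Preprocess` object and its method names are hypothetical.

```scala
// Simplified, hypothetical model: the project's real case classes
// carry more fields than the ones shown here.
final case class PlayInMovie(id: Long, character: String, order: Int)
final case class Movie(id: Long, title: String, cast: List[PlayInMovie])

object Preprocess {
  // Movie id -> credits of that movie, ready to become relationships.
  def castByMovie(movies: Seq[Movie]): Map[Long, List[PlayInMovie]] =
    movies.map(m => m.id -> m.cast).toMap

  // Ids of all actors referenced by at least one movie.
  def referencedActorIds(movies: Seq[Movie]): Set[Long] =
    movies.flatMap(_.cast.map(_.id)).toSet

  // (credit, movieId) pairs whose actor is known, i.e. the relationships
  // that can be created once all nodes exist.
  def resolvableRelations(
      cast: Map[Long, List[PlayInMovie]],
      knownActorIds: Set[Long]
  ): List[(PlayInMovie, Long)] =
    for {
      (movieId, credits) <- cast.toList
      credit <- credits if knownActorIds(credit.id)
    } yield (credit, movieId)
}
```

Building these collections in memory first means the relationship step only has to iterate over ready-made pairs instead of re-reading the JSON.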

Nodes and relationships are inserted with Cypher, the Neo4j query language. The schema of the created nodes and relationships is as follows:

As an example, here are two Scala methods: one inserts an actor, the other inserts the relationship of an actor playing in a movie:

// Creates one Actor node; missing optional fields are stored as empty strings.
def addActor(actor: Actor): Future[Unit] =
  driver.writeSession { session =>
    c"""
      CREATE (actor: Actor {
        tmdbId: ${actor.id},
        name: ${actor.name},
        biography: ${actor.biography.getOrElse("")},
        birthday: ${actor.birthday.getOrElse("")},
        deathday: ${actor.deathday.getOrElse("")},
        gender: ${actor.intToGender()},
        place_of_birth: ${actor.place_of_birth.getOrElse("")},
        profile_path: ${actor.profile_path.getOrElse("")}
      })
    """.query[Unit].execute(session)
  }

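// Links an existing Actor node to an existing Movie node.
// If either MATCH finds no node, no relationship is created.
def addPlayInRelation(actor: PlayInMovie, movieId: Long): Future[Unit] =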
  driver.writeSession { session =>
    c"""
      MATCH (m: Movie {tmdbId: $movieId})
      MATCH (a: Actor {tmdbId: ${actor.id}})
      MERGE (a)-[r:PLAY_IN {character: ${actor.character}, order: ${actor.order}}]->(m)
    """.query[Unit].execute(session)
  }
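
Because the relationship query starts with two MATCH clauses, it only creates something once both nodes exist, which is why all nodes are added before any relationship. The sketch below shows one way to enforce that ordering with standard-library Futures. It is an illustration under assumptions, not the project's code: `InsertionOrder` and `insertAll` are hypothetical names, and the two function parameters stand in for methods such as `addActor` and `addPlayInRelation`.

```scala
import scala.concurrent.{ExecutionContext, Future}

object InsertionOrder {
  // Starts every node insertion, waits until all of them have completed,
  // and only then starts the relationship insertions.
  def insertAll[N, R](nodes: Seq[N], relations: Seq[R])(
      addNode: N => Future[Unit],
      addRelation: R => Future[Unit]
  )(implicit ec: ExecutionContext): Future[Unit] =
    for {
      _ <- Future.traverse(nodes)(addNode)         // phase 1: nodes
      _ <- Future.traverse(relations)(addRelation) // phase 2: relationships
    } yield ()
}
```

Within each phase `Future.traverse` launches the insertions without waiting for one another, so they may run concurrently. The for-comprehension is what guarantees that no relationship is attempted before every node insertion has finished.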
