Week 4

What is Cypher

Cypher is declarative: it lets you focus on the domain instead of database access. It is suitable for both developers and operations professionals, and its queries are meant to be self-explanatory.

Queries are made out of clauses such as MATCH, WHERE, WITH, and RETURN.

Basic example:

MATCH (john {name:'John'})-[:friend]->()-[:friend]->(fof)
RETURN john.name, fof.name

This returns all friends-of-friends of John.

MATCH (user)-[:friend]->(follower)
WHERE user.name IN ['Joe', 'John', 'Sara', 'Maria', 'Steve'] AND follower.name =~ 'S.*'
RETURN user.name, follower.name

This query finds users (from the listed names) who have a friend whose name starts with 'S'.

Some additional query clauses for updating the graph: CREATE, DELETE, SET, REMOVE, and MERGE.
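As a rough sketch of what each updating clause does, the statements below use a hypothetical Person label and made-up names (none of this data comes from the notes). Each statement is meant to be run on its own, for example separated by semicolons in Neo4j Browser or cypher-shell.

```cypher
// CREATE: always adds a new node.
CREATE (:Person {name: 'Ann'});

// MERGE: matches the node if it already exists, otherwise creates it.
MERGE (:Person {name: 'Bob'});

// SET: adds or overwrites a property.
MATCH (p:Person {name: 'Ann'})
SET p.age = 30;

// REMOVE: drops a property (or a label) from a node.
MATCH (p:Person {name: 'Ann'})
REMOVE p.age;

// DELETE: removes the node; DETACH also removes its relationships first.
MATCH (p:Person {name: 'Bob'})
DETACH DELETE p;
```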

Update Queries

A single part of a Cypher query can't both match and update the graph. If a query reads and then updates the graph, it implicitly has two parts (first reading, then writing). Parts are separated using WITH.

An example: filter aggregated data

MATCH (n {name: 'John'})-[:FRIEND]-(friend)
WITH n, count(friend) AS friendsCount
WHERE friendsCount > 3
RETURN n, friendsCount

Now an update query:

MATCH (n {name: 'John'})-[:FRIEND]-(friend)
WITH n, count(friend) AS friendsCount
SET n.friendsCount = friendsCount
RETURN n.friendsCount

Any query can return data. Queries that only read must return data; otherwise they are useless. Update queries don't have to return anything, but can.

Transactions

Any query that updates the graph runs in a transaction, so an updating query either fully succeeds or does not succeed at all. A query holds its changes in memory until the whole query has finished executing; as a result, large queries need a JVM with lots of heap space.

Syntax

Values and types

Cypher provides first-class support for a number of data types, which fall into 3 groups: property types, structural types, and composite types.

Property types

Can be returned from Cypher queries
Can be used as parameters
Can be stored as properties
Can be constructed with Cypher literals

Comprises:

Number (Integer and Float)
String
Boolean
Spatial type Point
Temporal types Date, Time, LocalTime, DateTime, LocalDateTime, and Duration

Homogeneous lists of simple types can also be stored as properties, although lists in general (see Composite types) cannot be stored.
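A small sketch of the homogeneous-list rule, assuming a hypothetical Sample label and property names that are not part of the notes. The second statement is left commented out because it is expected to be rejected.

```cypher
// Works: every element is an integer, so the list is homogeneous.
CREATE (s:Sample {scores: [1, 2, 3]})
RETURN s.scores;

// Expected to be rejected: the list mixes an integer and a string.
// CREATE (s:Sample {mixed: [1, 'two']})
```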

Cypher also provides pass-through support for byte arrays, which can be stored as property values. Byte arrays are not considered a first-class data type, so they do not have a literal representation.

Structural Types

Can be returned from Cypher queries
Cannot be used as parameters
Cannot be stored as properties
Cannot be constructed with Cypher literals

Comprises:

Nodes:
- Id
- Label(s)
- Map (of properties)
Relationships:
- Id
- Type
- Map (of properties)
- Id of the start and end nodes
Paths:
- Alternating sequence of nodes and relationships

Composite Types

Can be returned from Cypher queries
Can be used as parameters
Cannot be stored as properties
Can be constructed with Cypher literals

Comprises:

Lists are heterogeneous, ordered collections of values, each of which has any property, structural or composite type.
Maps are heterogeneous, unordered collections of (key, value) pairs where:
- Key is a string
- Value has any property, structural, or composite type

Composite values can also contain null.
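To make the list and map descriptions concrete, here is a minimal sketch built only from literals; the values and aliases are invented for illustration.

```cypher
RETURN [1, 'two', 3.0, null] AS mixedList,
       {name: 'Ann', tags: ['a', 'b'], nested: {ok: true}} AS mapValue,
       [1, 'two', 3.0][1] AS secondElement   // 'two' (lists are zero-indexed)
```

Both values can be returned or passed in as parameters, but neither the mixed list nor the map could be stored directly as a property.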

Summary:

Type	Property	Structural	Composite
Can be returned from Cypher Queries	✅	✅	✅
Can be used as parameters	✅	❌	✅
Can be stored as properties	✅	❌	❌
Can be constructed with Cypher Literals	✅	❌	✅

Naming Rules and Recommendations

Rules and recommendations for naming node labels, relationship types, property names, and variables.

Naming Rules

Must begin with an alphabetic letter
- if a non-alphabetic character is required, use backticks for escaping; e.g. `^n`
Can contain numbers, but not as the first character
Cannot contain symbols
- The one exception is the underscore, e.g. my_var
- Use backticks if a leading symbol character is required
Can be very long, up to 65535 characters
Are case-sensitive
Leading and trailing whitespace characters are removed automatically
- Use backticks to escape this behavior: `hello world variable`
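The backtick rules above can be sketched as follows. The Person label and FRIEND relationship type are assumptions borrowed from the earlier examples, and the variable names are deliberately awkward to show the escaping.

```cypher
// `^n` starts with a symbol and `hello world variable` contains spaces,
// so both need backticks everywhere they are used.
MATCH (`^n`:Person)-[:FRIEND]->(`hello world variable`:Person)
RETURN `^n`.name AS person, `hello world variable`.name AS friend
```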

Scoping and namespace rules

Technically this query is valid: CREATE (a:a {a: 'a'})-[r:a]->(b:a {a: 'a'})
Node labels, relationship types, and property names may re-use names.
Variables for nodes and relationships must not re-use names within the same query scope
- This is invalid: CREATE (a)-[a]->(b)

Recommendations

What	Recommendation	Example
Node labels	Camel case, beginning with an upper-case character	`:VehicleOwner`
Relationship types	Upper case, using underscores to separate words	`:OWNS_VEHICLE`
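Applied together, the two recommendations look roughly like this. The Vehicle label and both property values are hypothetical additions for the sake of a complete pattern.

```cypher
// Label in camel case with a leading capital, relationship type in upper snake case.
CREATE (:VehicleOwner {name: 'Ann'})-[:OWNS_VEHICLE]->(:Vehicle {model: 'Roadster'})
```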

Expressions

An expression in Cypher can be (many things):

A decimal literal: 13, -4000, 3.14, 6.022E23
A hexadecimal integer literal (start with 0x): 0x13af, 0xFC3A9
An octal integer literal (start with 0): 01372, 02127, -05671
A string literal: 'Hello', "World"
boolean literal: true, false, TRUE, FALSE
variable: n, x, rel, myFancyVar, `A var with spaces and chars!`
property: n.prop, x.prop, rel.thisProp, myFancyVariable.`(weird prop name)`
dynamic property: n["prop"], rel[n.city + n.zip], map[coll[0]]
parameter: $param, $0
list of expressions: ['a', 'b'], [1, 2, 3], [ ], ['a', 2, n.prop, $param]
function call: length(p), nodes(p)
aggregate function: avg(x.prop), count(*)
path pattern: (a)-->()<--(b)
operator application: 1 + 2, 3 < 4
predicate expression: a.prop = 'Hello', length(p) > 10, exists(a.name)
regular expression: a.name =~ 'Tob.*'
case-sensitive string matching expression: a.surname STARTS WITH 'Sven', a.surname ENDS WITH 'son', a.surname CONTAINS 'son'
a CASE expression
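The list ends with CASE but gives no example, so here is a minimal sketch of one. The age property and the group names are assumptions made up for illustration.

```cypher
MATCH (n:Person)
RETURN n.name,
       CASE
         WHEN n.age < 18 THEN 'minor'
         WHEN n.age >= 18 THEN 'adult'
         ELSE 'unknown'   // reached when n.age is null
       END AS ageGroup
```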

String Literals can contain the following escape sequences:

Escape Sequence	Character
\t	Tab
\b	Backspace
\n	Newline
\r	Carriage return
\f	Form feed
\'	Single quote
\"	Double quote
\\	Backslash
\uxxxx	Unicode UTF-16 code point
\Uxxxxxxxx	Unicode UTF-32 code point
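A short sketch of a few of these escape sequences inside string literals; the strings and aliases are invented for the example.

```cypher
RETURN 'It\'s quoted' AS singleQuote,
       "She said \"hi\"" AS doubleQuote,
       'col1\tcol2\nnext line' AS tabAndNewline,
       'C:\\temp' AS backslash,
       '\u00E9' AS eAcute   // é
```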

Reserved keywords

Parameters

Cypher supports querying with parameters, so developers don't have to resort to string building to create a query. Parameters also make caching of execution plans much easier for Cypher, leading to faster query execution times.

Can be used for: literals and expressions; node and relationship ids; and, for explicit indexes only, index values and queries.

Cannot be used for: property keys, relationship types, labels
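The two lists above can be sketched as follows, for the Cypher version these notes cover. The parameter names $id, $label, and $key are hypothetical, and the disallowed forms are left commented out.

```cypher
// Allowed: a parameter standing in for a property value.
MATCH (n:Person)
WHERE n.name = $name
RETURN n;

// Allowed: a parameter standing in for a node id.
MATCH (n)
WHERE id(n) = $id
RETURN n;

// Not allowed: parameters in place of a label or a property key.
// MATCH (n:$label) RETURN n
// MATCH (n) WHERE n.$key = 'Ethan' RETURN n
```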

Simple example: Parameters

{
  "name": "Ethan"
}

Query

MATCH (n:Person {name: $name})
RETURN n

Cool example: Create multiple nodes with properties:

{
  "props": [ {
    "awesome": true,
    "name": "Ethan",
    "position": "Developer"
  }, {
    "children": 3,
    "name": "Andres",
    "position": "Developer"
  } ]
}

Query

UNWIND $props AS properties
CREATE (n:Person)
SET n = properties
RETURN n

Operators

Basics

Operator Type	Example
General	`DISTINCT`, `.` for property access, `[]` for dynamic property access
Mathematical	`+`, `-`, `*`, `/`, `%`, `^`
Comparison	`=`, `<>`, `<`, `>`, `<=`, `>=`, `IS NULL`, `IS NOT NULL`
String-Specific Comparison	`STARTS WITH`, `ENDS WITH`, `CONTAINS`
Boolean	`AND`, `OR`, `XOR`, `NOT`
String	`+` for concatenation, `=~` for regex matching
Temporal	`+` and `-` for operation between durations and temporal instants/durations, `*` and `/` for operations between durations and numbers
List	`+` for concatenation, `IN` to check existence of an element in a list, `[]` for accessing element(s)
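A few of the operators from the table, sketched on literal values so the query needs no data in the graph. The aliases are arbitrary.

```cypher
RETURN 2 ^ 3 AS power,                         // 8.0 (exponentiation yields a float)
       7 % 3 AS remainder,                     // 1
       'Neo' + '4j' AS joined,                 // 'Neo4j'
       [1, 2] + [3] AS combined,               // [1, 2, 3]
       2 IN [1, 2, 3] AS found,                // true
       'Cypher' STARTS WITH 'Cy' AS hasPrefix, // true
       true XOR false AS exclusiveOr           // true
```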

DISTINCT removes duplicate values:

CREATE (a:Person { name: 'Anne', eyeColor: 'blue' }),(b:Person { name: 'Bill', eyeColor: 'brown' }),(c:Person { name: 'Carol', eyeColor: 'blue' })
WITH [a, b, c] AS ps
UNWIND ps AS p
RETURN DISTINCT p.eyeColor

Even though both Anne and Carol have blue eyes, 'blue' is only returned once in the output.

DISTINCT is commonly used in conjunction with aggregating functions.
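As a sketch of DISTINCT inside an aggregating function, the query below assumes Person nodes with an eyeColor property like the ones in the DISTINCT example.

```cypher
MATCH (p:Person)
RETURN count(p.eyeColor) AS allValues,             // counts every non-null value
       count(DISTINCT p.eyeColor) AS uniqueValues  // counts each colour once
```

If the graph held only Anne, Bill, and Carol, allValues would be 3 and uniqueValues would be 2.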

Comments

// double slashes all the way

Reflection

Today I spent 5+ hours messing around with Cypher, Neo4j and JavaScript. I used a JSON file I had from a previous project and worked on writing a Cypher query that would import it into Neo4j. This task turned out to be harder than I originally thought, plus it took many failed attempts for me to learn that I can easily create multiple nodes and relationships with only a few lines of Cypher.

The query I wound up using in the end is this:

UNWIND $data AS data
CREATE (d:Department {name: data.department})
WITH data, d
UNWIND data.tests AS rt
MERGE (r:Rank {name: rt.rank})
CREATE (tc:TestCollection {name: d.name + r.name}),
      (tc)-[:DEPARTMENT]->(d),
      (tc)-[:RANK]->(r)
WITH d, rt, tc
UNWIND rt.tests AS test
CREATE (t:Test {name: test}),
      (tc)-[:TEST]->(t)

From developing this query I learned a lot about the clauses UNWIND, CREATE, MERGE, and WITH.

First, the UNWIND clause is kind of like a forEach loop (but don't confuse it with the actual FOREACH clause, as it operates quite differently). UNWIND allowed me to iterate over an array of objects; I actually used it to iterate over nested lists (notice the 3 uses of UNWIND). During each iteration I can add the entry to the query namespace, allowing it to be piped into other clauses via the WITH clause.
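Stripped of the import details, the nested iteration can be sketched on a literal list. The variable names inner and x are arbitrary.

```cypher
UNWIND [[1, 2], [3]] AS inner   // rows: [1, 2] and [3]
UNWIND inner AS x               // rows: 1, 2 and 3
RETURN x
```

Each UNWIND turns one list into one row per element, which is why chaining them walks the nested levels one at a time.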

One big issue I had was choosing between MERGE and CREATE. They are very similar, but MERGE won't create another node if it can find a matching one in the graph. I used this when creating the Rank nodes, as I only wanted 4 of them but needed each to have relationships with multiple TestCollection nodes. This is easier to visualize than it is to type, so please refer to the included images for more details.
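A minimal sketch of the difference, using made-up rank names rather than the real data. Each statement is meant to be run on its own, twice.

```cypher
// Running this statement twice leaves two :Rank nodes named 'Gold'.
CREATE (:Rank {name: 'Gold'});

// Running this statement twice still leaves a single 'Silver' node.
MERGE (:Rank {name: 'Silver'});
```

MERGE matches on the whole pattern it is given, so the label and every listed property have to match before an existing node is reused.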

Finally, an important detail is the use of the WITH clause. After the CREATE and MERGE clauses are used, I need to pipe the new entities (nodes, relationships) to the next part of the query. I believe this query can be considered to have 3 parts (separated by each UNWIND clause), but I'm not 100% certain.

Also, I was able to do all of this using Node.js 10.3.0, JavaScript, and some JSON data. The data itself was manipulated via the modify_data.js file, which uses the Node.js file system API and some messy ES6 array methods to build a usable JSON object for the query.
