Add memgraph tutorial #507

karmenrabar · 2023-09-14T07:59:23Z

I've created a demo Jupyter notebook that demonstrates how to generate PyGraphistry visualizations using the Python driver for Neo4j, while working with data in Memgraph. Additionally, I've included a README file to illustrate the process of connecting to Memgraph.

demos/demos_databases_apis/memgraph/visualizing_iam_dataset.ipynb

lmeyerov

Super cool!

Can you scrub & rotate the user/pass, and maybe switch to api tokens?
The github preview wasn't showing the screenshots some reason here, maybe check?
This file is 0.5MB b/c the screenshots, maybe there is a way to host the images outside of the code repo? Not sure of a good pattern there

karmenrabar · 2023-09-14T09:31:20Z

Thank you, @lmeyerov, for reviewing the PR and providing constructive comments! I've made the changes; hopefully, it's better now. Please let me know 😊 I've hosted the images in my public repo and outside the code one

lmeyerov · 2023-09-14T22:16:28Z

@karmenrabar I dug into the text and am enjoying this tutorial, this should be quite helpful for folks!

I made a pass smoothing some prose + clarifying text on Graphistry side (for new users). Feel free to tweak if you think further helpful.
For the schema viz step, can you switch it to graphistry.cypher("CALL db.schema()").plot() ? If that doesn't work in memgraph, no worries, we can land as is, let me know

karmenrabar · 2023-09-15T08:38:48Z

Amazing, thank you for additional text, I appreciate it @lmeyerov ! I'm super glad it could be helpful.

Unfortunately, it appears that graphistry.cypher("CALL db.schema()").plot() is not compatible with Memgraph, since Memgraph has a different method for retrieving schema information. Therefore it would be cool if we land it as it is. But it is a good idea to explore the alternatives for mentioned query to use with Graphistry !

lmeyerov · 2023-09-16T22:04:18Z

Thanks @karmenrabar , merged!

RE: CALL db.schema(), I'm curious if there's another Bolt-standardized cypher command here, or maybe Memgraph has a proprietary cypher extension that can be used instead?

karmenrabar · 2023-09-17T14:38:05Z

@lmeyerov amazing, thank you !

Similarly to CALL db.schema(), Memgraph does provide a meta_util.schema procedure that can be used to get the graph schema in Memgraph Lab. More about it can be found here. If include_properties is set to true, the graph schema will contain additional information about properties.
For example:

CALL meta_util.schema(true) 
YIELD nodes, relationships 
RETURN nodes, relationships;

You can also generate graph schema using Memgraph Lab, which provides a visual user interface for managing and interacting with your graph data (source).

lmeyerov · 2023-09-17T15:51:18Z

Awesome - I think it'd help to update the tutorial to that, or a sample

Thinking through making this useful for our community, can the data creation step switch to a pandas -> apache arrow upload, and for the fetch step, to apache arrow download? A lot of our users like to work with hundreds of thousands or millions of events & entities, and assuming speed on memgraph side, we find this to keep interactions subsecond

karmenrabar · 2023-09-18T14:30:45Z

Memgraph indeed offers data loading capabilities using PyArrow and it is done by using GQL Alchemy.. However, a different driver is used and, to achieve the fastest performance when executing queries, it's best to use it with pre-defined indexes. Also, the data format suitable for PyArrow differs from the one that is used here. But, it's a good idea to explore for a next project !

lmeyerov · 2023-09-18T16:24:26Z

Oh super interesting, thanks!

Just to make sure I understand GQL Alchemy right:

Will the data transferred over the network to memgraph/neo4j be transferred in arrow format, or is it a client-side ORM that will translate arrow to local objects and then construct regular bolt-protocol messages?
Any sense of expected speedups and why?

When I was looking at the repo, I think it still transmits over bolt, but maybe instead of doing a clientside ORM, it uses a serverside bulk CSV load, which may help? I couldn't tell however..

karmenrabar · 2023-09-20T08:45:39Z

It is a client-side ORM (OGM) that translates tables to graph with a proper configuration. It does that with GQLAlchemy query builder that builds Cypher query which is being run over Bolt. So, you are right, I wouldn’t expect any speedups since it’s not using LOAD CSV clause with preset indexes (which is the best way of import). But, it should still be as fast as running simple Cypher queries like I did.

:)

Add memgraph tutorial

59caaca

lmeyerov reviewed Sep 14, 2023

View reviewed changes

demos/demos_databases_apis/memgraph/visualizing_iam_dataset.ipynb Outdated Show resolved Hide resolved

lmeyerov self-requested a review September 14, 2023 08:09

lmeyerov requested changes Sep 14, 2023

View reviewed changes

Changed user/pass and updated screenshots

568c83d

lmeyerov added 3 commits September 14, 2023 18:07

docs(memgraph demo): update text

a5511d7

docs(memgraph demo): update text 2

5490ea2

docs(memgraph demo): update text 3

d64b71a

docs(changelog); add memgraph tutorial

e14b839

lmeyerov merged commit db31c0a into graphistry:master Sep 16, 2023
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add memgraph tutorial #507

Add memgraph tutorial #507

karmenrabar commented Sep 14, 2023

lmeyerov left a comment

karmenrabar commented Sep 14, 2023

lmeyerov commented Sep 14, 2023

karmenrabar commented Sep 15, 2023

lmeyerov commented Sep 16, 2023

karmenrabar commented Sep 17, 2023

lmeyerov commented Sep 17, 2023 •

edited

Loading

karmenrabar commented Sep 18, 2023

lmeyerov commented Sep 18, 2023

karmenrabar commented Sep 20, 2023

Add memgraph tutorial #507

Add memgraph tutorial #507

Conversation

karmenrabar commented Sep 14, 2023

lmeyerov left a comment

Choose a reason for hiding this comment

karmenrabar commented Sep 14, 2023

lmeyerov commented Sep 14, 2023

karmenrabar commented Sep 15, 2023

lmeyerov commented Sep 16, 2023

karmenrabar commented Sep 17, 2023

lmeyerov commented Sep 17, 2023 • edited Loading

karmenrabar commented Sep 18, 2023

lmeyerov commented Sep 18, 2023

karmenrabar commented Sep 20, 2023

lmeyerov commented Sep 17, 2023 •

edited

Loading