-
Notifications
You must be signed in to change notification settings - Fork 206
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add memgraph tutorial #507
Conversation
demos/demos_databases_apis/memgraph/visualizing_iam_dataset.ipynb
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Super cool!
- Can you scrub & rotate the user/pass, and maybe switch to api tokens?
- The github preview wasn't showing the screenshots some reason here, maybe check?
- This file is 0.5MB b/c the screenshots, maybe there is a way to host the images outside of the code repo? Not sure of a good pattern there
Thank you, @lmeyerov, for reviewing the PR and providing constructive comments! I've made the changes; hopefully, it's better now. Please let me know 😊 I've hosted the images in my public repo and outside the code one |
@karmenrabar I dug into the text and am enjoying this tutorial, this should be quite helpful for folks!
|
Amazing, thank you for additional text, I appreciate it @lmeyerov ! I'm super glad it could be helpful. Unfortunately, it appears that |
Thanks @karmenrabar , merged! RE: |
@lmeyerov amazing, thank you ! Similarly to
You can also generate graph schema using Memgraph Lab, which provides a visual user interface for managing and interacting with your graph data (source). |
Awesome - I think it'd help to update the tutorial to that, or a sample Thinking through making this useful for our community, can the data creation step switch to a pandas -> apache arrow upload, and for the fetch step, to apache arrow download? A lot of our users like to work with hundreds of thousands or millions of events & entities, and assuming speed on memgraph side, we find this to keep interactions subsecond |
Memgraph indeed offers data loading capabilities using PyArrow and it is done by using GQL Alchemy.. However, a different driver is used and, to achieve the fastest performance when executing queries, it's best to use it with pre-defined indexes. Also, the data format suitable for PyArrow differs from the one that is used here. But, it's a good idea to explore for a next project ! |
Oh super interesting, thanks! Just to make sure I understand GQL Alchemy right:
When I was looking at the repo, I think it still transmits over bolt, but maybe instead of doing a clientside ORM, it uses a serverside bulk CSV load, which may help? I couldn't tell however.. |
It is a client-side ORM (OGM) that translates tables to graph with a proper configuration. It does that with GQLAlchemy query builder that builds Cypher query which is being run over Bolt. So, you are right, I wouldn’t expect any speedups since it’s not using LOAD CSV clause with preset indexes (which is the best way of import). But, it should still be as fast as running simple Cypher queries like I did. :) |
I've created a demo Jupyter notebook that demonstrates how to generate PyGraphistry visualizations using the Python driver for Neo4j, while working with data in Memgraph. Additionally, I've included a README file to illustrate the process of connecting to Memgraph.