Grakn Crash: Inserting two things and matching one #6304
Comments
Ok -- this issue is due to the long
Thanks. Is that issue a large problem to solve, or is there a workaround that can be put in place? For some context, although the match might seem long, it's actually a very common query that we execute on our other graphs (Neo4j and NetworkX).
Had a quick look. I don't think the algorithm would take more than a day, but we have to shift some of the architecture to accommodate an elegant implementation, which could take some time. After a quick chat with haikal, we think it'll probably get done in the next month or so. The short-term workaround is ugly as heck: split your query into two parts (for this one, you only have to knock off a couple of the attributes at the end), then pipe the IIDs you get from the first query into the second and filter out the ones that do not satisfy the second query. Agreed that this is a priority (the other issue is labelled blocker), as people do write long queries, and this is also used to validate rules, which can get large, and then the validation takes ages!
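For anyone wanting to try the split-query workaround, here is a rough Python sketch of the idea. It is not tied to the Grakn client API: `run_query` and `second_query_for` are hypothetical callables you would supply yourself (the first runs a query string and yields the IIDs of the matched concepts, the second builds the remaining part of the match restricted to one IID), so treat this as an outline under those assumptions rather than working client code.

```python
from typing import Callable, Iterable, List


def two_stage_match(
    run_query: Callable[[str], Iterable[str]],
    first_query: str,
    second_query_for: Callable[[str], str],
) -> List[str]:
    """Return the IIDs that satisfy both halves of a split match query.

    run_query and second_query_for are hypothetical helpers supplied by
    the caller; they are not part of any Grakn client library.
    """
    survivors = []
    # Stage 1: run the shortened match and collect candidate IIDs.
    for iid in list(run_query(first_query)):
        # Stage 2: re-check each candidate against the rest of the pattern.
        # Any answer at all means the candidate satisfies the second half.
        if any(True for _ in run_query(second_query_for(iid))):
            survivors.append(iid)
    return survivors
```

This issues one extra query per candidate, so it only makes sense when the first stage returns a small number of IIDs.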
Description
On an empty database, executing separate insert, match, and insert queries causes Grakn to use substantial CPU resources. A few such cases in succession easily lead to a situation where all resources are exhausted and the AWS EC2 instance goes down, and/or a situation where Grakn never seems to release the resources.
Environment
Reproducible Steps
Steps to create the smallest reproducible scenario:
Load the following schema:
Execute the following Python code:
Expected Output
The database does not consume this much CPU and memory.
Actual Output
CPU usage: 126%, Memory: 65.4% (2.6GB) (on a 2-core t3-medium EC2 instance)
Additional Information
This is on a database with no data to start with.