Multiple RETURN links import now possible #3354

CasperWA · 2019-09-26T22:18:54Z

aiida/tools/importexport/dbimport/backends/django/__init__.py

sphuber · 2019-09-30T09:47:51Z

So if I understand correctly, the only real change was that the if-condition was comparing to LinkType.RETURN instead of LinkType.RETURN.value and therefore was always false, causing an unnecessary exception to be raised. Is that correct, @CasperWA?

Unfortunately, looking at the current code, there are a lot of problems that really should be addressed sooner rather than later. The whole point of #1762 was to get to a state where we know what we export and import, but seeing this code I am afraid we do not actually have much of an idea. But I agree that we cannot tackle that right now, so for me it is also fine to merge this now.

CasperWA · 2019-09-30T09:55:01Z

> So if I understand correctly, the only real change was that the if-condition was comparing to LinkType.RETURN instead of LinkType.RETURN.value and therefore was always false, causing an unnecessary exception to be raised. Is that correct, @CasperWA?

This is correct.
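
For illustration, a minimal sketch of this kind of bug. The enum below is a stand-in written for this example, not the real LinkType, and the helper names are hypothetical; it only assumes that the link type is stored as a plain string.

```python
from enum import Enum


class LinkType(Enum):
    """Stand-in enum for illustration only; the values are assumptions."""

    CREATE = 'create'
    RETURN = 'return'


def is_return_buggy(stored_type: str) -> bool:
    # A plain string never compares equal to an Enum member,
    # so this condition is always False.
    return stored_type == LinkType.RETURN


def is_return_fixed(stored_type: str) -> bool:
    # Comparing against the member's value matches the stored string.
    return stored_type == LinkType.RETURN.value


print(is_return_buggy('return'))  # False
print(is_return_fixed('return'))  # True
```

With the buggy comparison, the branch meant for RETURN links is never taken, so such links fall through to whatever handling comes next.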

> Unfortunately, looking at the current code, there are a lot of problems that really should be addressed sooner rather than later. The whole point of #1762 was to get to a state where we know what we export and import, but seeing this code I am afraid we do not actually have much of an idea. But I agree that we cannot tackle that right now, so for me it is also fine to merge this now.

Yeah, I have slowly been extracting each step of the export and import functions, where possible, into backend-agnostic utility functions. This should hopefully make each step of the process easier to understand and deal with. However, it is taking forever.

sphuber · 2019-09-30T09:59:18Z

I understand, and your work has already made the organization of the code, and the code itself, a lot better. But eventually small steps won't solve this particular problem. At some point we will have to face the music and properly design an interface for export/import that matches the ORM implementation but also allows for efficient bulk inserts. This is a big challenge, but it is the only way to fix it once and for all.

CasperWA · 2019-09-30T10:05:12Z

I completely agree.

CasperWA · 2019-10-01T17:21:31Z

@sphuber @giovannipizzi this is ready for review. I have re-implemented the validate_link function from aiida.orm.utils.links. Note that for "unique_triple" Links, a violation is essentially equivalent to stating "this Link already exists", so if the util function link_triple_exists returns True, the import simply skips the Link in question instead of raising.

It is neither the fastest nor the prettiest fix, but it should be more rigorous in validating the Links created on import.
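
A rough sketch of the skip-instead-of-raise behaviour described above. It is not the actual implementation: links are modelled as plain tuples, and `triple_already_known` and `select_links_to_store` are hypothetical helpers that only stand in for the role of the existence check.

```python
from typing import Iterable, List, Set, Tuple

# (source UUID, target UUID, link type, link label); an illustrative model only
Link = Tuple[str, str, str, str]


def triple_already_known(existing: Set[Link], link: Link) -> bool:
    """Hypothetical stand-in for the existence check on a link."""
    return link in existing


def select_links_to_store(existing: Set[Link], incoming: Iterable[Link]) -> List[Link]:
    """Return the incoming links that are not yet known, skipping the rest."""
    to_store = []
    for link in incoming:
        if triple_already_known(existing, link):
            # The link is already present: skip it rather than raise.
            continue
        to_store.append(link)
    return to_store


existing = {('work1', 'data1', 'return', 'result')}
incoming = [
    ('work1', 'data1', 'return', 'result'),  # already present, skipped
    ('work2', 'data1', 'return', 'result'),  # new, kept
]
print(select_links_to_store(existing, incoming))
```

Only the second link is selected, since the first one matches an existing entry exactly.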

aiida/tools/importexport/dbimport/backends/django/__init__.py

CasperWA · 2019-10-02T19:42:23Z

I became unsure whether the ad hoc rules would actually work, feeling that something was missed between the usage of .get_outgoing and .get_incoming in the actual validate_link() function.
It seemed to me that if the Link type and label were the same as for an existing incoming Link of a Node, while we are only concerned with whether the existing outgoing Links are similar (possible only for CALL_WORK, I think), then the ad hoc rules here would not catch the distinction and would simply raise, even though they did not need to.

However, when writing the test, I realized that this case can never occur, due to the combination of the fixed Link rule to follow CALL_WORK forward and the fact that only sealed Nodes may be exported.
I.e., a graph:

work1 -> work2 -> work3

cannot be exported, following an export of

work1 -> work2 -> work3
           |
           +-> work4

with the same Node UUIDs, since the Nodes must be sealed, which means the Link between work2 and work4 cannot be created after the export of the first graph.
On the other hand, the sub-graph equal to the first graph cannot be exported from the second graph due to the Link follow rules (non-changeable call_work_forward=True, changeable call_work_backward=False).

sphuber

One more thing to fix and then this is good to go

sphuber · 2019-10-03T07:57:49Z

aiida/tools/importexport/dbimport/backends/django/__init__.py

                        )
+                    )

                # New link
                links_to_store.append(


The link checking itself now seems correct, very nice. There is just one thing we realized. In this logic we should not rely on export logic, i.e. what export can legally export. So for example, we should not assume that the contents of an archive are necessarily consistent. This means we have to check that the links within an archive are also consistent. To do this, we simply have to update the existing_ sets at this point. Then in the next iterations, the added link will be included in the existence check and if the archive itself contains duplicate links, it will be noticed.
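
To illustrate the idea, here is a simplified sketch in which accepted links are added to the existence sets inside the loop. All names are hypothetical, and the uniqueness rule (at most one incoming link of a given type per target) is an assumption chosen only to make the example concrete.

```python
from typing import Iterable, List, Set, Tuple

Link = Tuple[str, str, str, str]  # (source, target, link type, label)


class LinkValidationError(ValueError):
    """Hypothetical error for a link that violates the illustrative rule."""


def import_links(existing_links: Set[Link], archive_links: Iterable[Link]) -> List[Link]:
    """Validate archive links against the database and against each other."""
    known_links = set(existing_links)  # work on a copy of the existing links
    # Illustrative rule: one incoming link of a given type per target node.
    known_incoming = {(target, link_type) for _, target, link_type, _ in known_links}
    links_to_store = []

    for link in archive_links:
        _, target, link_type, _ = link
        if link in known_links:
            continue  # exact duplicate, nothing to do
        if (target, link_type) in known_incoming:
            raise LinkValidationError('conflicting link: {}'.format(link))
        links_to_store.append(link)
        # Update the sets now, so later links in the same archive are
        # checked against this one as well.
        known_links.add(link)
        known_incoming.add((target, link_type))

    return links_to_store
```

Because the sets are updated on every iteration, two conflicting links inside one archive raise even when the database is empty, and an exact duplicate inside the archive is stored only once.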

CasperWA · 2019-10-03

Done. I have done the same for the SQLAlchemy backend. However, the only way to do this was to not "bulk create" the Links after the for loop, but instead to add each new Link in every iteration using session.add(Link).

Btw, for the SQLAlchemy import I also checked that the new Links are correctly added continuously and can be found using the QueryBuilder in the subsequent iteration of the for loop.

sphuber · 2019-10-03T11:12:59Z

Thanks a million @CasperWA, great stuff! 👍

CasperWA requested review from giovannipizzi and AntimoMarrazzo September 26, 2019 22:18

giovannipizzi reviewed Sep 30, 2019

CasperWA force-pushed the fix_3353_import_multiple_returns branch from 154a3dc to 7e971be on October 1, 2019 17:18

CasperWA force-pushed the fix_3353_import_multiple_returns branch from 7e971be to b6ce2b0 on October 2, 2019 12:19

sphuber requested changes Oct 2, 2019

CasperWA commented Oct 2, 2019

CasperWA force-pushed the fix_3353_import_multiple_returns branch from b6ce2b0 to a8fc655 on October 2, 2019 19:29

sphuber requested changes Oct 3, 2019

CasperWA added 5 commits October 3, 2019 11:07

Multiple RETURN links import now possible

e213bd0

Redo Link validation for import functions

c9ed5fe

Fix test - sort lists before assertion

904223c

Address review by @sphuber

9573472

New links are continuously added to existing links

ab8b6d3

CasperWA force-pushed the fix_3353_import_multiple_returns branch from a8fc655 to ab8b6d3 on October 3, 2019 09:27

giovannipizzi approved these changes Oct 3, 2019

sphuber approved these changes Oct 3, 2019

sphuber merged commit b2492e2 into aiidateam:develop Oct 3, 2019

CasperWA deleted the fix_3353_import_multiple_returns branch October 3, 2019 16:30
