Add CSV export query results module #199

katarinasupe · 2023-03-02T14:18:10Z

Description

I created csv_query procedure inside export_util module. Find arguments description in docstring.

Discussed with @Josipmrden:

Potentially use pandas instead of csv - what are the pros? - Use csv, pandas would be new dependency, and csv is not.
Leave config as a map or add stream as an argument. Another option is to return the stream and remove it from the arguments altogether. I vote for this. - The option to always return the stream is not good with big results, since it always has to write to stream, even if the user only wants to save in file. The config parameter does not make sense for MAGE modules, so we will go with stream as argument.
Probably edit the path inside the code to be /usr/lib/memgraph/query_modules instead of /var/lib/memgraph/internal_modules used for development. In json() procedure in this module, I left it to the user to provide the whole path and advised them to provide /usr/lib/memgraph/query_modules/something.json. This may be a better way to go. - We will go with path and handle potential errors

TODOs:

Use path as argument instead of file name
Add error handling for opening file path
Use stream as argument
Update README
Add tests
Write docs - [master < add-export-csv-query-procedure] Add export_util.csv_query() procedure docs docs#761

Pull request type

Algorithm/Module

######################################

Reviewer checklist (the reviewer checks this part)

Module/Algorithm

######################################

Josipmrden · 2023-03-03T07:53:44Z

As for Pandas, I think it makes it more easier to work with CSV files, but we can check what are the implications of adding another dependency into MAGE.

As for the second one, I'm fine with both.

ok, makes sense

antoniofilipovic · 2023-03-03T15:02:48Z

@katarinasupe what is benchmarking difference between csv and pandas? When doing writes. Is there any difference

katarinasupe · 2023-03-06T07:37:42Z

I did not benchmark it @antoniofilipovic. csv can be used out of the box, while pandas must be added as a new dependency to MAGE.

katarinasupe · 2023-03-06T18:39:54Z

@Josipmrden, do you know what is happening with tests? Suddenly they decided to fail 😂

Add CSV export query results module

32f054e

katarinasupe added lang: python Issue on Python codebase status: draft PR is in draft phase type: module labels Mar 2, 2023

katarinasupe self-assigned this Mar 2, 2023

katarinasupe requested a review from Josipmrden March 2, 2023 14:18

antoniofilipovic added this to the v1.6.1 milestone Mar 3, 2023

antoniofilipovic self-requested a review March 3, 2023 15:03

Add exceptions and change arguments

2f7f93e

Update black

69cfaa6

katarinasupe mentioned this pull request Mar 6, 2023

[master < add-export-csv-query-procedure] Add export_util.csv_query() procedure docs memgraph/docs#761

Merged

6 tasks

katarinasupe added 2 commits March 6, 2023 15:09

Fix docstring

d81278f

Small docstring fix

4eca3ca

katarinasupe added status: ready PR is ready for review and removed status: draft PR is in draft phase labels Mar 6, 2023

katarinasupe marked this pull request as ready for review March 6, 2023 15:06

katarinasupe and others added 4 commits March 7, 2023 17:46

Add tests

43b6b2c

Added gqlalchemy to main requirements, not test requirements

4ff10c5

add working test

7b38fa8

add empty line on test

0ff2060

antoniofilipovic approved these changes Mar 15, 2023

View reviewed changes

antoniofilipovic added status: ship it PR approved and removed status: ready PR is ready for review labels Mar 15, 2023

antoniofilipovic and others added 3 commits March 17, 2023 09:39

add empty line

dc804e7

Update test.yml

999b05d

Update test.yml

3619d35

antoniofilipovic added 7 commits March 17, 2023 14:56

update test.yml

fe1a567

fix workflow

63971a7

add load option

6a03197

try only prod

61348b7

revert back to pytorch 1.12

6be2bbb

add requirements

385bc43

revert changes in test.yml

f3b700a

antoniofilipovic merged commit d46005f into main Mar 20, 2023
4 checks passed

antoniofilipovic deleted the add-csv-export-query-module branch March 20, 2023 10:35

antepusic mentioned this pull request Apr 5, 2023

Switch to safe torch version #207

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CSV export query results module #199

Add CSV export query results module #199

katarinasupe commented Mar 2, 2023 •

edited

Josipmrden commented Mar 3, 2023

antoniofilipovic commented Mar 3, 2023

katarinasupe commented Mar 6, 2023

katarinasupe commented Mar 6, 2023

Add CSV export query results module #199

Add CSV export query results module #199

Conversation

katarinasupe commented Mar 2, 2023 • edited

Description

Pull request type

Reviewer checklist (the reviewer checks this part)

Module/Algorithm

Josipmrden commented Mar 3, 2023

antoniofilipovic commented Mar 3, 2023

katarinasupe commented Mar 6, 2023

katarinasupe commented Mar 6, 2023

katarinasupe commented Mar 2, 2023 •

edited