Invalid content #79

kaitj · 2020-05-14T19:21:12Z

Fixes the previously reported server error that occurs when invalid content is uploaded (#71). This PR detects that case by checking whether a version is found in the file, and adds a new test case for this particular error.

Additionally, removes an old error that is no longer raised!

- removed old ValueError (was not being raised and may have been from an earlier version)
- added IndexError, which is raised if a version is not found in the file
  - this is raised when the content contains non-standard .fcsv content (e.g. html)
- added a test case to test_model_auto.py
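
For illustration, the version check described above could look roughly like the sketch below. The regex, the function names, and the sample header line are assumptions made for this example; the project's actual pattern and code are not shown in this thread.

```python
import re

# Hypothetical pattern; the regex actually used by the project is not shown here.
VERSION_PATTERN = re.compile(r"version\s*=\s*(\d+(?:\.\d+)*)")


def parse_version(first_line):
    """Return the version string found in a header line.

    Indexing the empty list returned by findall raises IndexError,
    which is what happens for non-.fcsv content such as HTML.
    """
    return VERSION_PATTERN.findall(first_line)[0]


def has_version(content):
    """Return True if the first line of the content carries a version."""
    lines = content.splitlines()
    first_line = lines[0] if lines else ""
    try:
        parse_version(first_line)
    except IndexError:
        return False
    return True
```

Note that a check like this only inspects the header, so it says nothing about whether the rest of the file is well formed.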

tkkuehn

Largely looks good to me, with one criticism: In the case where the regex happens to match something on the first line but the rest of the document isn't a properly formatted csv, this will get through the check you added at line 102 but raise an unhandled csv.Error at line 120 (I think), so I don't think this fully solves the problem. Would like to see a test case for this (pathological) situation and some code to handle it gracefully.

kaitj · 2020-05-27T18:24:26Z

Looking at it again, I agree that we can probably have a better check in place, but I also see 3 cases where an error is going to get thrown if it passes the first check:

1. Invalid content (either line 129 or line 152)
2. Too few rows (if the content doesn't have the same number of rows, line 159)
3. Too many rows (as you mention, the error at line 120)

I think (1) or (3) is most likely to occur if the file passes that first test, in which case you'll probably be checking the .fcsv manually. We could write a check to properly validate the file, but I'm not sure how we'd implement that without checking the file line by line (essentially what we are already doing anyway). Any ideas? (In the meantime I am going to mock up a fake .fcsv with a proper header and see what happens.)

tkkuehn · 2020-05-27T18:31:43Z

I think we're already handling all those kinds of errors. I just mean that the csv reader won't know what to do when it starts parsing at line 120 if the file has no CSV-like structure, which will lead to an exception we're not handling. See the second-last example here for how to handle that kind of case.
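
A minimal sketch of one way to handle that case, assuming the content is available as an iterable of text lines and that header lines start with `#`. `InvalidContentError` and `read_rows` are hypothetical names for this example, not the project's code; `csv.Error` and `reader.line_num` are part of the standard `csv` module.

```python
import csv


class InvalidContentError(Exception):
    """Hypothetical error type for content that cannot be parsed as CSV."""


def read_rows(lines):
    """Parse an iterable of text lines, skipping '#' header comments.

    Any csv.Error raised while iterating is converted into
    InvalidContentError so the caller can report it gracefully.
    """
    data_lines = (line for line in lines if not line.startswith("#"))
    reader = csv.reader(data_lines)
    rows = []
    try:
        for row in reader:
            rows.append(row)
    except csv.Error as err:
        # line_num counts the lines the reader has consumed,
        # so it excludes the skipped header lines.
        raise InvalidContentError(f"line {reader.line_num}: {err}") from err
    return rows
```

The caller can then catch a single exception type and return an error message to the user instead of letting the parse failure surface as a server error.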

kaitj · 2020-05-27T18:49:37Z

Ahh I see what you mean now. Can discuss some more during the call today!

kaitj · 2020-05-28T20:10:13Z

Tristan's changes for handling a valid header with invalid content look good! Merging the PR and closing.

kaitj added 2 commits April 2, 2020 10:20

add fcsv with invalid content to create check

1334160

github-actions bot requested a review from tkkuehn May 14, 2020 19:21

github-actions bot assigned kaitj May 14, 2020

kaitj mentioned this pull request May 14, 2020

[Bug] Upload Error #71

Closed

fix error message

5463ffb

tkkuehn requested changes May 27, 2020

kaitj assigned tkkuehn May 27, 2020

kaitj and others added 2 commits May 28, 2020 09:35

add fcsv with valid header, invalid content

14be17a

Test for invalid content

f5116d7

tkkuehn unassigned kaitj May 28, 2020

kaitj merged commit fe47358 into master May 28, 2020

kaitj deleted the invalid-content branch May 28, 2020 20:10
