suspicious dwca parsing exception with escaped double quotes in json fragment #48
Comments
On a second look, I realized that there's actually an error in the data:
should be:
Notice the
@neilcobb - please note that record:
from https://scan-bugs.org:443/portal/content/dwca/MCZ_DwC-A.zip appears to have an invalid occurrence record at https://scan-bugs.org:443/portal/collections/individual/index.php?occid=26225229. Does Symbiota do any validation on escaped field values?
@neilcobb please note that at https://scan-bugs.org/portal/collections/individual/index.php?occid=26225229, the "correct" identification remarks are shown:
It appears that Symbiota adds an extra backslash.
Bug transferred to Symbiota/Symbiota-deprecated#130.
GloBI is using your dwca-io library (thanks!) for parsing dwc archives.
During routine integration testing (see https://travis-ci.org/globalbioticinteractions/scan/jobs/588191246#L229), I found:
I've isolated the offending line and reproduced the issue. On close inspection, I see the use of "" to escape double quotes in CSV for JSON fragments. However, I don't see any malformed CSV. Does dwca-io support ""-style escaping?
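For reference, here is a minimal sketch of what ""-style escaping looks like when parsed. It uses Python's standard csv module rather than dwca-io, so it says nothing about dwca-io's own behaviour. The row and the `remark` field are made up for illustration and are not taken from the MCZ archive.

```python
import csv
import io
import json

# Hypothetical two-column row (not from the MCZ archive): an id and a JSON
# fragment whose double quotes are escaped by doubling them (RFC 4180 style).
raw_line = '1,"{""remark"":""reared from gall""}"\n'


def parse_row(line):
    """Parse a single CSV line and return its fields as a list."""
    # csv.reader uses doublequote=True by default, so "" inside a quoted
    # field is read as one literal double quote.
    return next(csv.reader(io.StringIO(line)))


fields = parse_row(raw_line)

# Once the CSV layer has unescaped the quotes, the field is plain JSON again.
fragment = json.loads(fields[1])

print(fields)
print(fragment["remark"])
```

With this style of escaping the JSON fragment survives the round trip unchanged, so any extra character inserted before a doubled quote (such as a stray backslash) would end up inside the field value and may no longer parse as JSON.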