Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Export Invalid JSON #2

Closed
garywoodfine opened this issue May 17, 2019 · 3 comments
Closed

Export Invalid JSON #2

garywoodfine opened this issue May 17, 2019 · 3 comments

Comments

@garywoodfine
Copy link

When downloading a completed Data Annotation Task, with mutliple files included, the JSON file is not valid file because it is not defined as an array and is missing comma's between array entities.

i.e
{ "content" : "blah ..." .... } {"content" : "blah ..." ...}

should be

[ { "content" : "blah ..." .... } , {"content" : "blah ..." ...} ]

It makes it very difficult to ingest the file otherwise because it breaks Valid Json Formats and when working on large Annotation projects the files are large and very difficult to patch.

@gajju3588
Copy link
Collaborator

As input file is generally new line seperated, output is also new line seperated.

It was very hard to come to a common ground regarding this, but i guess you can add one more output format which will have a json array.

@garywoodfine
Copy link
Author

In my opinion exporting a valid JSON array format would be more compliant as most developers will be importing the output of this phase using JSON Parsers and modeling the classes on the JSON. At least it is the process I used.

Exporting the from the web app, and importing the contents to another database in order to build the model. In my case, we needed to transform the output to an input that is accepted by AWS Comprehend.

@DataTurks
Copy link
Owner

You can read each line and parse that programmatically. This is very similar to having a complete json array.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants