
Add predictor/collection upload route #611

Merged
merged 19 commits into master from enh/upload_pred on Jul 10, 2019
Conversation

@adelavega (Collaborator) commented Jul 3, 2019

  • Partially addresses #603 (Interface for uploading for custom annotations) by adding a route to upload custom predictors
  • Fixes an issue where the API docs did not indicate the file type correctly
  • To fix the above, and to better organize the core app, I separated the API docs into their own file, which can then be imported from other modules without circular import issues. The resulting core.py is also much cleaner, and I reorganized the imports so that only three need to come later in the file (again, because of complicated import dependencies). See the sketch below.

Note: because of sequential db changes, this depends on #607 being merged first.
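
For context, a minimal sketch of the import pattern this refactor follows, assuming a flask-apispec-style docs object; the module names here are hypothetical, not the actual Neuroscout layout:

# apidocs.py (hypothetical) -- holds only the API docs object
from flask_apispec import FlaskApiSpec

docs = FlaskApiSpec()

# core.py -- creates the app and binds the docs object to it
from flask import Flask
from apidocs import docs

app = Flask(__name__)
docs.init_app(app)

# resources/predictor.py -- imports the docs object from apidocs
# rather than from core, so no circular import arises
from apidocs import docs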

@adelavega adelavega changed the title from "Add predictior upload route" to "Add predictor upload route" on Jul 3, 2019
@codecov-io commented Jul 3, 2019

Codecov Report

Merging #611 into master will decrease coverage by 1.74%.
The diff coverage is 34.48%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #611      +/-   ##
==========================================
- Coverage   72.54%   70.79%   -1.75%     
==========================================
  Files          55       56       +1     
  Lines        2065     2164      +99     
==========================================
+ Hits         1498     1532      +34     
- Misses        567      632      +65
Impacted Files Coverage Δ
neuroscout/resources/__init__.py 100% <ø> (ø) ⬆️
celery_worker/tasks.py 0% <0%> (ø) ⬆️
celery_worker/app.py 0% <0%> (ø) ⬆️
celery_worker/upload.py 0% <0%> (ø)
neuroscout/models/auth.py 96% <100%> (+0.16%) ⬆️
neuroscout/schemas/predictor.py 100% <100%> (ø) ⬆️
neuroscout/schemas/user.py 92.59% <100%> (+0.28%) ⬆️
neuroscout/core.py 67.74% <100%> (-2.85%) ⬇️
neuroscout/models/__init__.py 100% <100%> (ø) ⬆️
neuroscout/populate/ingest.py 92.36% <100%> (ø) ⬆️
... and 4 more

Continue to review full report at Codecov.

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7498ec2...b77bd53. Read the comment docs.

@adelavega (Collaborator, Author)

The upload route is working (still need to test Celery task).

The correct request looks like:

curl -X POST "http://localhost/api/predictors/create" -H  "accept: application/json" -H  "authorization: JWT [JWT]" -H  "Content-Type: multipart/form-data" -F "dataset_id=9" -F "runs=258,256" -F "runs=252,251" -F "event_files=@/path/run_1.tsv;type=text/tab-separated-values" -F "event_files=@/path/run_2.tsv;type=text/tab-separated-values" -F "collection_name=test"

@adelavega (Collaborator, Author)

Note the repetition of the runs and event_files parameters (one pair per file), while the run IDs within each runs value are comma-separated.
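
For reference, a minimal sketch of the same request using the Python requests library (the host, token, run IDs, and file paths are just the placeholders from the curl example above):

import requests

# Repeated form fields are passed as lists of (key, value) tuples,
# mirroring the repeated -F flags in the curl command.
data = [
    ("dataset_id", "9"),
    ("runs", "258,256"),  # run IDs for the first event file, comma-separated
    ("runs", "252,251"),  # run IDs for the second event file
    ("collection_name", "test"),
]

files = [
    ("event_files", ("run_1.tsv", open("/path/run_1.tsv", "rb"), "text/tab-separated-values")),
    ("event_files", ("run_2.tsv", open("/path/run_2.tsv", "rb"), "text/tab-separated-values")),
]

resp = requests.post(
    "http://localhost/api/predictors/create",
    headers={"Authorization": "JWT [JWT]"},
    data=data,
    files=files,
)
print(resp.json())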

@adelavega (Collaborator, Author)

Upload route is working!

Here is a summary.

  1. Make a POST request to /predictors/create with your JWT.
    The number of runs entries (lists of run IDs) must equal the number of event files being uploaded.
    Any given run can only be associated with a single event file.
    Event files can contain multiple new Predictors, but all of the event files uploaded at once must have the same columns.

Example:

curl -X POST "http://localhost/api/predictors/create" -H "accept: application/json" \
  -H "authorization: JWT [JWT]" -H "Content-Type: multipart/form-data" \
  -F "dataset_id=9" -F "runs=258,256" -F "runs=252,251" \
  -F "event_files=@/path/run_1.tsv;type=text/tab-separated-values" \
  -F "event_files=@/path/run_2.tsv;type=text/tab-separated-values" \
  -F "collection_name=test"

Returns:

{
  "collection_name": "test", 
  "id": 13, 
  "predictors": [], 
  "status": "PENDING", 
  "traceback": null, 
  "uploaded_at": "2019-07-05T19:3"
}

This creates a new PredictorCollection which (like NeurovaultCollection or Report) is used to track the Celery status of the upload, and also serves to group together the new predictors that were uploaded. The collection is also associated with your user id.

To check the status, make a GET request:

curl -X GET "http://localhost/api/predictors/create?id=11" -H "accept: application/json"

which might return something like this if it succeeds:

{
  "collection_name": "test", 
  "id": 11, 
  "predictors": [
    {
      "id": 19244, 
      "name": "reaction"
    }
  ], 
  "status": "OK", 
  "traceback": null, 
  "uploaded_at": "2019-07-05T19:3"
}
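
If you are scripting against the API, a minimal polling sketch in Python (the helper name and timeouts are made up; it just wraps the GET request shown above):

import time
import requests

def wait_for_collection(collection_id, timeout=300, interval=5):
    # Poll the collection status until the Celery upload task finishes.
    deadline = time.time() + timeout
    while time.time() < deadline:
        resp = requests.get(
            "http://localhost/api/predictors/create",
            params={"id": collection_id},
            headers={"accept": "application/json"},
        )
        payload = resp.json()
        if payload["status"] != "PENDING":
            return payload  # e.g. "OK" on success; failures should populate the traceback field
        time.sleep(interval)
    raise TimeoutError("Collection %d is still pending" % collection_id)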

You can now use those predictors as normal, and they will display for all users (maybe this is not what we want? we might want a private/public option).

To see which collections you've created, make a GET request to /api/user with your JWT:

{
  "email": "user@example.com", 
  "first_login": null, 
  "name": null, 
  "picture": null, 
  "predictor_collections": [
    {
      "collection_name": "test", 
      "id": 11
    }
  ]
}
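
For completeness, the equivalent request in Python (same placeholder host and token as above):

import requests

resp = requests.get(
    "http://localhost/api/user",
    headers={"accept": "application/json", "Authorization": "JWT [JWT]"},
)
print(resp.json()["predictor_collections"])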

@adelavega adelavega marked this pull request as ready for review July 5, 2019 19:45
@adelavega (Collaborator, Author) commented Jul 5, 2019

WDYT? @rwblair

Does this seem reasonable for the frontend?

Minor design quibbles:

  • Maybe this should be under a different route? It might make more sense to have /predictor_collection/ and do POST and GET against that, rather than /predictors/create; that would be more "RESTful". OTOH, the current approach is more consistent with how we handle Neurovault uploads and reports, so I think it's probably fine as is.
  • It's also worth considering whether "PredictorCollection" is the appropriate name.

To do still:

  • Add tests
  • Check that clearing cache works

@adelavega (Collaborator, Author)

I added some basic tests. I can't fully test the upload because Flask doesn't write out to the test_db, so Celery can't see it.

Separately, we should fix this to allow real testing of Celery tasks (one possible approach is sketched below).
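
One possible approach, just a sketch and not what this PR implements: configure Celery to run tasks eagerly during tests, so they execute in-process and can at least be exercised directly (the import path for the Celery app object is a guess):

# conftest.py (sketch)
import pytest
from celery_worker.app import celery_app  # hypothetical name for the Celery app object

@pytest.fixture(autouse=True)
def eager_celery():
    # Run tasks synchronously in the test process instead of via a worker,
    # and re-raise task exceptions so test failures are visible.
    celery_app.conf.task_always_eager = True
    celery_app.conf.task_eager_propagate = True
    yield
    celery_app.conf.task_always_eager = False
    celery_app.conf.task_eager_propagate = False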

@adelavega adelavega changed the title from "Add predictor upload route" to "Add predictor/collection upload route" on Jul 10, 2019
@adelavega adelavega merged commit 302b37f into master Jul 10, 2019
@adelavega adelavega deleted the enh/upload_pred branch July 10, 2019 19:56