Beta2 rollout #323

beijbom · 2020-05-25T17:21:08Z

This is the base-branch for the new back-end rollout:

Spacer integration. Issue vision_backend app cleanup & testability #264
Switch to running jobs through AWS Batch [@beijbom ]
Add tables to track all backend jobs. Cut reliance of SQS to track completed jobs. [@beijbom]
Unskip current backend tests. [@StephenChan ]
More vision backend tests. [@StephenChan ]
Support for different extractors. Issue Enable support for different feature extractors. #235 [@StephenChan ]

BONUS (can be pushed to later)

More vision backend tests & SWE [@beijbom ]
Purge python 2.7 compatibility code. Issue Clean up Python 2.x compatibility code #364 [@StephenChan ]
Make reg-tests robust for all settings. Add comparison for classifier performance. [@beijbom]

…ross-platform (as far as could be manually tested dev-side). Manually tested the basic behavior for everything except the classify task (which got an error calling predict_proba() on the model; not sure if a pickle cross-platform issue or something else).

* added spacer 0.2 to requirement. Changed to chardet (not cchardet) in local installs
* fixed int conversion -- skipped failing unit-test for now
* Lots of work remaining, but got the extract_features task to execute using the new messaging and spacer system.
* started working on train classifier task
* fixed get_secret for spacer job hash
* added force_dummy_extractor setting
* reworked submit_classifier. Still need to handle the previous classifiers
* fixed PEP 8 line breaks
* classify_image task done
* updated deploy tasks.
* updated calls to add_scores and add_annotations to use the new interface
* encapsulated the spacer_job_token encoding and decoding
* renamed the backend abstraction to queue abstraction, simplified unique filename by using microseconds
* removed FORCE_NO_BACKEND_SUBMIT
* fixed the renaming from backends to queues
* upgrade pyspacer version, added test-queue for dev-config
* bumped pyspacer one version to fix the aws permission error
* fixed import from spacer to avoid using the fire module
* implemented path method for s3 storage backend
* renamed factory method from backend to queue
* abstracted away more stuff to the storage backend, enabled unit-tests with spacer and the s3 storage
* removed more usages of storage.path
* updated to use spacer storage to store train and val data
* added more train_classifier tests
* Updated views.py to use new data-classes. Renamed config and classes related to LocalQueue
* removed the regtests configs, updated pyspacer req
* bumped pyspacer version to 0.2.6
* Updated vb_regtests.py
* removed outdated scripts.py
* added some more tests for train_classifier
* removed hacky exists_full method on the storage classes. Removed redundant feature submit call
* made backend_overview page work even if celery is not running
* added full help text to regtests and more docstrings. Also sorted images to make sure the 'small' mode has at least 1 of each label
* fixed bug in tasks.reset_features(). Made the vb_regtests more robust
* One more safeguard against race conditions
* Changed to microseconds in comment.
* Update storage_backends.py
* Purged path_full
* Add migrations to change fields' string attributes from byte strings to Unicode strings. These were generated by `makemigrations` after switching to Python 3. None of the attributes seem to contain non-ASCII characters, so the type conversion should not pose issues.
* Skip the images 0023 migration test because it now fails on Travis.
* RemovePointOutliersTest was unreliable to begin with on Windows dev environment, and now it seems unreliable on Travis too. Just skip for now.
* Update vb_regtests.py Fixed typo in instructions.

Co-authored-by: StephenChan <StephenChan@users.noreply.github.com>
Co-authored-by: StephenChan <stephenjchan@gmail.com>

beijbom · 2020-06-18T22:36:27Z

@StephenChan: I merged the first PR. Do you want to take a stab next and address any combination of your 3 items in the PR description? It doesn't seem like the order matters that much. Once done, I'll take one more pass on the vision_backend app.

StephenChan · 2020-06-19T02:54:22Z

> Do you want to take a stab next and address any combination of your 3 items in the PR description. Doesn't seem like the order matter that much. Once done I'll take one more pass on the vision_backend app.

Sure. I can do 1) more vision backend tests and 3) support for different extractors. The second item, 'Purge python 2.7 compatibility code', doesn't necessarily have to happen before the rollout. Though I'll make sure to use Py3-only code for this branch.

beijbom · 2020-06-19T04:30:09Z

Great. Let’s test explicitly for duplicate row/col values. I didn’t handle that explicitly, so I’m curious if it works out anyways.
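A test along these lines could be sketched as follows. This is only an illustration of the idea, not the project's code: `features_for_points` and its `extract_one` callback are hypothetical stand-ins for whatever the backend does per point, and the assumed contract (one output per input point, in input order, duplicates kept) is an assumption rather than confirmed behavior.

```python
import unittest


def features_for_points(points, extract_one):
    """Hypothetical stand-in for a per-point feature extractor.

    Assumed contract: return one feature per input point, in input order,
    so duplicate (row, col) pairs yield duplicate features instead of
    being collapsed into a single entry.
    """
    cache = {}
    features = []
    for row, col in points:
        # Compute each distinct location once, but emit it for every point.
        if (row, col) not in cache:
            cache[(row, col)] = extract_one(row, col)
        features.append(cache[(row, col)])
    return features


class DuplicatePointsTest(unittest.TestCase):
    def test_duplicates_are_preserved(self):
        points = [(10, 20), (10, 20), (30, 40)]
        features = features_for_points(points, lambda r, c: [r + c])
        # One feature per point, even though two points share a location.
        self.assertEqual(len(features), len(points))
        self.assertEqual(features[0], features[1])
```

The point of the test is the length check: a pipeline that deduplicates locations internally would return fewer features than points, and downstream code that pairs features with points by index would then misalign.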

StephenChan · 2020-06-21T20:28:19Z

> Great. Let’s test explicitly for duplicate row/col values. I didn’t handle that explicitly, so I’m curious if it works out anyways.

Agreed, there should be tests for that.

Also, a follow-up on the string-attribute related migrations from PR #324:

> Just in case, I'll set up a Python 3 environment on our staging server and see if that encounters any issues with the new migrations.

I set up Python 3 on the staging server. The migration unit test fails, just like on Travis. However, the migration itself seems okay. I ran the images 0024 migration, and started the staging server. Didn't see any issues with existing metadata. Rolled it back to 0023, still no issues seen. So it's likely the unit test's issue.

* Attempting to un-skip test_deploy.py's SuccessTest.test_done, but with LocalQueue it gets `TypeError: Object of type 'int32' is not JSON serializable`.
* bumped pyspacer version
* Fix the rest of test_deploy.SuccessTest.test_done.
* Unskip test_deploy.TaskErrorsTest.test_classifier_deleted. Add a test for handling spacer-side deploy errors.
* Unskip the last two skipped deploy tests.
* Improve explanation on skipping test_backend_overview.
* Fix TestDeployCollector; it was broken by a previous commit.

Co-authored-by: StephenChan <stephenjchan@gmail.com>
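For context on the `int32` error mentioned above: Python's `json` module only serializes built-in types, so a numpy scalar in a result payload raises `TypeError`. The sketch below shows one generic way to handle that with a `default=` hook. It is not necessarily how this was resolved here (the commit list only says the pyspacer version was bumped). `FakeInt32`, `json_default`, and the payload keys are hypothetical names; `FakeInt32` mimics only the `.item()` method that numpy scalars provide, so the example runs without numpy installed.

```python
import json


class FakeInt32:
    """Stand-in for numpy.int32 so this sketch runs without numpy installed."""

    def __init__(self, value):
        self._value = value

    def item(self):
        # numpy scalars expose .item() to return the equivalent Python scalar.
        return self._value


def json_default(obj):
    """Fallback for json.dumps: unwrap objects that offer .item()."""
    if hasattr(obj, "item"):
        return obj.item()
    raise TypeError(
        f"Object of type {type(obj).__name__} is not JSON serializable")


# Hypothetical payload shape, for illustration only.
payload = {"label_id": FakeInt32(7), "score": 0.93}

try:
    json.dumps(payload)
    plain_dump_failed = False
except TypeError:
    # Without a default hook, the non-builtin scalar is rejected.
    plain_dump_failed = True

encoded = json.dumps(payload, default=json_default)
```

An alternative is to convert values at the source, for example with `int(value)` or `array.tolist()`, which keeps the serialization call itself unchanged.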

…t class. (Testing the classify-related util functions individually would still be worthwhile, but ClassifyUtilsTest was kind of in an odd middle ground, somewhere between single-function unit tests and full integration tests. Essentially, this commit pushes it to full integration tests.)

…or that source. Warn the user appropriately. - Move the Source Edit cancel button to a separate form, to make it easier to assign a submit handler to the main form. - Rename the reset_after_labelset_change task to reset_backend_for_source.

… resets classifiers and classifications, and one that resets that plus features. - Expand on tests involving these tasks. - Refactor other VB tasks tests, mainly removing the need to decorate every BaseTaskTest subclass with mock.patch().

StephenChan

This looks finally ready to go, as far as I can tell.

@beijbom Any other loose ends you can think of, besides the announcement blog post?

beijbom · 2020-12-24T00:05:24Z

Very nice! Nope; this should be good to go!

kriegman · 2020-12-24T04:07:33Z

Yeah. I'll have the blog post for the feature extractor/classifier fully drafted tonight. But that's not really an announcement of the full release.

StephenChan added 6 commits May 20, 2020 21:29

Make the checkformissingimages command Py3 compatible (as far as coul…

e657e0f

…d be manually tested dev-side).

Make the vb_export_spacer_data command Py3-compatible and Unicode-pre…

c75806d

…serving (as far as could be manually tested dev-side).

Make the vb_inspect_extracted_features command cross-platform. Py3 al…

dff2bf5

…ready works (as far as could be manually tested dev-side).

Bump numpy version from 1.11.1 to 1.18.1 to be compatible with pyspac…

3f51a23

…er. Note that 1.17+ drops Python 2 support.

Remove Python 2.7 from Travis, since it won't work with the updated n…

23f6d06

…umpy.

beijbom added the work in progress label May 25, 2020

beijbom assigned StephenChan May 25, 2020

beijbom and others added 4 commits May 25, 2020 14:10

Merge branch 'master' into beta2-rollout

e16223a

Merge branch 'master' into beta2-rollout

318f797

Merge branch 'master' into beta2-rollout

eed5262

beijbom self-assigned this Jun 18, 2020

Merge branch 'master' into beta2-rollout

d1b3230

beijbom added 2 commits September 12, 2020 15:48

Merge branch 'master' into beta2-rollout

d282b67

Merge branch 'master' into beta2-rollout

0bf5f45

beijbom mentioned this pull request Sep 25, 2020

vision_backend app cleanup & testability #264

Closed

StephenChan mentioned this pull request Oct 4, 2020

Last annotation field + sorting options in Browse #349

Merged

beijbom added 4 commits October 20, 2020 13:53

merge commit with master. Some tests failing

16221a3

Beta2 rollout migration fix (#354)

07ce440

* Shuffled such that py3 migration is the last one.
* Added some test settings overrides

Aws batch integration (#355)

4e818c8

* AWS batch integration

bumped up pyspacer to 0.3.0

bd6bc22

beijbom mentioned this pull request Oct 25, 2020

Changed to compressed numpy array for storing features coralnet/pyspacer#33

Merged

beijbom and others added 3 commits November 12, 2020 20:03

Write tests for running VB tasks with duplicate point locations.

f301670

Test the clean_up_old_batch_jobs task.

d77c236

StephenChan added 5 commits November 26, 2020 01:33

vision_backend/test_tasks.py: Ensure that all tests which train a cla…

9d7427f

…ssifier use a mock Image.valset property, so that the image counts going into training vs. validation can be easily controlled.
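As a rough illustration of the mocking approach described in this commit message, the sketch below patches a property with `unittest.mock.PropertyMock` so a test can dictate which images count as validation. The `Image` class here is a hypothetical stand-in, not the real model; its "every 8th image" rule and the `split_counts` helper are assumptions made only so the example is self-contained.

```python
from unittest import mock


class Image:
    """Hypothetical stand-in for the real Image model."""

    def __init__(self, pk):
        self.pk = pk

    @property
    def valset(self):
        # Assumed rule for illustration only: every 8th image is validation.
        return self.pk % 8 == 0


def split_counts(images):
    """Count how many images fall into training vs. validation."""
    val = sum(1 for img in images if img.valset)
    return len(images) - val, val


images = [Image(pk) for pk in range(1, 6)]

# side_effect supplies one value per property access, in order, so the
# test decides the split regardless of the real valset rule.
with mock.patch.object(
    Image, "valset", new_callable=mock.PropertyMock,
    side_effect=[False, False, False, True, True],
):
    train_count, val_count = split_counts(images)
```

With the patch active, three images go to training and two to validation; outside the `with` block the original property is restored.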

Add more tests for feature extraction, training, and classification. …

79849b1

…Reorganize vision_backend task test files.

Update dev_stephen settings.

f5cb9ae

Make vision_backend_api.tests.DeployResultEndpointTest.test_success t…

52b679f

…olerant of any score values from deploy, since it seems the values can vary.

StephenChan mentioned this pull request Dec 1, 2020

More tests for vision backend tasks #357

Merged

StephenChan and others added 16 commits December 3, 2020 18:32

Make vision_backend_api.tests.test_deploy.SuccessTest.test_done more …

e4ab80c

…robust by not assuming certain scores.
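One way to make a test independent of exact score values is to check only the structure and sanity of the scores. The helper below is a hypothetical sketch of that idea, not the actual test code; `assert_valid_scores` and the assumption that scores arrive as a flat list of numbers are both illustrative.

```python
import math


def assert_valid_scores(scores, expected_count):
    """Check count and sanity of scores without pinning exact values."""
    if len(scores) != expected_count:
        raise AssertionError(
            f"expected {expected_count} scores, got {len(scores)}")
    for score in scores:
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(score, bool) or not isinstance(score, (int, float)):
            raise AssertionError(f"score is not a number: {score!r}")
        if not math.isfinite(score):
            raise AssertionError(f"score is not finite: {score!r}")
```

A test using this would call, for example, `assert_valid_scores(scores, expected_count=3)` instead of comparing against hard-coded values, so it keeps passing when the classifier output varies between runs.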

Add debug info to ClassifyImageTest.test_classify_unconfirmed_image, …

2638124

…so that we can hopefully see what's going on whenever it decides to fail in Travis again (seems it may be an intermittent thing, and only happens in Travis).

removed conversion to set to support backwards compatibility with old…

592f2ee

… features representation

fixed outdated comments

8f3aac0

updated extract features dupe test to updated logic

b65c823

Merge pull request #357 from beijbom/vb-tasks-more-tests

6b0d37a

More tests for vision backend tasks

Merge branch 'master' into beta2-rollout

b588d06

Add a feature extractor field to the Source model.

89be8d9

- Existing sources get set to VGG16, and new sources default to EfficientNet.
- Allow the field to be set in the Create Source and Edit Source forms.
- Display the field value on the source main page.

Remove 'beta' branding.

b125cf8

Fill in help text for the source feature-extractor field.

6d2c9eb

In the VB reset task tests, ensure that backend objects get re-create…

3ff9b47

…d after being deleted.

Fix VB reset task tests: 1) Ensure the valset function is mocked. 2) …

556b782

…Not a fix exactly, but remove unnecessary collect_all_jobs() calls.

Merge pull request #360 from beijbom/extractor-choice

850c884

Enable support for different feature extractors

Merge pull request #363 from beijbom/remove-beta-branding

8f6fe4d

Remove 'Beta' branding

StephenChan self-requested a review December 23, 2020 21:35

StephenChan approved these changes Dec 23, 2020

StephenChan merged commit 5f7c6ad into master Dec 24, 2020

StephenChan deleted the beta2-rollout branch November 11, 2023 04:00
