Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: Bumping Celery #10280

Merged
merged 1 commit into from Jul 10, 2020
Merged

Conversation

john-bodley
Copy link
Member

@john-bodley john-bodley commented Jul 10, 2020

SUMMARY

At Airbnb we had an incident with Celery where tasks weren't being picked up and the CPU was pegged at close to 100%. We configure Celery with the gevent pool and the only workaround to remedy the problem was to restart the workers sans gevent.

We haven't been able to track down the root cause of the problem, though a Google search surfaced a similar issue (though we actually don't call AsyncResult.get() since the SQL Lab results are stored elsewhere) which I believe was fixed in 4.4.2 (which is the version Superset currently uses). The issue seems to be that there are no release note associated with said release nor does that tag exist within the Celery GitHub repo. Given there have been a number of recent releases (which mostly look like bug fixes) I felt it was prudent that we bumped the version of Celery to the latest stable version.

Note we use a MySQL database for Superset and historically have been using the mysqlclient DBAPI but we're looking into using PyMySQL which fully supports gevent. We're not sure if this was the issue, but the Celery workers poll certain async engines, i.e., Presto, every few seconds per task and thus having a fully gevent compliant DBAPI may also be benefical.

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TEST PLAN

CI.

ADDITIONAL INFORMATION

  • Has associated issue:
  • Changes UI
  • Requires DB Migration.
  • Confirm DB Migration upgrade and downgrade tested.
  • Introduces new feature or API
  • Removes existing feature or API

Copy link
Member

@villebro villebro left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's a shame the otherwise well curated changelog is missing 4.4.1 and 4.4.2. But by the looks of it the more recent versions have been fixing many bugs, so I'm sure we're better off on 4.4.6 than 4.4.2.

@john-bodley john-bodley merged commit 3f2f5e8 into apache:master Jul 10, 2020
@john-bodley john-bodley deleted the john-bodley--bump-celery branch July 10, 2020 16:12
auxten pushed a commit to auxten/incubator-superset that referenced this pull request Nov 20, 2020
Co-authored-by: John Bodley <john.bodley@airbnb.com>
@mistercrunch mistercrunch added 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 0.37.0 labels Mar 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels size/XS 🚢 0.37.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants