Item | Presenter | Notes |
---|
Discuss access for teams that maintain many repos across the org | | |
Continued discussion on whether we should change the Depr 6-month window approach. Should we have one big ticket for something like Python 3.8 or Node 18 and just start the 6-month clock once all the maintained repos have been updated? |
| |
Teak Maintenance Goals, take a look at https://docs.google.com/spreadsheets/d/1wtpoypH1XOPc_G6h9AUNXJ6XiNKD6dlkMP3lubdpE9I/edit?gid=195838733#gid=195838733 | | Next time |
edx-platform Specific Conversations |
Celery sharing | | See https://github.com/edx/configuration/pull/68 I think we’ve proven empirically that the issue is as follows (this is not captured well by our docs yet, so that could cause some confusion w.r.t. state of actual resolution): We were running with celery mingle enabled (b/c its enabled by default). Mingle means that, on worker startup (including restarts), each worker asks about the state of every other worker bound to the broker (redis). Every edx python IDA that uses celery used a single broker (the legacy redis cluster). edxapp was running with 30 worker instances each, and each one of those runs around 14 parent celery worker processes.
The confluence of these three things kicked off a “connection storm” in redis, causing massive amounts of (duplicated) task data to be sent out over the network to every worker, which caused us to pin the redis engine CPU at 100%, and blocked all workers from processing tasks from any queue. The way we proved this empirically - during deploys (i.e. when we bring up a larger number of new worker processes), look at the following: The number of “sync with” celery logs eminating from the celery workers. The total network out from redis to the workers. Redis engine CPU utilization Redis new and current network connection counts.
In the bad state, all three of these metrics spiked and stayed elevated for quite some time. When mingle was disabled (on stage), none of them spiked.
|
Config overrides and YAML | | |
Toggle annotations and DEPR | | |