Elasticsearch Ownership and Rollout

Context

See https://openedx.atlassian.net/wiki/spaces/COMM/pages/1413810945

Runbook

https://openedx.atlassian.net/wiki/spaces/COMM/pages/1741455546

Effort Coordination

This blended project upgrades our outdated and unsupported Elasticsearch from version 1.5 to version 7 in each of our services. We will need a resource from each owning squad to do a final review and deploy of the Elasticsearch upgrade for their owning service. The expected effort is:

  • Squad engineer’s review of PR: 2-5 hours depending on the complexity of the service’s use of Elasticsearch (PR will already be reviewed carefully by @Diana Huang or @Michael Terry (Deactivated))

  • Squad engineer’s deploy and monitoring of the migration: 2-3 days depending on production rollout

Service/Repo

Owning Team

PRs

Current Blocker

Status

Service/Repo

Owning Team

PRs

Current Blocker

Status

terraform

Arch-BOM

 

@Tim McCormack working on updating ES clusters to ES 7.8.

devstack

SRE / Arch-BOM

https://github.com/edx/devstack/pull/611

 

  • notes

  • comments-service

  • edx-platform

  • course-discovery

  • analytics

edx-notes-api

SRE
#site-reliability-engineering

Contact:@Fred Smith (Deactivated)
ARCH-BOM: @Diana Huang

indexing on prod caused errors due to malformed tags - checking to see if we can/should fix this.

  • PR review

  • PR merge

  • Deploy to Stage

  • Deploy to Edge

  • Deploy to Prod (target week: Sep 21st)

course-discovery

Engage (Always Available)
#discovery

Contact: @Michael Terry (Deactivated) ARCH-BOM: @Feanil Patel

- Merged and reverted on Stage.

Waiting for new PR with original code + fixes.

Fixes code:

  • RG working with @Feanil Patel and @Michael Terry (Deactivated) to resolve search query discrepancies.

  • PR review

  • Manual test

  • PR merge

  • Devstack PR merge

  • Deploy and Test on Stage

  • Deploy to Prod (week: Jan 4th)

edx-search

TNL
#tnl-squad

Contact: @sburch (Deactivated)
ARCH-BOM: @Tim McCormack

 

 

  • PR review

  • Test confidence

  • edx-search PR merge

  • edx-platform PR merge

  • Devstack PR merge

  • Deploy and Test on Stage

  • Deploy to Prod (Actual week: Nov 16)

cs_comments_service

TNL
#tnl-squad

Contact: @David Joy (Deactivated)
ARCH-BOM: @Robert Raposa

 

  • PR review

  • Test confidence

  • PR merge

  • Devstack PR merge

  • Deploy and Test on Stage

  • Deploy to Prod (target week: Oct 27th)

edx-analytics-data-api
edx-analytics-pipeline

Data Engineering
#data-engineering

Contact: @Stuart Young (Deactivated)
ARCH-BOM: @Diana Huang

@Stuart Young (Deactivated) review and testing

  • PR review

  • Manual test

  • PR merge

  • Devstack PR merge

  • Deploy and Test on Stage

  • Deploy to Prod (target week: Nov)