Operations

Moderate

In this module, you'll learn the discipline of change management, automated techniques for distributing your product, and ways to monitor its health.

[Image: NASA control room]

Topics

Managing Releases

With this topic, you'll learn about change management and how to release new versions of an application with as little downtime as possible.

Learning Outcomes

  • Describe what release management is
  • Create a release checklist with release and rollback steps
  • Identify which approvals you'd need before releasing a change
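
As a rough illustration of a release checklist with rollback steps, the sketch below pairs each release action with the action that undoes it, then rolls back completed steps in reverse order if any step fails. The names `Step` and `run_release` are hypothetical, chosen for this example only; a real checklist would also cover approvals and communication, which this sketch leaves out.

```python
from typing import Callable, List, Tuple

# Hypothetical structure: (step name, release action, rollback action).
Step = Tuple[str, Callable[[], None], Callable[[], None]]


def run_release(steps: List[Step]) -> List[str]:
    """Run release steps in order; on failure, roll back completed steps in reverse."""
    log: List[str] = []
    done: List[Step] = []
    for step in steps:
        name, release, _rollback = step
        try:
            release()
        except Exception as exc:
            log.append(f"FAILED {name}: {exc}")
            # Undo only the steps that finished, most recent first.
            for prev_name, _release, prev_rollback in reversed(done):
                prev_rollback()
                log.append(f"rolled back {prev_name}")
            return log
        done.append(step)
        log.append(f"released {name}")
    return log
```

Writing the rollback action next to each release action makes it harder to ship a step that nobody knows how to undo.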

Resources

Exercises


Managing Incidents

With this topic, you'll learn to keep calm under the stress of resolving production application incidents.

Learning Outcomes

  • Describe what incident management is
  • Describe what a runbook is
  • Explain what a service level objective (SLO) is
  • Explain what a service level agreement (SLA) is
  • Create a runbook with step-by-step instructions to solve a common problem
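
To make the idea of an SLO more concrete, here is a minimal sketch that converts an availability target into an "error budget", meaning the amount of downtime the target tolerates over a time window. The function names and the 30-day default window are assumptions made for illustration; teams define their own targets and windows.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Minutes of downtime an availability SLO allows over the window.

    slo_target is a fraction, e.g. 0.999 for "99.9% available".
    """
    if not 0.0 < slo_target <= 1.0:
        raise ValueError("slo_target must be a fraction in (0, 1]")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, downtime_minutes: float,
                     window_days: int = 30) -> float:
    """Budget left after the downtime already observed (negative means the SLO was missed)."""
    return error_budget_minutes(slo_target, window_days) - downtime_minutes
```

For example, a 99.9% target over 30 days allows roughly 43 minutes of downtime, so a single 30-minute incident would consume most of that budget.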

Resources

Exercises


Continuous Integration and Deployment

With this topic, you'll learn to automate the release process using continuous integration and deployment (CI/CD) pipelines.

Learning Outcomes

  • Describe what a CI/CD pipeline is
  • Create a CI pipeline to run automated tests and compile build artifacts
  • Create a CD pipeline to deploy build artifacts for development, staging, and production environments

Resources

Exercises


Performance Testing

With this topic, you'll learn how to verify the performance of an application using load tests, smoke tests, and black-box tests.

Learning Outcomes

  • Describe what performance testing is
  • Explain how you might create a performance test for an application
  • Integrate performance testing into your CI/CD pipeline
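
One way to picture a simple load test is sketched below: it calls a target function many times from a thread pool, records each latency, and checks the 95th percentile against a budget. Everything here is an assumption for illustration, including the names `run_load_test`, `percentile`, and `within_budget`; dedicated load-testing tools offer far more than this standard-library sketch.

```python
import math
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def timed_call(target: Callable[[], object]) -> float:
    """Return how long one call to target takes, in seconds."""
    start = time.perf_counter()
    target()
    return time.perf_counter() - start


def run_load_test(target: Callable[[], object],
                  total_requests: int = 100,
                  concurrency: int = 10) -> List[float]:
    """Call target total_requests times using up to `concurrency` threads."""
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        return list(pool.map(lambda _: timed_call(target), range(total_requests)))


def percentile(samples: List[float], pct: float) -> float:
    """Nearest-rank percentile of samples, with pct between 0 and 100."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def within_budget(latencies: List[float], p95_budget_seconds: float) -> bool:
    """True when the 95th-percentile latency meets the budget."""
    return percentile(latencies, 95) <= p95_budget_seconds
```

In a CI/CD pipeline, a script could exit with a non-zero status when `within_budget` returns `False`, which would fail the build before a slow change is deployed.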

Resources

Exercises


Working with Site Reliability Engineers

With this topic, you'll learn about the role of a site reliability engineer and how to work with them on your team effectively.

Learning Outcomes

  • Describe what toil is
  • Describe the role of a site reliability engineer
  • Identify what you need from a site reliability engineer to complete software engineering tasks
  • Explain how you might collaborate with a site reliability engineer to maintain resilience for an application

Resources

Exercises