| Topics | |
|---|---|
| Service Overview | What is it, who uses it, where does it fit in overall |
| Technical Architecture | Overview, upstream dependencies, sub-components |
| Development Process | Source control, external dependencies, build, test, tools |
| Change Management / Deployment | Process, technology, cadence, gates, rollback |
| Configuration Management | Process, technology, source control |
| Demand Forecasting, Capacity Management | How do you shift load, or scale? How do you load test? Can you shed load? |
| SLAs, SLI, SLOs, KPIs, etc. | What are your targets? Are you meeting them? |
| Monitoring, Logging, Diagnostics, Tickets | How do you monitor, diagnose? How noisy? |
| Incident Response, production playbook, disaster recovery, backup/restore | How do you respond to issues? What is your waste case plan? Do you use it regularly? |
| Review of Past Outages, War Stories | What has gone wrong previously? How was it fixed? |
Created
June 14, 2017 08:10
-
-
Save dastergon/61f5f4c7994f29c515991419a3d7878c to your computer and use it in GitHub Desktop.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment