Troubleshooting Guide
Version: 0.1 (Draft)
Date: 2025-08-08
Owner: SRE
General Approach
- Check dashboards and logs with traceId; identify blast radius; rollback if needed.
Common Issues
- Deployment failures; DB connection pool saturation; queue backlogs; auth errors.
Playbooks
- Scale out workers; drain DLQs; rotate credentials; revert config flags.
References
- See: test-environment-setup-guide.md, ci-cd-pipeline-guide.md