Learning Resources
SRE Resources
Site Reliability Engineering (SRE) is where software engineering meets operations. This is my personal curated collection of SRE-related resources that I'm constantly referring to. Whether you're just starting your SRE journey or looking to deepen your expertise, you'll certainly find something valuable here.
Books
Essential Reading
- Site Reliability Engineering: How Google Runs Production Systems by Betsy Beyer, Chris Jones, Jennifer Petoff & Niall Murphy
- The Site Reliability Workbook by Betsy Beyer, Niall Murphy, David K. Rensin, Kent Kawahara & Stephen Thorne
- Implementing Service Level Objectives by Alex Hidalgo
- Paperback and Kindle version available here.
Advanced Topics
- Database Reliability Engineering by Laine Campbell & Charity Majors
- Paperback and Kindle version available here.
- Patterns of Enterprise Application Architecture by Martins Fowler
- Paperback and Kindle version available here.
- Building Secure and Reliable Systems by Heather Adkins, Betsy Beyer, Paul Blankinship, Piotr Lewandowski, Ana Oprea, Adam Stubblefield
- Paperback and Kindle version available here.
GitHub Repositories
- Awesome SRE - github.com/dastergon/awesome-sre
Blogs & Websites
- Google SRE Blog - sre.google/blog
- First-hand insights from Google's SRE teams
- Case studies and best practices
- Netflix Technology Blog - netflixtechblog.com
- Deep dives into Netflix's reliability practices
- Particularly strong on chaos engineering and scale
- Stripe Engineering Blog - stripe.com/blog/engineering
- Excellent posts on building reliable payment systems
- Focus on operational excellence
- SRE Weekly - sreweekly.com
- Weekly compilation of great articles on various SRE topics.
- High Scalability - highscalability.com
- Case studies and first-hand insights from big-techs
Communities & Forums
Discussion Platforms
- r/sre - reddit.com/r/sre
- Active community of SRE practitioners
- Regular discussions on tools, practices, and career growth
Conferences
- SREcon - usenix.org/conferences/byname/925
- Premier conference for SRE professionals
Last updated: October 2024