Learn to grow and improve your Site Reliability Engineering team with less trial-and-error
Get started with a free 7-day course covering SRE culture patterns delivered to your inbox.
Our content has been featured on:
SREpath proudly advises 135+ Site Reliability Engineers and Managers
CRUFORM: Guidance to grow and enhance SRE teams
No need to snoop around FAANG* HQs to find the keys to good SRE
* FAANG = Facebook, Apple, Amazon, Netflix and Google
Hi. I’m Ash 👋🏽
I have learned over the years that well put together teams are the backbone of modern tech work.
This could not be more true for a space like SRE, where the work is broad, complex and ambiguous.
My goal with SREpath is to help more teams thrive in such situations.
Storytime. It was 2015, and I was a founding member of a startup gaining traction in the financial sector.
I was in line to get a hefty payday if we continued to win over clients. I’m not crazy rich right now, so what do you think happened? 😅
Long story cut short: our features were great, amazing even.
We had a proven zero-knowledge security solution way back then! Our webRTC conferencing feature was solid too.
But our service reliability was not so great. Our clients lost faith and we lost referrals because of it.
Who knew that end-users cared so much about uptime and performance?
We knew this fact but couldn’t do the right things fast enough.
If only we had a little more time and skill to plan out site reliability and the team that would make it happen.
Right now, I could’ve been swimming in money like Scrooge McDu…
After this failure, I did a lot of soul searching and ran a postmortem.
Many of the issues came down to the team that would’ve handled site reliability issues.
Experienced engineers got headhunted, newer engineers got lost in the fray, our capacity planning capability was non-existent, and incident response capability was overstretched.
There’s more, but you get the picture.
The funny thing is that I’d been an Operations Manager in a traditional org before this startup. I ran teams for a living!
But I lost sight of the big picture in the fast-paced fray of startup/tech work life. I have a feeling that others end up doing the same. This site is for you.
My research into the work of teams, and more specifically, SRE teams, aims to bring clarity and focus to your efforts.
I’ve already spent 100s if not 1000s of hours researching and writing on observability, incident management, K8s, team structuring and more.
Some of my greatest hits (on Reddit and HN) include exploring Netflix’s SRE practice, describing runbooks and even a push for psychological safety in how we word SRE terms.
The idea here is to distil my key findings into easy-to-digest content, which will slowly pour through this website’s newsletter.
It’s being made for your benefit and your team’s.
What’s in it for me? It helps me feel like I’m making an impact. And maybe one day, I could turn this passion project into my full-time gig.
So what are you waiting for? Give me 7 days and I will give you a new level of clarity on Site Reliability culture.
Email subscription also gains you access to:
– Future updates like 🔎 SRE capability reviews
– Our Notion page for a sneak peek at content in the works
Whether you decide to learn more about SRE through SREpath or not, I wish you the very best for your team’s journey.
P.S. You’re awesome for taking the time to explore ways to improve how you do the work. Few people put their minds toward efforts that can pay tremendous dividends.