Site Reliability Engineer — On-call & Incident Response
Quick facts
About the role
Join a 4-engineer SRE rotation supporting a top-100 SaaS platform. You'll be in the rotation, write postmortems people actually read, and own the runbook library. Incident frequency is low (3-4 SEV-2s/month) but the bar on response quality is high — customers measure us by the speed and clarity of every status update.
What you’ll do
- Take a primary on-call slot in the 4-engineer rotation
- Drive the postmortem culture — templates, follow-up tracking, customer-facing summaries
- Curate + maintain the runbook library
- Run the quarterly chaos engineering drills
- Partner with product on the customer-trust narrative
What we’re looking for
- Production on-call experience at a service customers depend on
- Has authored postmortems read beyond their immediate team
- Fluent in at least one observability stack (Datadog, Honeycomb, OTel + Grafana)
- Comfortable in Kubernetes troubleshooting under time pressure
- Calm, clear writer — incident comms is half the job
About Glimmer
Glimmer is a b2b saas company (200–500 people). Founded in 2017. Based in London, UK. They hire and pay through Loxala, so scope, milestones, and funds stay protected by escrow from the first message.
Skills required
Recommended for you.
Roles Skill Matcher thinks fit you, based on this brief and the skills it needs.
Senior Full-Stack Engineer — React + Node
Senior React Developer
NestJS Backend Engineer — Payments
Related skills in demand.
Build these to widen the roles Loxala can match you with next.
About applying to this job.
How do I apply for this role?
Do I need an account to apply?
How and when do I get paid?
Can I message the client before applying?
Is the role remote?

Think you’re a fit for this role?
Apply in minutes with Loxi and let Glimmer see your best work — profile and proposal, side by side.