Apply
Senior Site Reliability Engineer
Bring your expertise as a Senior Site Reliability Engineer and make an immediate impact on a close-knit organization that is embarking on an exciting hyper-growth phase.
- You will own the SRE practice, leading end-to-end architecture and strategy, scaling observability.
- You will pioneer next-generation AI infrastructure, crafting robust monitoring solutions on a massive scale.
- You will champion SRE excellence and drive operational maturity in a fast-scaling environment.
What & Why:
Our client is in an exciting period of hyper-growth as their team expands its AI infrastructure. This is a newly created position to support their AI infrastructure.
Who:
Our client is a leading AI infrastructure organization, and they are committed to an environmentally responsible approach to powering technology. They have developed state-of-the-art data centers powered entirely by renewable energy.
This hybrid role (3 days per week on-site) is based in downtown Vancouver. They have a start-up culture that values inclusiveness and a proactive approach to overcoming new challenges.
You:
You will bring the following education, skills and experience to the role:
- Must be able to work independently and navigate the challenges of a fast-paced start-up environment, focusing on finding solutions rather than just identifying problems.
- Senior-level experience SRE, DevOps, or similar roles.
- Experience owning the architecture, scalability, and operations of an observability platform in a large-scale environment.
- Expertise in Prometheus, Grafana, Kubernetes and Linux.
- Go or Python proficiency.
Compensation:
- $135,000 to $150,000 + competitive incentive program.
Next Steps:
If the sound of this opportunity excites you, and you’re confident that it’s a good fit for your experience and career goals, then we’d love to hear from you! Please send your updated resume to us by applying to this posting and one of our awesome team of recruiters will be in touch.