[Hiring] Staff Site Reliability Engineer I @Remote
Staff Site Reliability Engineer I @Remote
Software Development
Salary usd 188,550 - 2..
Remote Location
Employment Type full-time
Posted 2d ago

[Hiring] Staff Site Reliability Engineer I @Remote

2d ago - Remote is hiring a remote Staff Site Reliability Engineer I. πŸ’Έ Salary: usd 188,550 - 212,150 per year πŸ“Location: Worldwide

Role Description

As a Staff SRE at Remote, you will own the technical direction of our SRE platform, shaping its architecture, reliability strategy, and long-term evolution. This is a leadership role as much as a technical one:

  • Drive platform-wide initiatives.
  • Set the reliability bar for engineering teams across the organization.
  • Be a force multiplier for the engineers around you.

A key part of this role is identifying and leading opportunities to leverage AI:

  • Reduce operational toil.
  • Enable engineering teams to build, ship, and operate software more effectively.

You will work with a high degree of autonomy, translating technical risks into business impact and aligning with Engineering Managers, Team Leads, and Product teams to ensure reliability and engineering efficiency are built into everything we do.

Qualifications

  • 8+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering.
  • Deep expertise in Kubernetes: operating, designing, and scaling production clusters.
  • Proven experience designing and managing cloud infrastructure on AWS (or other cloud providers) at scale.
  • Strong infrastructure-as-code practice with Terraform.
  • Experience defining and operating reliability frameworks: SLOs, SLIs, error budgets, alerting strategies.
  • Solid observability background: Datadog, Grafana/Prometheus, or similar.
  • Proficiency with CI/CD platforms (GitLab CI, GitHub Actions, or similar) and deployment automation.
  • Comfortable with Bash and scripting for automation; broader programming skills are a plus.
  • Experience with container tooling (Docker) and the broader ecosystem around it.
  • Curiosity and practical experience applying AI tools to infrastructure, operations, or developer tooling.

Requirements

  • Proven track record of driving platform-wide technical initiatives and influencing engineering direction without formal authority.
  • Strong communicator: able to tailor messaging to technical and non-technical audiences, write clearly, and align stakeholders across teams.
  • Self-directed: able to identify what needs attention, define the path forward, and execute with minimal supervision.
  • Experience mentoring senior engineers and creating space for others to lead and grow.
  • Comfortable navigating ambiguity, translating vague requirements into concrete solutions.
  • Approaches technical problems with a business lens, understands the cost and value of engineering decisions.

Key Responsibilities

  • Own the technical direction of Remote's SRE/Platform domain, its architecture, tooling, and long-term roadmap.
  • Define and drive the reliability strategy across the platform: SLOs/SLIs, error budgets, observability, and incident management maturity.
  • Lead complex, cross-team infrastructure initiatives from discovery through delivery, delegating effectively and keeping projects aligned with business goals.
  • Identify and lead AI enablement initiatives across the engineering organization.
  • Drive AI-powered automation for platform operations: intelligent alerting, automated incident triage, self-healing infrastructure, and AI-assisted runbooks.
  • Contribute to capacity planning and cost-efficiency of Remote's infrastructure.
  • Mentor senior engineers, raising the technical bar through code reviews, design feedback, and hands-on guidance.
  • Collaborate with the Security team on platform hardening, threat mitigation, and compliance.
  • Be a steward of engineering quality across the SRE team, championing best practices, managing technical debt deliberately, and raising standards over time.
  • Contribute to hiring, onboarding, and continuously improving how the SRE team operates.

Benefits

  • Work from anywhere.
  • Flexible paid time off.
  • Flexible working hours (we are async).
  • 16 weeks paid parental leave.
  • Mental health support services.
  • Stock options.
  • Learning budget.
  • Home office budget & IT equipment.
  • Budget for local in-person social events or co-working spaces.
Before You Apply
️
worldwide Be aware of the location restriction for this remote position: Worldwide
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Staff Site Reliability Engineer I @Remote
Software Development
Salary usd 188,550 - 2..
Remote Location
Employment Type full-time
Posted 2d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 165,000+ Remote Jobs
️
worldwide Be aware of the location restriction for this remote position: Worldwide
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 165,000+ Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 165,000+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later