Staff Site Reliability Engineer @Fabric
DevOps / Sysadmin
Salary usd 140,000 - 1..
Remote Location
🇺🇸 USA Only
Job Type full-time
Posted 2wks ago

[Hiring] Staff Site Reliability Engineer @Fabric

2wks ago - Fabric is hiring a remote Staff Site Reliability Engineer. 💸 Salary: usd 140,000 - 170,000 per year 📍Location: USA

Role Description

As a Staff Site Reliability Engineer, you will own and evolve the infrastructure powering healthcare experiences for millions of patients. This role bridges the gap between traditional infrastructure excellence and the future of AI-driven operations. You will act as a primary architect for our AWS and Kubernetes (EKS) environment, ensuring the platform is resilient, scalable, and compliant while exploring how agentic workflows can modernize SRE practices.

What You'll Do

  • Infrastructure & Kubernetes Orchestration
    • Designing, deploying, and maintaining production Kubernetes (EKS) clusters to ensure enterprise-grade availability for our users.
    • Eliminating manual configuration by building and managing a scalable infrastructure state entirely through Terraform.
    • Optimizing the AWS footprint—specifically EC2, RDS, and S3—to balance high performance with cost-efficiency and reliability.
  • AI-Assisted Operations & Automation
    • Exploring and deploying agentic workflows for AI-assisted runbooks that automate complex operational decisions and repetitive tasks.
    • Building and evolving deployment pipelines using GitHub Actions or Semaphore to ensure delivery is both rapid and safe.
    • Focusing on toil reduction by developing internal tools that replace manual operational work with intelligent, autonomous systems.
  • Observability & Incident Management
    • Driving the evolution of the observability stack in Datadog by implementing the sophisticated metrics, traces, and logs needed to meet SLOs.
    • Leading incident response efforts and facilitating the blameless postmortems that help systematically reduce recovery time (MTTR).
    • Defining and monitoring the SLIs and SLOs that ensure the platform consistently meets rigorous healthcare performance standards.
  • Compliance & Collaboration
    • Ensuring every piece of infrastructure remains fully compliant with HIPAA and other critical healthcare regulatory requirements.
    • Mentoring engineers across the company on reliability best practices and contributing a clinical-safety perspective to cross-functional design reviews.

Qualifications

  • 8+ years of experience in SRE, DevOps, or Platform roles managing production environments at scale.
  • Expert technical depth in AWS (EKS, EC2, RDS, S3) and production-grade Kubernetes management.
  • Proficiency with modern tooling including Terraform (IaC), Datadog (Observability), and CI/CD systems.
  • Deeply proficient coding and scripting skills in Python, Bash, Ruby, or Go.
  • Preferred experience building agentic workflows or AI-assisted tooling to drive operational efficiency.
  • A "rigor-first" mindset with a dedication to HIPAA-compliant, high-availability architecture.

Benefits

  • The national pay range for this role is $140,000.00 – $170,000.00 per year.
  • Actual compensation will be determined by factors such as the candidate's geographic market, experience, skills, and qualifications.
  • Certain roles may also be eligible for additional compensation, including a comprehensive benefits package such as medical, dental, vision, unlimited PTO, and a 401(k) plan, stock options, and bonuses.
  • If your compensation requirement is greater than our posted range, please still consider applying; a determination can be made based on unique qualifications.
  • Expected compensation ranges for this role may change over time.
Before You Apply
🇺🇸 Be aware of the location restriction for this remote position: USA Only
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Staff Site Reliability Engineer @Fabric
DevOps / Sysadmin
Salary usd 140,000 - 1..
Remote Location
🇺🇸 USA Only
Job Type full-time
Posted 2wks ago
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Unlock 152,720 Remote Jobs
🇺🇸 Be aware of the location restriction for this remote position: USA Only
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Unlock 152,720 Remote Jobs
×

Apply to the best remote jobs
before everyone else

Access 152,720+ vetted remote jobs and get daily alerts.

4.9 ★★★★★ from 500+ reviews
Unlock All Jobs Now

Maybe later