[Hiring] Site Reliability Engineer @Referral Board
Site Reliability Engineer @Referral Board
All Others
Salary usd 143,100 - 1..
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Employment Type full-time
Posted 1wk ago

[Hiring] Site Reliability Engineer @Referral Board

1wk ago - Referral Board is hiring a remote Site Reliability Engineer. πŸ’Έ Salary: usd 143,100 - 175,000 per year πŸ“Location: USA

Role Description

We are Cloud Infrastructure SREs that integrate, scale, and evolve multi-cloud infrastructure across 4 Cloud Service Providers, 70+ globally distributed regions, and tens of thousands of hosts to power Elastic Cloud. We tackle hard problems at scale through automation, Infrastructure as Code (IaC), configuration management, and purpose-built software that eliminates toil and improves reliability.

If that scale of challenge genuinely excites you, we'd love to hear from you.

What you will be doing

  • Engineering software to automate large-scale systems β€” building internal tools and services, not just running scripts.
  • Optimizing the reliability and lifecycle of hosts across multiple cloud providers.
  • Strengthening our observability posture β€” crafting alerting and monitoring systems that drive incident prevention over incident response.
  • Scaling global infrastructure and evolving the infrastructure management processes to meet growing demand.
  • Contributing to code reviews, sharing your work, planning what we need to do next, and mentoring teammates.
  • Being part of a balanced SRE on-call rotation: responding to incidents, improving runbooks, leading postmortems, and championing reliability improvements.

Qualifications

  • Experience building software with Golang. You are also comfortable reviewing others' code and have opinions about what good code looks like.
  • Production experience operating large-scale cloud compute (hundreds of hosts or more) via automated workflows.
  • Deep experience with Linux systems β€” you are at home in the terminal debugging at the OS level.
  • Proficiency working with containerized workloads in production.
  • A customer-first, systems-thinking approach to operational problems β€” you care about root causes, not just symptoms.
  • Comfortable working across time zones in both real-time and asynchronous contexts.
  • You write clear and maintainable documentation such as software designs, runbooks, architecture diagrams/decisions, postmortems, etc.
  • You communicate project status regularly and clearly, flag blockers early, and follow through on action items.
  • A sensible approach to AI integration β€” identifying where AI tools genuinely reduce operational burden and embedding them into workflows without adding complexity.

Bonus Points

  • Production experience with any of: Terraform, Puppet, Ansible, Argo CD, Argo Workflows, CUE, Docker, Kubernetes, Ubuntu, or Ubuntu Live Patch.
  • Experience being on-call during incidents and using observability tools (e.g. Elastic Stack, Graphite, Prometheus, Influx) to diagnose issues, quantify impact, and confirm mitigations.
  • Hands-on experience engineering solutions with the Elastic Stack.

Compensation

Compensation for this role is in the form of base salary. This role does not have a variable compensation component.

The typical starting salary range for new hires in this role is:

  • $143,100 β€” $175,000 USD

The typical starting salary range for this role in select locations (including Seattle WA, Los Angeles CA, the San Francisco Bay Area CA, and the New York City Metro Area) is:

  • $143,100 β€” $175,000 USD

Benefits

  • Competitive pay based on the work you do here and not your previous salary.
  • Health coverage for you and your family in many locations.
  • Ability to craft your calendar with flexible locations and schedules for many roles.
  • Generous number of vacation days each year.
  • Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service.
  • Up to 40 hours each year to use toward volunteer projects you love.
  • Embracing parenthood with a minimum of 16 weeks of parental leave.

Additional Information

As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life.

We strive to have parity of benefits across regions and while regulations differ from place to place, we believe taking care of our people is the right thing to do.

Different people approach problems differently. We need that. Elastic is an equal opportunity/affirmative action employer committed to diversity, equity, and inclusion.

We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all individuals.

Before You Apply
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Site Reliability Engineer @Referral Board
All Others
Salary usd 143,100 - 1..
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Employment Type full-time
Posted 1wk ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 155,000+ Remote Jobs
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 155,000+ Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 155,000+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later