[Hiring] Site Reliability Specialist (Observability & Kubernetes) @Everbridge
Site Reliability Specialist (Observability & Kubernetes) @Everbridge
Software Development
Salary $118,700 - $145..
Remote Location
🇺🇸 USA Only
Employment Type full-time
Posted 2d ago

[Hiring] Site Reliability Specialist (Observability & Kubernetes) @Everbridge

2d ago - Everbridge is hiring a remote Site Reliability Specialist (Observability & Kubernetes). 💸 Salary: $118,700 - $145,000 📍Location: USA

Role Description

At Everbridge, we’re building a resilient, scalable, and secure cloud platform that powers critical services used around the world. We’re looking for a Senior Platform Site Reliability Specialist to own, operate, and evolve our enterprise observability platform.

In this role, you will be responsible for the up-keep, reliability, scalability, and strategic growth of Everbridge’s observability stack, EKS, and supporting services, ensuring our engineering teams have deep visibility into system health, performance, and reliability across a large-scale, cloud-native environment. You will also be working with other cloud technologies within the AWS and GCP areas.

Who we are looking for:

  • Someone who shows up for the team, not just themselves.
  • Communicates clearly and collaborates easily.
  • Treats interactions with other teams with respect and professionalism.
  • Comfortable being involved, offering support, and helping move work forward without ego.
  • Values building trust, keeping things running smoothly, and making the teams around them better.

What you'll do:

  • Observability Platform Ownership
    • Head the design, operation, and evolution of Everbridge’s observability stack.
    • Build and maintain a highly available, scalable observability platform.
    • Standardize instrumentation, dashboards, alerts, and SLOs.
    • Support incident response, root cause analysis, and capacity planning.
  • Grafana Stack & Telemetry
    • Operate and scale Grafana and technology:
    • Grafana Loki (logs)
    • Grafana Mimir (metrics)
    • Grafana Tempo (tracing)
    • Grafana Alerting
  • Kubernetes
    • Maintain reliability and security of EKS clusters running observability.
    • Manage cluster lifecycle and upgrades.
  • Infrastructure as Code & Automation
    • Terraform for infrastructure provisioning.
    • HashiCorp Packer.
    • Gitlab CI/CD at Scale.

Qualifications

  • 6+ years in SRE / Platform Engineering.
  • Strong Grafana ecosystem experience.
  • Kubernetes and Amazon EKS expertise.
  • Terraform proficiency.

Preferred Qualifications

  • OpenTelemetry experience.
  • Large-scale observability systems.
  • Cost optimization experience.

Benefits

  • Healthcare.
  • Dental.
  • Parental planning.
  • Mental health benefits.
  • Disability income benefits.
  • Life and AD&D insurance.
  • 401(k) plan and match.
  • Paid time off.
  • Fitness reimbursements.

Salary

The reasonably estimated salary for this role at Everbridge ranges from $118,700 - $145,000 and may also include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience.

Fair Chance Statement US & Canada

We are committed to providing equal employment opportunities in compliance with all applicable Federal, Provincial/State and Local laws, including the California Fair Chance Act and any local County Fair Chance Ordinance (or local equivalent). Pursuant to these and other relevant regulations, we consider qualified applicants with criminal histories in a manner consistent with the law.

  • Access to sensitive or confidential information, such as financial records, proprietary data, or client information.
  • Management of cash, company funds, or other valuable assets.
  • Work in environments requiring heightened security measures.
  • Compliance with contractual or regulatory requirements specific to the position.

We evaluate each applicant's criminal history individually, considering its nature, timing, and relevance to the specific job duties, while maintaining our commitment to fair hiring practices and promoting workplace equity.

Before You Apply
🇺🇸 Be aware of the location restriction for this remote position: USA Only
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Site Reliability Specialist (Observability & Kubernetes) @Everbridge
Software Development
Salary $118,700 - $145..
Remote Location
🇺🇸 USA Only
Employment Type full-time
Posted 2d ago
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Unlock 150,000+ Remote Jobs
🇺🇸 Be aware of the location restriction for this remote position: USA Only
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Unlock 150,000+ Remote Jobs
×

Apply to the best remote jobs
before everyone else

Access 150,000+ vetted remote jobs and get daily alerts.

4.9 ★★★★★ from 500+ reviews
Unlock All Jobs Now

Maybe later