[Hiring] Staff Site Reliability Engineer @SimSpace Corporation
Staff Site Reliability Engineer @SimSpace Corporation
Software Development
Salary usd 165,000 - 2..
Remote Location
🇺🇸 USA Only
Employment Type full-time
Posted 3wks ago

3wks ago - SimSpace Corporation is hiring a remote Staff Site Reliability Engineer. 💸 Salary: USD 165,000 - 230,000 per year 📍 Location: USA

Role Description

We are looking for a Staff Site Reliability Engineer to define the technical vision, lead the architecture, and secure the infrastructure that powers the SimSpace cyber range platform. The ideal candidate is a deeply experienced SRE and exceptional software engineer who thinks strategically about distributed systems, reliability, and operability at a global scale.

In this position, you'll provide overarching technical leadership across our SRE practice, bridging traditional site reliability, DevOps, and DevSecOps. You'll architect the systems and strategies that allow SimSpace to deliver software seamlessly across our own data centers, to customers who bring their own hardware, and as pre-packaged appliances with bundled hardware and software.

As our on-premises product matures and scales, you will design the long-term automation frameworks that make these varied deployments robust, secure, and repeatable.

What will you be doing as a Staff SRE at SimSpace?

  • Technical Strategy & Architecture: Design and architect the overarching infrastructure strategy that enables consistent, repeatable, and secure deployments across SimSpace-hosted data centers, customer-provided hardware, and highly restricted air-gapped environments.
  • Platform Evolution & Configuration Management: Lead the evolution of our CI/CD and Kubernetes platforms. Drive advanced application packaging, templating, and configuration management strategies using Jsonnet and Grafana Tanka (alongside Kustomize).
  • Reliability Leadership: Define, measure, and govern Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets across the engineering organization.
  • Advanced Observability: Architect our enterprise observability strategy using the Grafana stack. Design frameworks for proactive monitoring, complex anomaly detection, and distributed tracing.
  • Security & Compliance Architecture: Drive the infrastructure security posture at an architectural level. Embed advanced container security, zero-trust network segmentation, and automated compliance policies directly into our deployment pipelines and runtime environments.
  • Cross-Functional Enablement: Serve as a strategic partner and consultant to development teams. Advocate for an "SRE culture" by designing self-service tooling.
  • Incident Command: Act as an Incident Commander during complex, high-severity outages. Drive blameless post-mortems and engineer long-term fixes.
  • Mentorship & Multiplier: Act as a technical mentor to senior and mid-level engineers. Raise the baseline of engineering excellence across the company.

Qualifications

  • 8+ years of experience in Site Reliability, Platform, or DevOps engineering.
  • Deep software engineering skills (beyond scripting) and the ability to architect complex, production-quality systems.
  • Deep, architectural understanding of Kubernetes in multi-tenant and multi-cluster production environments.
  • Extensive experience architecting sophisticated CI/CD pipelines and GitOps workflows.
  • Systems-level thinking with the ability to design architectures that span self-hosted, on-premises, VMware-based, and air-gapped deployment models.
  • Deep expertise with observability platforms (Grafana stack preferred).
  • Strong background in infrastructure security architecture.
  • Exceptional communication and stakeholder management skills.

Requirements

  • Language agnostic, but highly proficient in at least one modern language (e.g., Go, Python).
  • Expert-level knowledge of Jsonnet and Grafana Tanka for managing complex, scalable Kubernetes configurations.
  • Proven ability to design alerting and monitoring strategies for complex distributed systems.
  • Ability to influence cross-functional leadership and negotiate reliability tradeoffs.

Benefits

  • Base salary range: $165,000 - $230,000 with opportunities for bonuses.
  • Comprehensive medical, dental, and vision benefits, plus savings plans; coverage starts on day one!
  • Access to company-paid counseling, coaching, and resources for mental health support.
  • 401(k) retirement savings plan featuring a company match.
  • Unlimited vacation and dedicated health & wellness days.
  • Paid parental leave plans.
  • Equity stock options at hire, with annual performance-based grants.
  • Referral rewards for qualified hires through our employee referral program.
  • Fully and partially subsidized membership plans and equipment discounts for fitness goals.
  • Access to a LinkedIn Learning membership for personal and professional development.
  • Monthly reimbursements for meaningful connections with teammates.
  • Legal plan coverage, pet insurance, wellness reimbursements, and more.