Staff DevOps Engineer @Cast & Crew
DevOps / Sysadmin
Salary usd 190,000 - 2..
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Job Type full-time
Posted 2d ago

[Hiring] Staff DevOps Engineer @Cast & Crew

2d ago - Cast & Crew is hiring a remote Staff DevOps Engineer. πŸ’Έ Salary: usd 190,000 - 235,000 per year πŸ“Location: USA

Role Description

We are looking for a Staff DevOps Engineer to serve as a technical anchor for our platform engineering practice. In this role you will own the design and evolution of our CI/CD pipelines, Kubernetes infrastructure on AWS EKS, and the developer experience tooling that hundreds of engineers depend on daily. Staff-level engineers at this organization are expected to operate with significant autonomy, identify and resolve systemic problems before they become incidents, and raise the technical bar across the teams they partner with.

What You’ll Do

  • Platform & Infrastructure
    • Architect and continuously improve CI/CD pipelines in Azure DevOps, including pipeline-as-code standards, templating strategies, and artifact promotion workflows across environments.
    • Own the health and evolution of our AWS EKS clusters β€” node lifecycle, autoscaling, networking (VPC/CNI), RBAC, and cluster upgrades with minimal service disruption.
    • Design and enforce Infrastructure-as-Code practices using Terraform or equivalent tooling; champion GitOps patterns across engineering teams.
    • Drive platform reliability improvements informed by observability data from New Relic, working closely with SRE to translate dashboards and alerts into actionable platform changes.
  • Developer Experience
    • Define and maintain golden-path templates for containerized workloads β€” Dockerfile standards, Helm chart libraries, and local development parity with production.
    • Partner with engineering teams to accelerate onboarding of new services onto the platform and reduce toil through automation.
  • Incident & Operational Excellence
    • Act as an escalation point for complex infrastructure incidents coordinated through PagerDuty; participate in on-call rotation and lead post-incident reviews for platform-layer failures.
    • Identify recurring failure modes and drive systemic fixes that reduce page volume and MTTR across the platform.
    • Maintain and improve runbooks and platform documentation in Confluence, ensuring knowledge is accessible and current.
  • Technical Leadership
    • Define and socialize DevOps standards β€” pipeline design, container hygiene, secret management, and deployment safety β€” across a multi-team engineering organization.
    • Conduct architecture reviews and provide technical guidance on infrastructure-impacting decisions made by product engineering teams.
    • Mentor senior and mid-level engineers; grow internal platform capability through pairing, code review, and structured knowledge sharing.
    • Identify tooling gaps and build the business case for platform investments, working with engineering leadership to prioritize roadmap items.

Qualifications

  • 8+ years of DevOps or platform engineering experience, with at least 2 years operating at a Staff or Principal level in an organization of 100+ engineers.
  • Deep, hands-on expertise with Kubernetes β€” EKS specifically preferred β€” including troubleshooting workloads, networking, storage, and cluster operations at scale.
  • Strong command of Azure DevOps Pipelines, including YAML pipeline authoring, library management, service connections, and environment promotion gates.
  • Proven track record designing and maintaining CI/CD systems for microservice architectures with multiple independent teams as consumers.
  • Experience operating observability platforms (New Relic, Datadog, or similar) to drive proactive reliability improvements, not just reactive alerting.
  • Proficiency in at least one scripting language (Python, Bash, or Go) and Infrastructure-as-Code tooling (Terraform, Pulumi, or CDK).
  • Familiarity with feature flag patterns and operational considerations around progressive delivery (Unleash or equivalent is a plus).
  • Excellent written communication skills β€” you default to documentation and can translate complex infrastructure decisions into guidance engineers actually read.

Nice to Have

  • Experience with data engineering or ML infrastructure workloads on Kubernetes (Spark on EKS, Argo Workflows, Airflow).
  • Background contributing to or maintaining internal developer portals (Backstage or similar).
  • Familiarity with FinOps practices and tooling for AWS cost attribution and optimization across shared Kubernetes clusters.
  • Experience in SRE-adjacent roles; comfort with SLO/SLI definition and error budget policy.

Benefits

  • Comprehensive package of employee benefits including: Medical, Dental, Vision, PTO, health and wellness programs, employee discounts, and more!
Before You Apply
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Staff DevOps Engineer @Cast & Crew
DevOps / Sysadmin
Salary usd 190,000 - 2..
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Job Type full-time
Posted 2d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs