Role Description
We're looking for a Platform Engineer who thrives at the intersection of reliability, security, and developer productivity. You'll be a core contributor to our infrastructure, owning the systems that keep August Health fast, secure, and resilient as we scale. This is a high-autonomy, high-impact role. You'll work closely with our engineering team to shape how we build, deploy, and operate software, with real influence over architecture decisions and engineering culture.
- Infrastructure as code: managing and evolving our AWS infrastructure using Pulumi, with a focus on reliability, cost efficiency, and maintainability
- Kubernetes platform: operating and improving our K8s clusters, including workload scheduling, resource management, networking, and observability
- CI/CD pipelines: owning and optimizing our GitHub Actions workflows to keep builds fast, feedback tight, and deployments safe
- Security & compliance: hardening our infrastructure posture, supporting audit readiness, and implementing controls that meet the requirements of operating in healthcare
- Data pipeline infrastructure: supporting the reliable operation of our data engineering workflows
- LLM tooling: deploying and maintaining prompt tracing, evaluation, and observability tools as we integrate AI capabilities into our product
- Network & access: managing secure, zero-trust connectivity via Tailscale across our distributed infrastructure
- Disaster recovery & incident response: designing, documenting, and regularly testing DR/IR processes so we're always ready
Qualifications
- Strong hands-on experience with AWS, particularly EKS, Cognito, Aurora, RDS, Lambda, and VPC; you can make smart tradeoff decisions across services and know when to reach for each
- Proficiency with Kubernetes in production: you've operated clusters at scale and know how to debug when things go wrong
- Experience with infrastructure as code, ideally Pulumi or a similar tool (Terraform, CDK)
- Comfort with GitHub Actions or similar CI/CD systems: you've built and optimized pipelines, not just used them
- A security-minded approach: you think about least privilege, secrets management, and compliance by default; experience working toward or maintaining SOC 2 and/or HIPAA compliance is important, not just a nice-to-have
- Solid observability experience: you're comfortable with Prometheus, have instrumented backend services before, and can look at an existing metrics setup and form a point of view on what's missing or misleading
- Familiarity with data pipeline infrastructure, including tools like Snowflake and Apache NiFi
- Strong communication skills: you can explain infrastructure decisions to non-infrastructure engineers, and you write good documentation
- Self-direction: you can identify what needs doing, prioritize well, and drive projects to completion without heavy oversight
Requirements
- Experience with Tailscale or other zero-trust networking tools
- Ability to read and write backend code in Scala or Java; the ideal candidate is comfortable enough to review or contribute to application code, not just the infrastructure around it
- CKA (Certified Kubernetes Administrator) certification from the Linux Foundation
- Prior work in a healthcare or other regulated industry, with hands-on experience maintaining SOC 2 or HIPAA compliance
- Experience deploying LLM observability or evaluation tooling (e.g., Langfuse, Phoenix, Helicone, or similar)
Benefits
- Market-competitive compensation based on experience and ability, including significant equity option grants
- 100% company-paid premiums for health, dental, and vision coverage
- Company contributions to your HSA
- 2% 401(k) match
- Support for physical and mental health through services like Rightway Health Advocacy and Spring Health Mental Wellness
- Flexible time off policy
- 100% paid family leave
- All-expenses-paid, in-person company offsites twice a year