Senior Site Reliability Engineer @Junipersquare

[Hiring] Senior Site Reliability Engineer @Junipersquare

Apr 04, 2025 - Junipersquare is hiring a remote Senior Site Reliability Engineer. đź’¸ Salary: $140,000 - $185,000 usd. đź“ŤLocation: USA, UK, Canada, India, Luxembourg.

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We are looking for a Senior Site Reliability Engineer (SRE) to join our team and help scale, secure, and improve our cloud infrastructure. In this role, you will work with modern cloud-native technologies, automate infrastructure management, and enhance system reliability. You will collaborate closely with software engineers and the platform team to build and maintain self-service tools that empower development teams while ensuring the reliability and scalability of our services.

This role requires a high degree of ownership, a bias for action, and a problem-solving mindset. If you are someone who naturally seeks out inefficiencies, takes the initiative to fix them, and enjoys building scalable systems, we want to hear from you.

  • Own reliability and scalability initiatives—identify, prioritize, and implement solutions before issues escalate.
  • Participate in an on-call rotation, responding to incidents, performing root cause analysis, and driving long-term fixes.
  • Design, deploy, and manage Kubernetes clusters using Helm charts, Cilium, and Karpenter to optimize performance and cost.
  • Architect and maintain AWS infrastructure with a focus on RDS/Aurora PostgreSQL, networking, and scaling best practices.
  • Implement GitHub Actions CI/CD pipelines, integrating security best practices and automation.
  • Define and enforce policy-based security for Kubernetes using Kyverno.
  • Automate infrastructure provisioning with Crossplane and Terraform to ensure consistency and scalability.
  • Enhance observability and monitoring using Datadog to proactively detect and resolve issues.
  • Improve security and reliability by identifying risks in CI/CD, cloud environments, and Kubernetes, then implementing necessary safeguards.
  • Lead post-incident reviews, drive lessons learned into long-term improvements, and document best practices in Confluence.

Qualifications

  • 5+ years of experience in SRE, DevOps, or Infrastructure Engineering with a proven track record of ownership and initiative.
  • Strong experience with Kubernetes, Helm, and CNIs, including networking and security.
  • Proficiency in AWS services such as RDS, Aurora, IAM, VPC, EKS, and EC2.
  • Experience in PostgreSQL administration, including performance tuning and high availability in RDS/Aurora.
  • Hands-on experience with GitHub Actions and ArgoCD for secure and scalable CI/CD automation.
  • Strong background in Infrastructure as Code (IaC) with Crossplane and Terraform.
  • Deep understanding of observability and monitoring with Datadog.
  • Experience with Kyverno for Kubernetes policy-based security enforcement.
  • Proficiency in Python and Bash scripting for automation and system management.
  • Strong understanding of CI/CD security best practices and ability to implement controls for securing deployments.

Requirements

  • Self-starter mentality—actively seeks out and fixes problems without waiting for assignments.
  • High ownership and accountability—takes initiative in driving improvements and following through to resolution.
  • Strong problem-solving mindset—identifies bottlenecks, inefficiencies, and risks, then delivers scalable solutions.
  • Excellent communication skills—documents processes in Confluence, collaborates cross-functionally, and influences engineering teams toward operational excellence.

Preferred Qualifications

  • Deep experience with GitHub Actions for CI/CD automation, with a focus on security best practices.
  • Extensive knowledge of Helm charts for managing Kubernetes applications.
  • Strong experience in PostgreSQL, including optimization and high availability in RDS/Aurora.
  • Experience with NoSQL databases and best practices for scaling and performance.
  • Proven ability to influence engineering culture toward automation, self-service, and operational excellence.
  • Experience with Karpenter for Kubernetes autoscaling.
  • Previous experience with cost optimization strategies in AWS environments.
  • Experience with Atlassian tools (Jira, Confluence) for tracking incidents and documentation.
  • Strong experience with and a passion for expanding AI into the SRE and DevOps world.

Compensation

Compensation for this position includes a base salary, equity, and a variety of benefits. The U.S. base salary range for this role is $140,000 - $185,000 USD. Actual base salaries will be based on candidate-specific factors, including experience, skillset, and location, and local minimum pay requirements as applicable.

  • Health, dental, and vision care for you and your family
  • Life insurance
  • Mental wellness coverage
  • Fertility and growing family support
  • Flex Time Off in addition to company paid holidays
  • Paid family leave, medical leave, and bereavement leave policies
  • Retirement saving plans
  • Allowance to customize your work and technology setup at home
  • Annual professional development stipend

Your recruiter can provide additional details about compensation and benefits.

Similar Remote Jobs

More jobs at Junipersquare

More Devops / Sysadmin jobs

More jobs in USA

Before You Apply
️
đź“Ť Be aware of the location restriction for this remote position: USA, UK, Canada, India, Luxembourg
‼ Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Senior Site Reliability Engineer @Junipersquare
Devops / Sysadmin
Salary đź’¸ $140,000 - $185,000 usd
Remote Location
USA, UK, Canada, India, Luxembourg
Job Type full-time
Posted Apr 04, 2025
Apply for this position Unlock 54,710 Remote Jobs
️
đź“Ť Be aware of the location restriction for this remote position: USA, UK, Canada, India, Luxembourg
‼ Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Senior Site Reliability Engineer Apply for this position Unlock 54,710 Remote Jobs
Ă—
  • Unlock 54,710 hidden remote jobs.
  • Your shortcut to remote work. Apply before everyone else.
  • Click and apply. No middlemen, no hassle.

We’re not like the other sites. Come see why!

50% off in April 2025
  • Single payment
  • Lifetime access
  • Filter by location/skills/salary…
  • Create custom email alerts
  • Private Slack Community