Get daily remote job opportunities in your inbox

No middlemen, no spam, no infinite scrolling.

Get relevant job opportunities, one email at a time.

Unsubscribe at any time.

Site Reliability Engineer @Cognits

[Hiring] Site Reliability Engineer @Cognits

Apr 13, 2025 - Cognits is hiring a remote Site Reliability Engineer. 💸 Salary: unspecified. 📍Location: Latin America (LATAM).

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We are looking for a passionate and detail-oriented Site Reliability Engineer (SRE) to help design, build, and maintain reliable infrastructure and cloud-based services. In this role, you will adopt and promote SRE best practices, improve observability, and work closely with engineering, security, and product teams to ensure scalable and resilient systems.

  • Apply SRE principles to the design, operation, and scaling of cloud services.
  • Take ownership of the reliability and performance of critical infrastructure and applications.
  • Participate in the on-call rotation, handling production incidents and driving root cause analysis.
  • Build and manage Infrastructure-as-Code (IaC) using Terraform, Pulumi, or similar tools.
  • Manage cloud environments (primarily AWS) and enterprise networking components like NGINX, load balancers, firewalls, VPCs, DNS, and security groups.
  • Work with Kubernetes, Helm, and Spinnaker to orchestrate and manage containerized workloads.
  • Develop tools and applications in Java, Python, or Go to improve system automation and observability.
  • Collaborate with cross-functional teams to ensure service-level objectives (SLOs) are met.
  • Continuously improve monitoring and alerting systems using Prometheus, Grafana, Splunk, or Datadog.
  • Communicate proactively with stakeholders and leadership through reports, updates, and postmortems.
  • Drive a culture of resilience, operational excellence, and continuous improvement.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or an equivalent combination of education and hands-on experience.
  • 5+ years of hands-on experience in infrastructure or site reliability engineering.
  • Proven experience working with cloud-native environments and distributed systems.
  • B2+ English level, both written and spoken.

Requirements

  • Clear and concise communication with technical and non-technical audiences.
  • Strong analytical thinking and ability to manage complex systems.
  • Comfortable with ambiguity, able to define and lead initiatives proactively.
  • Thrives in fast-paced, high-stakes environments.
  • Strong sense of ownership and accountability.
  • Expertise in Amazon Web Services (AWS) or other cloud platforms.
  • Proficiency in Infrastructure-as-Code tools like Terraform and Pulumi.
  • Deep experience in enterprise networking: NGINX, load balancers, firewalls, VPCs, DNS, ACLs.
  • Experience with containerized applications and Docker.
  • Production-grade usage of Kubernetes, Helm, and Spinnaker.
  • Programming/scripting in Python, Java, or Go.
  • In-depth knowledge of build/release pipelines and automation practices.
  • Advanced monitoring and observability with Prometheus, Grafana, Datadog, Splunk, or similar.
  • Familiarity with CI/CD workflows, incident response, and recovery strategies.
  • Experience leading or contributing to on-call rotations and incident response protocols.
  • Cloud certifications (AWS Certified DevOps Engineer, GCP Professional SRE, etc.) are a plus.
  • Certifications in Kubernetes administration or Terraform.
  • Contributions to open source or internal DevOps tooling.
  • Experience implementing SLOs/SLIs and measuring error budgets.

Similar Remote Jobs

More jobs at Cognits

More Devops / Sysadmin jobs

More jobs in Latin America (LATAM)

Before You Apply
📍 Be aware of the location restriction for this remote position: Latin America (LATAM)
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Site Reliability Engineer @Cognits
Devops / Sysadmin
Salary 💸 unspecified
Remote Location
Latin America (LATAM)
Job Type contract
Posted Apr 13, 2025
Apply for this position Unlock 54,159 Remote Jobs
📍 Be aware of the location restriction for this remote position: Latin America (LATAM)
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Site Reliability Engineer Apply for this position Unlock 54,159 Remote Jobs
×
  • Unlock 54,159 hidden remote jobs.
  • Your shortcut to remote work. Apply before everyone else.
  • Click and apply. No middlemen, no hassle.

We’re not like the other sites. Come see why!

50% off in April 2025
  • Single payment
  • Lifetime access
  • Filter by location/skills/salary…
  • Create custom email alerts
  • Private Slack Community