Site Reliability Engineer II @Backblaze External Website
DevOps / Sysadmin
Salary unspecified
Remote Location
Job Type full-time
Posted 5d ago

[Hiring] Site Reliability Engineer II @Backblaze External Website

5d ago - Backblaze External Website is hiring a remote Site Reliability Engineer II. 💸 Salary: unspecified 📍Location: India

Role Description

We are seeking a Site Reliability Engineer II (SRE II) to help ensure the stability, scalability, and reliability of our services and infrastructure. This role focuses on building automation, maintaining observability, and supporting incident response to keep customer-facing systems performing at their best. The SRE will collaborate with engineering, product, and operations teams to embed reliability practices into day-to-day development and operations while contributing to tools and processes that improve efficiency and reduce manual effort.

Key Responsibilities

  • Service Reliability & Operations
    • Support the availability and durability of critical services across production environments.
    • Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk.
    • Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements.
    • Follow established ITIL/OSS processes (incident, change, problem, and capacity management).
  • Automation & Tooling
    • Develop automation for common operational tasks, reducing manual intervention and toil.
    • Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, Catchpoint, ELK).
    • Work with CI/CD pipelines, configuration management, and infrastructure as code tools (Terraform, Ansible, Jenkins).
    • Write scripts (Bash, Python, Go, etc.) to improve system reliability and efficiency.
  • Collaboration
    • Partner with engineering, product, and operations teams to support resilient system design and operations.
    • Assist in capacity planning and disaster recovery exercises.
    • Work with vendors and service providers to troubleshoot service issues and track SLA performance.
    • Document systems, share learnings, and help grow a reliability-minded engineering culture.
  • Continuous Improvement
    • Contribute to playbooks, runbooks, and operational documentation.
    • Identify recurring issues and propose long-term improvements.
    • Promote reliability-focused practices within development and operations teams.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 2–4 years of experience in site reliability, systems engineering, or operations.
  • Exposure to large-scale, production-grade systems.
  • Solid Linux systems administration and troubleshooting skills.
  • Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis.
  • Proficiency in at least one scripting language (Python, Bash, or Go).
  • Understanding of containers (Kubernetes, Docker) and microservices concepts.
  • Knowledge of incident response and operational best practices.

Preferred Attributes

  • Experience in a SaaS, service provider, or distributed systems environment.
  • Familiarity with ITIL/OSS practices and SLO/SLA’s.
  • Strong problem-solving skills and willingness to learn new technologies.
  • Experience with cloud platforms (AWS, GCP, or Azure).
  • Ability to work independently, take ownership, and drive projects from problem discovery through resolution.

Company Description

At Backblaze, we value being fair and good to our customers, partners, and employees. That’s why diversity, equity, and inclusion are at the core of our values. We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, and education. We believe that our dedication to cultivating a diverse workspace not only allows us to better serve our customers in over 175 countries, but further reinforces our commitment to doing the right thing.

We are proud to be an Equal Opportunity Employer.

Before You Apply
remote Be aware of the location restriction for this remote position: India
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Site Reliability Engineer II @Backblaze External Website
DevOps / Sysadmin
Salary unspecified
Remote Location
Job Type full-time
Posted 5d ago
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Unlock 152,720 Remote Jobs
remote Be aware of the location restriction for this remote position: India
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Unlock 152,720 Remote Jobs
×

Apply to the best remote jobs
before everyone else

Access 152,720+ vetted remote jobs and get daily alerts.

4.9 ★★★★★ from 500+ reviews
Unlock All Jobs Now

Maybe later