[Hiring] Cloud Reliability & Recovery Engineer @AlphaSense India
Cloud Reliability & Recovery Engineer @AlphaSense India
All Others
Salary unspecified
Remote Location
Employment Type full-time
Posted 2d ago

[Hiring] Cloud Reliability & Recovery Engineer @AlphaSense India

2d ago - AlphaSense India is hiring a remote Cloud Reliability & Recovery Engineer. πŸ’Έ Salary: unspecified πŸ“Location: India

Role Description

We are seeking an experienced Cloud Engineer to design, implement, and continuously improve our Business Continuity Planning (BCP) and Disaster Recovery (DR) capabilities across AWS cloud environments. This is a hands-on technical role requiring deep AWS expertise, strong scripting skills, and a passion for building highly available, fault-tolerant, and resilient cloud architecture by leveraging container orchestration with Kubernetes and infrastructure as code using Terraform.

Good understanding of CI/CD pipelines to enable rapid, reliable deployments and minimize downtime. Adept at implementing DR strategies including multi-region failover, backup and restore automation, and recovery testing aligned with industry BCP/DR standards. You will collaborate closely with security, infrastructure, and application teams to ensure our systems can withstand and rapidly recover from any disruption.

Qualifications

  • 5+ years in cloud infrastructure, SRE, or IT disaster recovery engineering roles
  • 3+ years of hands-on AWS experience in production environments at scale
  • Proven delivery of multi-region DR architectures with defined and tested RTO/RPO targets
  • Expert-level proficiency with core AWS resilience services
  • Strong scripting skills: Python, Bash, or PowerShell for automation and orchestration
  • Experience with Infrastructure as Code: Terraform and/or AWS CloudFormation
  • Solid understanding of networking fundamentals: VPC, TGW, Direct Connect, VPN, DNS failover
  • Excellent written and verbal communication; able to produce executive-level DR reports

Requirements

  • Design and implement multi-region, multi-AZ AWS architectures that meet RTO/RPO targets
  • Engineer active-active and active-passive failover patterns using Route 53, Global Accelerator, and CloudFront
  • Build automated DR runbooks and playbooks using AWS Systems Manager Automation and Step Functions
  • Implement chaos engineering practices using AWS Fault Injection Simulator (FIS) to validate resiliency
  • Architect cross-region replication strategies for S3, DynamoDB Global Tables, RDS, and Aurora Global
  • Review containerized workloads using Kubernetes, ensuring resilience through self-healing, auto-scaling, and multi-cluster or multi-region deployments
  • Administer AWS Backup across all services (EC2, EBS, RDS, EFS, FSx, DynamoDB, Aurora) with policy-based automation
  • Design immutable backup vaults and cross-account/cross-region backup replication pipelines
  • Develop and automate data recovery testing procedures, ensuring integrity and meeting defined SLAs
  • Implement point-in-time recovery (PITR) for databases and storage; validate via regular restore drills
  • Maintain Business Continuity Plans (BCP) and Disaster Recovery (DR) strategies, including tracking RTO (Recovery Time Objective) and RPO (Recovery Point Objective)
  • Author and maintain Terraform/CloudFormation templates for all BCP/DR infrastructure components
  • Automate DR testing pipelines through CI/CD (CodePipeline, CodeBuild, GitHub Actions)
  • Write Python/Bash/PowerShell scripts to orchestrate failover, failback, and health-check workflows
  • Manage infrastructure state in AWS Control Tower and implement Landing Zone DR patterns
  • Build CloudWatch dashboards, alarms, and composite alarms for availability and DR-readiness indicators
  • Integrate AWS Health, Personal Health Dashboard events into PagerDuty/OpsGenie alerting workflows
  • Participate in on-call rotations and lead DR incident response; conduct post-incident reviews (PIRs)
  • Develop and maintain runbooks for AWS service degradations, regional outages, and data corruption events
  • Conduct regular BCP/DR tabletop exercises and full failover simulations to validate recovery procedures and improve organizational readiness, document results and action items
  • Ensure DR controls meet SOC 2, ISO 22301, NIST 800-53, and HIPAA/PCI requirements as applicable
  • Maintain current and accurate DR documentation: BIAs, BCPs, DRP runbooks, and recovery evidence
  • Collaborate with audit and compliance teams to provide DR evidence and remediation tracking

Company Description

AlphaSense is an equal-opportunity employer. We are committed to a work environment that supports, inspires, and respects all individuals. All employees share in the responsibility for fulfilling AlphaSense’s commitment to equal employment opportunity. AlphaSense does not discriminate against any employee or applicant on the basis of race, color, sex (including pregnancy), national origin, age, religion, marital status, sexual orientation, gender identity, gender expression, military or veteran status, disability, or any other non-merit factor.

This policy applies to every aspect of employment at AlphaSense, including recruitment, hiring, training, advancement, and termination.

In addition, it is the policy of AlphaSense to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations, and ordinances where a particular employee works.

Before You Apply
️
remote Be aware of the location restriction for this remote position: India
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Cloud Reliability & Recovery Engineer @AlphaSense India
All Others
Salary unspecified
Remote Location
Employment Type full-time
Posted 2d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 150,000+ Remote Jobs
️
remote Be aware of the location restriction for this remote position: India
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 150,000+ Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 150,000+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later