Role Description
We are looking for an experienced Senior DevOps Engineer. The ideal candidate will have a strong grasp of DevOps principles, technical expertise, and a focus on improving system performance and reliability.
-
Availability & Performance:
Administer servers and virtual machines running Debian, configure load balancers, implement DDoS protection, and work with virtualization platforms (Proxmox VE).
-
Networking & Infrastructure:
Configure networks, manage DNS, optimize routing and load balancing, and ensure high availability.
-
Automation & IaC:
Develop and maintain infrastructure using Ansible, AWX, and Terraform.
-
Monitoring & Incident Management:
Set up monitoring and alerting systems (Zabbix, Grafana, Victoria Metrics, Opsgenie), analyze logs, and troubleshoot system issues.
-
Containerization & Orchestration:
Work with Docker and Kubernetes for application management, develop Helm charts, manage clusters, and integrate with CI/CD pipelines.
-
Data Systems:
Configure and optimize HA solutions for Redis, Kafka, RabbitMQ, PostgreSQL, MongoDB, and ClickHouse.
-
Process Automation:
Develop automation scripts using Python or Golang.
-
Communication & DevOps Culture:
Collaborate with development and QA teams, promote DevOps best practices.
-
Continuous Improvement:
Implement and maintain CI/CD pipelines, identify opportunities for process optimization and automation.
Qualifications
-
5+ years of experience in a similar role.
-
Strong knowledge of server hardware and experience with bare-metal hosting.
-
Excellent troubleshooting skills for production servers running Debian/Ubuntu.
-
Hands-on experience with virtualization: QEMU KVM, libvirtd, Proxmox VE.
-
Solid understanding of networking: TCP/IP, DNS, L2/L3 networks, routing, IPTables, VPN.
-
Experience configuring and managing Nginx.
-
Proficiency with Docker and Kubernetes, including Helm chart development.
-
Strong experience with IaC tools: Ansible, Terraform, etc.
-
CI/CD experience with GitLab CI, ArgoCD, Flux.
-
Experience working with PostgreSQL, Elasticsearch, Redis, MongoDB, ClickHouse.
-
Practical experience with Kafka and RabbitMQ.
-
Experience with monitoring tools: Zabbix, Prometheus, Victoria Metrics, Grafana, ELK Stack.
Requirements
-
Experience with AWS or GCP (Nice to Have).
-
Programming skills in Python or Golang (Nice to Have).
Benefits
-
20 days of paid vacation, 5 family days and sick leave compensation.
-
Flexible workday start, allowing you to manage your schedule comfortably.
-
Support from a professional corporate coach and psychologist.
-
Regular internal and external activities, workshops, team trips, and corporate events.
-
Access to our internal knowledge base, meetups, and team-building initiatives.
-
Continuous learning opportunities: training in new technologies and ongoing support for your professional development.