Role Description
Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best.
You will enjoy the flexibility to telecommute from anywhere within the U.S. as you take on some tough challenges.
Primary Responsibilities:
- Design, implement, and operate CI/CD pipelines supporting application, data, and platform deployments across Azure and Databricks environments.
- Own production reliability, availability, performance, and scalability for cloud platforms, including Databricks workspaces, jobs, clusters, and workflows.
- Build and maintain Infrastructure as Code (IaC) and configuration management to provision and manage Databricks, cloud infrastructure, and networking in a repeatable and secure manner.
- Automate Databricks environment management, including workspace configuration, cluster policies, job orchestration, and access controls.
- Implement and enhance monitoring, alerting, and observability across cloud and Databricks platforms using Splunk, Azure Monitor, and telemetry frameworks.
- Partner with data engineering, platform, security, and product teams to enable reliable, compliant, and scalable data pipelines and analytics platforms.
- Drive cloud and platform modernization initiatives, including containerization, platform standardization, and Databricks best practices.
- Embed AI‑assisted DevOps practices, leveraging GenAI tools to accelerate troubleshooting, automate operational tasks, improve deployment reliability, and optimize system performance.
- Enable AIOps capabilities such as intelligent alerting, anomaly detection, log analysis, and predictive insights for proactive operations across Databricks workloads.
- Support production issue triage and resolution, including on‑call support, incident management, and post‑incident root cause analysis.
- Ensure security‑by‑design and compliance across pipelines and platforms, including secrets management, RBAC, audit readiness, and governance.
- Continuously reduce operational toil and improve delivery velocity through automation, AI‑driven insights, and self‑service tooling.
- Design, develop, and deploy AI-powered solutions to address complex business challenges with emphasis on responsible use of AI.
Qualifications
- Bachelor's degree in Computer Science, IT, Engineering, or a related field.
- 5+ years of experience in DevOps or Platform Engineering.
- 5+ years of experience with CI/CD tools such as Git, Jenkins, or equivalent enterprise platforms.
- 3+ years of experience with .NET, Angular, and TypeScript.
- 3+ years of proven experience operating and supporting Azure public cloud infrastructure.
- 3+ years of experience in scripting or programming languages (Python, Shell, or similar) with a strong automation mindset.
- 3+ years of experience with Infrastructure as Code (IaC), including Terraform‑based, repeatable environment provisioning.
- 1+ years of hands‑on experience with Databricks platform operations, including clusters, jobs, workflows, and environment configuration.
Requirements
- Experience with Infrastructure as Code, automation, and configuration management practices.
- Experience supporting mission‑critical data and analytics platforms, including incident management and RCA.
- Experience with containerization and orchestration technologies (Docker, Kubernetes, AKS).
- Experience operating in security‑ and compliance‑driven environments, including RBAC, audit controls, and governance.
- Prior experience mentoring engineers or acting as a technical lead within DevOps or platform initiatives.
- Solid understanding of monitoring, logging, and observability in large‑scale production environments.
- Strong communication skills and ability to collaborate with cross‑functional engineering teams.
- Exposure to AI‑enabled DevOps or AIOps, including automated remediation, intelligent monitoring, or predictive analytics.
- Familiarity with GenAI tools for AI‑assisted troubleshooting, pipeline optimization, or operational insights.
Benefits
- Comprehensive benefits package.
- Incentive and recognition programs.
- Equity stock purchase and 401(k) contribution (all benefits are subject to eligibility requirements).
Application Deadline
This job will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. The posting may come down early due to the volume of applicants.