[Hiring] DevOps / SRE Engineer - AI Platform @Makro PRO
DevOps / SRE Engineer - AI Platform @Makro PRO
Devops
Salary unspecified
Remote Location
Employment Type full-time
Posted 4d ago

[Hiring] DevOps / SRE Engineer - AI Platform @Makro PRO

4d ago - Makro PRO is hiring a remote DevOps / SRE Engineer - AI Platform. πŸ’Έ Salary: unspecified πŸ“Location: Worldwide

Role Description

The DevOps / SRE Engineer owns the operational substrate of an AI-native retail decisioning platform β€” infrastructure, CI / CD, observability, cost meter, and incident response for a system that runs production agents taking real business actions. The role builds on the enterprise Terraform standard, CI / CD spine, and FinOps tagging policy rather than reinventing parallel infrastructure. Remote candidates outside of Thailand are welcome to apply.

  • Adopt the enterprise Terraform standard and module library for all platform infrastructure; author platform-specific modules where needed (agent runtime, vector DB, knowledge graph); run drift detection weekly.
  • Build platform-specific CI / CD pipelines on the enterprise spine β€” service deploys, agent deploys, eval-gate enforcement; integrate eval gates so no agent reaches production without eval pass.
  • Operate rollback orchestration with sub-15-minute recovery; quarterly game days.
  • Own the platform observability stack β€” OpenTelemetry, Langfuse for LLM traces, custom dashboards for per-agent cost.
  • Implement the per-agent cost meter end-to-end β€” token counts, vector queries, model inference, downstream LLM Gateway costs; surface cost data to the enterprise GenAI cost dashboard.
  • Stand up the platform on-call rotation; author runbooks for every production agent and service; lead incident response with measurable corrective actions.
  • Implement platform cost-tagging policy consistent with the enterprise standard (team, domain, environment, project, agent, suite, persona); report monthly to Cost Review.
  • Drive cost optimisation β€” right-sizing, caching, model routing decisions, reserved compute.

Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related discipline.
  • 5+ years SRE / DevOps with production ownership.
  • Terraform at scale β€” modules, state, drift, environment promotion.
  • CI / CD for data + ML / AI services (GitLab CI / CD or comparable).
  • Cloud platform (Azure preferred; AWS / GCP transferable).
  • Observability β€” OpenTelemetry, Langfuse (or comparable LLM traces), custom dashboards.
  • FinOps β€” tagging policies, attribution, optimisation.
  • Incident response β€” on-call, post-mortems, runbook authorship.

Preferred Qualifications

  • AI / agent platform SRE experience; cost-meter / chargeback systems built or operated.
  • Multi-cloud production experience; open-source contributions to IaC / observability tooling.
  • AI / ML / agent system observability instrumentation (LLM cost, agent cost, eval scores).
  • Vendor certifications such as HashiCorp Terraform Associate / Professional, Azure Solutions Architect Associate, or Databricks Data Engineer Professional.
Before You Apply
️
worldwide Be aware of the location restriction for this remote position: Worldwide
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
DevOps / SRE Engineer - AI Platform @Makro PRO
Devops
Salary unspecified
Remote Location
Employment Type full-time
Posted 4d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 160,000+ Remote Jobs
️
worldwide Be aware of the location restriction for this remote position: Worldwide
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 160,000+ Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 160,000+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later