Senior DevOps / SRE Engineer @MLabs
DevOps / Sysadmin
Salary usd 120,000 - 1..
Remote Location
Job Type full-time
Posted Today

[Hiring] Senior DevOps / SRE Engineer @MLabs

Today - MLabs is hiring a remote Senior DevOps / SRE Engineer.
💸 Salary: USD 120,000 - 150,000 per year
📍 Location: USA timezones, GMT (UTC+0)

Role Description

A confidential client operating at the intersection of decentralized finance and artificial intelligence is seeking a Senior DevOps / SRE Engineer. This role is critical to the organization’s mission: managing high-stakes environments where infrastructure reliability directly impacts capital protection.

The successful candidate will own the architecture that keeps dozens of concurrent AI agents alive, fast, and secure. This is a high-impact position designed for an engineer who thrives on building resilient, zero-downtime systems for autonomous agents managing real-time financial workloads.

Key Responsibilities

  • Agent Infrastructure Management: Build and maintain the infrastructure for concurrent AI trading agents, managing complex cron schedules, state files, and trailing stop processes.
  • Deployment & Orchestration: Deploy and manage agent environments, including workspace persistence, isolated session management, and Model Context Protocol (MCP) server connectivity.
  • CI/CD Pipeline Development: Design and operate pipelines for shipping trading skills and plugins to production without interrupting live trading activity.
  • Zero-Downtime Operations: Execute deployment strategies (blue/green, canary) that keep active financial positions protected during every infrastructure change.
  • Observability & Monitoring: Build comprehensive alerting across the full stack using metrics, logs, and traces to detect agent failures, state file corruption, or infrastructure regressions before financial loss occurs.
  • Cloud & Database Scaling: Operate and scale core platform infrastructure, including Kubernetes (EKS) clusters, Redis, Postgres, ClickHouse, and Kafka.
  • Blockchain Reliability: Maintain blockchain node infrastructure and ensure stable connectivity to exchange APIs and on-chain transaction systems.
  • Incident Leadership: Lead incident response and on-call practices, including debugging, mitigation, and post-mortems to improve long-term platform reliability.

Qualifications

  • Extensive experience in DevOps, SRE, or Infrastructure Engineering, preferably within a startup environment where systems were built from the ground up.
  • Proven track record of deploying, scaling, and debugging production workloads, specifically within AWS EKS.
  • Proficiency with tools such as Terraform, Ansible, or equivalent frameworks.
  • Hands-on experience with Docker and Helm for packaging production services.
  • Experience operating production-grade data and messaging systems (Redis, Postgres/RDS, ClickHouse, Kafka).
  • Strong experience with Prometheus, Grafana, Datadog, Loki, or OpenTelemetry to build proactive operational visibility.
  • Ability to debug across multiple languages, including Python, Node.js, and Go.

Requirements

  • Understanding of systems where latency and reliability have direct financial consequences.
  • Familiarity with node infrastructure, exchange APIs, wallet operations, and on-chain monitoring.
  • Experience managing secrets, access controls, and production hardening for sensitive financial environments.
  • Experience defining SLOs and building mature on-call practices.

Preferred Qualifications (Plus)

  • Experience with OpenClaw agent deployments and workspace templates.
  • Familiarity with Model Context Protocol (MCP) server deployment and auth management.
  • Direct experience with Hyperliquid or other decentralized exchange (DEX) protocols.
  • Background in fintech, market data infrastructure, or high-frequency trading systems.

Benefits

  • Opportunity to build infrastructure for a new category of software (Autonomous AI Agents).
  • High-autonomy environment with a focus on engineering excellence and technical ownership.
  • Competitive compensation package commensurate with senior-level experience.
  • Remote-first or flexible working arrangements (as specified by the client).

Commitment to Equality and Accessibility

At MLabs, we are committed to offering equal opportunities to all candidates. We ensure that there is no discrimination, that our job adverts are accessible, and that information is provided in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all.

If you need any reasonable adjustments during any part of the hiring process, or would like to see the job advert in an accessible format, please let us know at the earliest opportunity by emailing [email protected].

Before You Apply
Be aware of the location restriction for this remote position: USA timezones, GMT (UTC+0)