[Hiring] Senior AI Infrastructure Engineer @Sword Health
Senior AI Infrastructure Engineer @Sword Health
Artificial Intelligence
Salary €66,500 - €104,..
Remote Location
Employment Type full-time
Posted 1mth ago

[Hiring] Senior AI Infrastructure Engineer @Sword Health

1mth ago - Sword Health is hiring a remote Senior AI Infrastructure Engineer. πŸ’Έ Salary: €66,500 - €104,500 a year πŸ“Location: Europe

Role Description

As a Senior AI Infrastructure Engineer at Sword Health, you will own the infrastructure that brings our AI models to life in production. Your work will directly power the AI Care platform that is transforming healthcare worldwide.

  • Design, build, and maintain the inference infrastructure that powers Sword Health's AI products, ensuring models are served with high throughput, low latency, and cost efficiency.
  • Own the end-to-end deployment pipeline for AI models - from real-time computer vision powering movement analysis to large language models driving conversational AI experiences.
  • Architect and scale Kubernetes clusters for GPU-accelerated workloads, including autoscaling strategies, resource scheduling, and multi-model serving.
  • Build and operate the infrastructure behind Sword Health's real-time AI agents, including WebRTC cluster provisioning and deploying speech-to-text and text-to-speech capabilities at low latency.
  • Drive inference scaling strategies - evaluate and implement techniques such as speculative decoding, continuous batching, and model parallelism to meet growing demand without proportionally increasing costs.
  • Develop and maintain Infrastructure as Code (Terraform) and GitOps workflows tailored to GPU-enabled, AI-specific environments.
  • Instrument and monitor AI inference systems, building observability around GPU utilization, model latency, throughput, and error rates to ensure reliability and performance.
  • Collaborate closely with ML Engineers, Data Scientists, and Product teams to translate model requirements into robust, production-ready infrastructure.
  • Evaluate emerging AI infrastructure tools, frameworks, and hardware to keep Sword Health at the cutting edge of inference performance and efficiency.
  • Mentor team members on AI infrastructure best practices, fostering knowledge sharing around GPU workloads, model serving patterns, and production ML systems.

Qualifications

  • 5+ years of experience in infrastructure engineering, with at least 2 years focused on AI/ML workloads in production environments.
  • Strong experience with Kubernetes for orchestrating GPU-accelerated workloads, including scheduling, resource management, and autoscaling for inference services.
  • Hands-on experience with model serving and inference optimization frameworks for both real-time computer vision and large language model workloads.
  • Solid understanding of LLM inference optimization techniques, including speculative decoding, batching strategies, quantization, and inference scaling patterns.
  • Experience provisioning and managing infrastructure for real-time AI systems, including WebRTC clusters and AI agent architectures.
  • Familiarity with real-time video/computer vision inference pipelines and the infrastructure challenges of processing continuous visual data streams at low latency.
  • Familiarity with speech-to-text and text-to-speech serving infrastructure and the challenges of running voice AI at low latency.
  • Experience with Infrastructure as Code (Terraform or similar) and GitOps methodologies for managing complex, GPU-enabled environments.
  • Working knowledge of GPU infrastructure - NVIDIA CUDA ecosystem, multi-GPU setups, and GPU monitoring/profiling.
  • Strong Linux systems fundamentals and networking knowledge, particularly for latency-sensitive, real-time workloads.
  • Fluent in English (written and oral).
  • A proactive, ownership-driven mindset - you see a bottleneck in an inference pipeline and you fix it before it becomes a problem.

Requirements

  • Experience with LLM serving engines such as vLLM, SGLang, or LLM-D.
  • Experience with NVIDIA Triton Inference Server and TensorRT for real-time computer vision workloads.
  • Familiarity with NVIDIA Riva or similar platforms for STT/TTS serving.
  • Understanding of speculative decoding, continuous batching, quantization, and model parallelism techniques.
  • Experience with Istio or similar service mesh.
  • Experience with Kafka for event streaming.
  • Experience with Prometheus, AlertManager, and Grafana for monitoring and observability.
  • Experience with Elasticsearch, Logstash, and Kibana (ELK) for log management.
  • Experience with Vault for secrets management.
  • Experience with Redis, MySQL, and DNS management.
  • Experience provisioning infrastructure on AWS, Azure, or GCP.
  • Good knowledge of cloud networking including VPC management, routing, NAT, and troubleshooting with tools like TCPdump.
  • Experience with WebRTC infrastructure and real-time media streaming.
  • Experience with Python, Go, or similar languages commonly used in ML infrastructure tooling.
  • Familiarity with SCRUM methodology.

Benefits

  • A stimulating, fast-paced environment with lots of room for creativity.
  • A bright future at a promising high-tech startup company.
  • Career development and growth, with a competitive salary.
  • The opportunity to work with a talented team and to add real value to an innovative solution with the potential to change the future of healthcare.
  • A flexible environment where you can control your hours (remotely) with unlimited vacation.
  • Access to our health and well-being program (digital therapist sessions).
  • Remote or Hybrid work policy.
Before You Apply
️
remote Be aware of the location restriction for this remote position: Europe
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Senior AI Infrastructure Engineer @Sword Health
Artificial Intelligence
Salary €66,500 - €104,..
Remote Location
Employment Type full-time
Posted 1mth ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 160,000+ Remote Jobs
️
remote Be aware of the location restriction for this remote position: Europe
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 160,000+ Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 160,000+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later