Back to Remote jobs  >   AI / ML
Solutions Architect - AI / ML - Training & GPU infra @The Next Chapter W&S
AI / ML
Salary total compensat..
Remote Location
Job Type full-time
Posted 4d ago

[Hiring] Solutions Architect - AI / ML - Training & GPU infra @The Next Chapter W&S

4d ago - The Next Chapter W&S is hiring a remote Solutions Architect - AI / ML - Training & GPU infra. πŸ’Έ Salary: total compensation up to eu 300k (base + variable) πŸ“Location: Europe

Role Description

Join a fast-moving AI infrastructure team working on the cutting edge of large-scale ML workloads. This role is ideal for engineers who enjoy solving deep technical challenges in distributed training, multi-GPU systems, and scalable AI inference infrastructure. You will work directly with AI-focused clients, helping them get the most out of modern GPUs (H100, B200, etc.) and ML frameworks such as PyTorch (and JAX in some environments).

Work alongside senior AI and infrastructure engineers building large-scale GPU platforms. As part of the customer solutions team, you will:

  • Design and validate production-grade distributed training (primary) and large-scale inference architectures on large GPU clusters, typically tens to thousands of GPUs.
  • Work hands-on with customers to debug, optimize, and scale ML workloads across multi-node GPU environments.
  • Act as a technical authority on GPU performance, networking, and schedulers, making trade-offs at scale and translating customer needs into concrete platform requirements.
  • Collaborate closely with engineering, product, and R&D to influence roadmap decisions based on real-world ML workloads.

This is a hands-on, technical role; you are expected to work directly in customer environments, not only advise at a high level.

Qualifications

  • Hands-on experience designing and operating production-grade, multi-node GPU workloads for training or inference.
  • Strong background in distributed deep learning (PyTorch Distributed, DeepSpeed) on GPU clusters.
  • Deep understanding of GPU architecture and interconnects (H100/A100 class, NVLink, InfiniBand).
  • Experience with Kubernetes or Slurm and performance tuning using GPU profiling and monitoring tools.

Requirements

This role is not a fit if your experience is limited to single-node training, high-level AI strategy, or non-production research environments. We are looking for engineers and architects who thrive at the intersection of AI workloads and large-scale infrastructure.

Benefits

  • Location: Remote from anywhere in Europe.
  • Total compensation up to EU 300k (base + variable), depending on level and experience.
Before You Apply
️
remote Be aware of the location restriction for this remote position: Europe
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Back to Remote jobs  >   AI / ML
Solutions Architect - AI / ML - Training & GPU infra @The Next Chapter W&S
AI / ML
Salary total compensat..
Remote Location
Job Type full-time
Posted 4d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
️
remote Be aware of the location restriction for this remote position: Europe
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 152,720+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later