[Hiring] Machine Learning DevOps - Cloud and Compute Cluster - R&D Support @Pathway
Machine Learning DevOps - Cloud and Compute Cluster - R&D Support @Pathway
Artificial Intelligence
Salary based on profil..
Remote Location
Employment Type full-time
Posted Today

[Hiring] Machine Learning DevOps - Cloud and Compute Cluster - R&D Support @Pathway

Today - Pathway is hiring a remote Machine Learning DevOps - Cloud and Compute Cluster - R&D Support. πŸ’Έ Salary: based on profile and location πŸ“Location: Northern America, Europe

Role Description

We are currently searching for a Machine Learning DevOps with experience in cloud and compute cluster management, scaling infrastructures, and Linux administration. Our development, ML training, and production environment is in the cloud, using several major cloud providers. We need support in managing and automating the processes, and scaling the infrastructure to growing team and production needs.

  • Optimize infrastructure for ML training and inference (e.g., GPUs, distributed compute).
  • Automate and maintain ML/LLM pipelines (data ingestion, training, validation, deployment).
  • Manage model versioning, reproducibility, and traceability.
  • Work with terabyte-large datasets.
  • Implement ML-centric CI/CD practices.
  • Monitor model performance and data drift in production.
  • Collaborate with machine learning engineers, software engineers, and platform teams.

The role focuses on operationalizing machine learning models, ensuring scalability, reliability, and automation across the ML lifecycle.

Qualifications

  • Very good familiarity with Linux, shell scripts, and cluster configuration scripts as the basic work tool.
  • Proficiency in workload management, containerization and orchestration (Slurm, Docker, Kubernetes).
  • Solid grasp of CI/CD tools and workflows (GitHub Actions, Jenkins, Gitlab CI, etc.).
  • Cloud infrastructure knowledge (AWS, GCP, Azure) – especially in ML services (e.g., SageMaker Hyperpod, Vertex AI).
  • Familiarity with monitoring/logging tools (Grafana, CloudWatch, Prometheus, Loki).
  • Experience with infrastructure as code (Terraform, CloudFormation, cluster-toolkit).
  • Experience with ML pipeline orchestration tools (e.g., MLflow, Kubeflow, Airflow, Metaflow).
  • Programming skills in Python (with exposure to ML libraries like TensorFlow, PyTorch).
  • Experience with cluster, systems, and networks administration.
  • Willingness to learn.
  • This position holds a minimum requirement of a BSc in Computer Science or Information Technology.
  • We will generally favor candidates who have undertaken ambitious efforts in the past.

Requirements

  • Experience with contributions to the Linux kernel, important bug bounties, or supporting an academic grid/cluster computing team in a scaling effort.
  • Achievements such as winning a sports championship should be mentioned in your application.

Benefits

  • Intellectually stimulating work environment.
  • Be a pioneer: work with real-time data processing & AI.
  • Work in one of the hottest AI startups, with exciting career prospects.
  • Responsibilities and ability to make significant contributions to the company’s success.
  • Inclusive workplace culture.

Further Details

  • Type of contract: Permanent employment contract.
  • Preferable joining date: Immediate.
  • Compensation: based on profile and location.
  • Location: Remote work with the possibility to work or meet with team members in offices located in Palo Alto, CA; Paris, France; or Wroclaw, Poland.
  • Candidates based anywhere in the EU, United States, and Canada will be considered.
Before You Apply
️
remote Be aware of the location restriction for this remote position: Northern America, Europe
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Machine Learning DevOps - Cloud and Compute Cluster - R&D Support @Pathway
Artificial Intelligence
Salary based on profil..
Remote Location
Employment Type full-time
Posted Today
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 160,000+ Remote Jobs
️
remote Be aware of the location restriction for this remote position: Northern America, Europe
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Application Denied βœ“
Unlock 160,000+ Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 160,000+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later