Lead Systems HPC Engineer @Nebius
Software Development
Salary usd 170,000 - 3..
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Job Type full-time
Posted 2d ago

[Hiring] Lead Systems HPC Engineer @Nebius

2d ago - Nebius is hiring a remote Lead Systems HPC Engineer. πŸ’Έ Salary: usd 170,000 - 300,000 per year πŸ“Location: USA

Role Description

We are looking for a Lead Systems HPC Engineer to play a key role in building our hyperscaler platform, working across its core components while analyzing and optimizing the performance of large-scale GPU clusters at the intersection of hardware and software.

You will operate across the full stackβ€”from hardware and system software to networking (InfiniBand/RoCE), virtualization (KVM/QEMU), and distributed communication layers (e.g., MPI, NCCL).

In this role you will:

  • Focus on understanding system behavior across multiple layers, identifying performance bottlenecks, and driving improvements that shape how our clusters are built, operated, tuned, and validated.
  • Investigate and troubleshoot performance issues of GPU cluster under real workloads (training and inference).
  • Evaluate and integrate new hardware, system configurations and tuning approaches through software stack.
  • Support complex performance-related escalations from internal teams and customers.
  • Work closely with infrastructure, software engineering and hardware vendor teams (e.g. NVIDIA, Mellanox, Intel).
  • Contribute to hardware and cluster qualification (acceptance), ensuring systems meet performance expectations.

Qualifications

  • 5+ years of professional experience in system-level software development (focused on performance optimization, low-level programming).
  • 3+ years of hands-on experience with Linux systems (administration, troubleshooting, and performance tuning).
  • In-depth understanding of server architecture, including PCIe devices, NICs, Linux OS/Kernel, and high-performance computing (HPC) systems.
  • Strong proficiency in one or more performance-oriented programming languages (C/C++, Go, Python).

Requirements

  • We conduct coding interviews as part of the process.

Benefits

  • Health insurance: 100% company-paid medical, dental and vision coverage for employees and families.
  • 401(k) plan: Up to 4% company match with immediate vesting.
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
  • Remote work reimbursement: Up to $85/month for mobile and internet.
  • Disability & life insurance: Company-paid short-term, long-term and life insurance coverage.

Compensation

We offer competitive salaries ranging from $170k-$300k OTE + equity based on your experience.

What we offer

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.
Before You Apply
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Lead Systems HPC Engineer @Nebius
Software Development
Salary usd 170,000 - 3..
Remote Location
πŸ‡ΊπŸ‡Έ USA Only
Job Type full-time
Posted 2d ago
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
️
πŸ‡ΊπŸ‡Έ Be aware of the location restriction for this remote position: USA Only
β€Ό Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply βœ“
Applied βœ“
Sent Follow-Up βœ“
Interview Scheduled βœ“
Interview Completed βœ“
Offer Accepted βœ“
Offer Declined βœ“
Unlock 152,720 Remote Jobs
Γ—

Apply to the best remote jobs
before everyone else

Access 152,720+ vetted remote jobs and get daily alerts.

4.9 β˜…β˜…β˜…β˜…β˜… from 500+ reviews
Unlock All Jobs Now

Maybe later