[Hiring] Senior Software Engineer – AI Middleware @Cornelis Networks, Inc.
Senior Software Engineer – AI Middleware @Cornelis Networks, Inc.
Software Development
Salary competitive com..
Remote Location
🇺🇸 USA Only
Employment Type full-time
Posted 2mths ago

[Hiring] Senior Software Engineer – AI Middleware @Cornelis Networks, Inc.

2mths ago - Cornelis Networks, Inc. is hiring a remote Senior Software Engineer – AI Middleware. 💸 Salary: competitive compensation package including equity and incentives 📍Location: USA

Role Description

We are seeking a highly experienced Senior Software Engineer to design, develop, and upstream-enable Cornelis Networks’ AI communication middleware. This role focuses on distributed AI workloads and enabling/optimizing collective communication libraries (e.g., NCCL/RCCL) over Cornelis Networks’ interconnects.

  • Design and implement performance-critical features for CCL enablement on Cornelis Networks’ fabrics.
  • Optimize distributed training performance across multi-node, multi-GPU configurations.
  • Improve GPU communication paths including GPU-direct transfers, IPC, and CPU/GPU synchronization.
  • Profile distributed AI workloads and identify bottlenecks across the software and hardware stack.
  • Tune AI frameworks such as PyTorch Distributed, TensorFlow/XLA, JAX, DeepSpeed, and Megatron-LM.
  • Develop benchmarks and microbenchmarks aligned with real model performance.
  • Contribute upstream to AI communication and distributed training projects.
  • Participate in design reviews, code reviews, CI, and long-term maintenance.
  • Prototype and validate Ultra Ethernet capabilities for AI collective communication.
  • Provide technical input for deployment considerations and performance validation.
  • Collaborate with kernel/driver, switch, performance, and systems teams.
  • Support advanced escalations by analyzing traces and providing robust fixes.

Qualifications

  • 8+ years of experience in high-performance systems programming in C/C++ on Linux.
  • Strong experience with GPU communication stacks including CUDA/ROCm and NCCL/RCCL.
  • Ability to optimize distributed training performance using profiling and tracing.
  • Understanding of collective communication concepts and topology awareness.
  • Experience delivering production-quality code.
  • Open-source contributions in relevant areas.

Requirements

  • Experience with AI frameworks such as PyTorch Distributed, DeepSpeed, and Megatron-LM.
  • Familiarity with libfabric/OFI, UCX, and RDMA concepts.
  • Experience with RoCEv2 and Ultra Ethernet.
  • Experience building cluster-scale performance test infrastructure.

Benefits

  • Competitive compensation package that includes equity, cash, and incentives.
  • Health and retirement benefits.
  • Dynamic, flexible work environment.
  • Access to a broad range of benefits, including medical, dental, and vision coverage.
  • Disability and life insurance.
  • Dependent care flexible spending account.
  • Accidental injury insurance and pet insurance.
  • Generous paid holidays.
  • 401(k) with company match.
  • Open Time Off (OTO) for regular full-time exempt employees.
  • Other paid time off benefits include sick time, bonding leave, and pregnancy disability leave.
Before You Apply
🇺🇸 Be aware of the location restriction for this remote position: USA Only
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Senior Software Engineer – AI Middleware @Cornelis Networks, Inc.
Software Development
Salary competitive com..
Remote Location
🇺🇸 USA Only
Employment Type full-time
Posted 2mths ago
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Application Denied
Unlock 160,000+ Remote Jobs
🇺🇸 Be aware of the location restriction for this remote position: USA Only
Beware of scams! When applying for jobs, you should NEVER have to pay anything. Learn more.
Apply for this position
Did not apply
Applied
Sent Follow-Up
Interview Scheduled
Interview Completed
Offer Accepted
Offer Declined
Application Denied
Unlock 160,000+ Remote Jobs
×

Apply to the best remote jobs
before everyone else

Access 160,000+ vetted remote jobs and get daily alerts.

4.9 ★★★★★ from 500+ reviews
Unlock All Jobs Now

Maybe later