Technical Lead - AI Inferences @WEKA
AI / ML
Salary: unspecified
Remote location: 🇺🇸 USA only
Job type: full-time
Posted: 6d ago

[Hiring] Technical Lead - AI Inferences @WEKA

6d ago - WEKA is hiring a remote Technical Lead - AI Inferences. 💸 Salary: unspecified 📍 Location: USA

Role Description

We are seeking a Technical Lead - AI Inferences to spearhead our AI Inference team. In this role, you will bridge the gap between complex research and production-grade engineering. You will lead a tight-knit squad of 3 developers while remaining "hands-on-keyboard," architecting high-performance systems that optimize Large Language Model (LLM) serving.

Responsibilities

  • Technical Leadership: Architect and oversee the deployment of high-throughput, low-latency LLM inference pipelines.
  • Team Management: Mentor and lead a small team of developers, conducting code reviews, sprint planning, and technical career coaching.
  • Inference Optimization: Implement and evaluate state-of-the-art KV cache management solutions, including LMCache, and explore alternatives to minimize redundant computation.
  • Framework Mastery: Deeply integrate and optimize serving engines and supporting components such as vLLM, llm-d, and NIXL to maximize hardware utilization.
  • R&D: Stay at the forefront of the "Inference-as-a-Service" domain, benchmarking new tools and deciding when to pivot the stack.

Qualifications

  • Proven experience with KV cache reuse, speculative decoding, and continuous batching.
  • Deep familiarity with vLLM, LMCache, and NIXL, and an understanding of the trade-offs between centralized and distributed caching.
  • Expertise in Python, C++, or Rust, with a strong grasp of CUDA and GPU memory management.
  • Experience with Kubernetes (K8s) for scaling GPU workloads and optimizing cold-start times.

Benefits

  • Medical, Dental, Vision, Life insurance.
  • 401(k) plan.
  • Flexible Time Off (FTO).
  • Sick time and leave of absence as per the FMLA and other relevant leave laws.