Member of Technical Staff, Inference @Inferact
Software Development
Salary USD 200,000 - 400,000
Remote Location
🇺🇸 USA Only
Job Type full-time
Posted 2 months ago

[Hiring] Member of Technical Staff, Inference @Inferact

2 months ago - Inferact is hiring a remote Member of Technical Staff, Inference. 💸 Salary: USD 200,000 - 400,000 per year 📍 Location: USA

Role Description

We're looking for an inference runtime engineer to push the boundaries of what's possible in LLM and diffusion model serving. Models grow larger. Architectures shift: mixture-of-experts, multimodal, agentic. Every breakthrough demands innovations on the inference engine itself. You'll work at the core of vLLM, optimizing how models execute across diverse hardware and architectures. Your work will directly impact how the world runs AI inference.

Qualifications

  • Bachelor's degree or equivalent experience in computer science, engineering, or similar
  • Deep understanding of transformer architectures and their variants
  • Strong programming skills in Python with experience in PyTorch internals
  • Experience with LLM inference systems (vLLM, TensorRT-LLM, SGLang, TGI)
  • Ability to read and implement model architectures and inference techniques from research papers
  • Demonstrated ability to contribute performant, maintainable code and to debug complex ML codebases

Requirements

  • Deep understanding of KV-cache memory management, prefix caching, and hybrid model serving
  • Familiarity with RL frameworks and algorithms for LLMs
  • Experience with multimodal inference (audio/image/video/text)
  • Contributions to open-source ML or system infrastructure projects

Benefits

  • Generous health, dental, and vision benefits
  • 401(k) company match

Logistics

  • Location: This role is based in San Francisco, California. We will consider remote work within the US for exceptional candidates
  • Compensation: Depending on background, skills, and experience, the expected annual salary range for this position is $200,000 - $400,000 USD + equity
  • Visa sponsorship: We sponsor visas on a case-by-case basis