Role Description
We're looking for an Applied AI Engineer to help us build and ship AI-powered features that directly improve our product experience and business outcomes. This is a hands-on, product-focused role where you'll take ideas from concept to production β designing intelligent systems, validating them with real users, and turning them into reliable, scalable services.
You'll work at the intersection of AI, product, and engineering β partnering closely with cross-functional teams to identify high-impact opportunities, prototype quickly, and iterate based on data. This isn't a research-only role. You'll own the full lifecycle: experimentation, evaluation, deployment, monitoring, and continuous improvement.
The ideal candidate is excited about applying LLMs and modern ML tooling to real-world problems. You think in terms of systems, tradeoffs, and outcomes β not just models. You care about performance, quality, latency, and cost in production. Most importantly, you're motivated by shipping impactful AI experiences that customers actually use.
What You'll Do
-
Build and ship AI agents that serve real users: tool-calling LLM systems with structured output, parallel API orchestration, and streaming responses.
-
Design evaluation harnesses and quality scoring β we use Langfuse, rubrics to measure safety, effectiveness, and personalization.
-
Own the full loop: prototype a new agent capability, validate it with evals, deploy it to staging and production, monitor traces, and iterate.
-
Improve reliability, latency, and cost through prompt caching strategies, token budgets, retry logic, and observability.
-
Write the tools agents use: API integrations with Pydantic validation, exercise search over local databases, structured workout submission.
Qualifications
-
Strong Python skills: you've built and deployed services on large production systems.
-
Experience with LangChain/LangGraph or similar agent frameworks.
-
Hands-on experience with LLMs in production: prompt engineering, tool/function calling, structured output, evaluation.
-
Comfort with async Python, HTTP APIs, and streaming protocols (SSE, webhooks).
-
Experience with data validation and schema design (Pydantic, JSON Schema).
-
Ability to debug across layers: from a broken LLM tool call to a misconfigured Terraform resource.
-
Clear communication: you'll work directly with product, mobile, and backend engineers.
Nice to Have
-
Familiarity with AWS (Bedrock, ECR, CloudFront, S3, Cognito) or other cloud agent hosting.
-
Observability and tracing tools (Langfuse, OpenTelemetry, Datadog).
-
Exposure to evaluation frameworks: LLM-as-a-judge, automated scoring, dataset management.
-
Infrastructure-as-code (Terraform, CDK).
Compensation & Benefits
-
Base Salary:
$215,000β$250,000/year + equity. The salary range is set based on multiple considerations including business needs, market demands, talent availability, experience, and unique skills and attributes. The base pay range is subject to change and may be modified in the future.
-
Equity:
Equity participation offered alongside base compensation.
-
Health Coverage:
Comprehensive medical, vision, dental, and disability insurance plus tax savings accounts for all eligible employees.
-
Retirement:
401(k) plan with tax-advantaged savings options.
-
Remote-First:
Employment eligible to all employees located anywhere in the continental US. No travel required.
-
Wellness & Development:
Monthly health and fitness stipend contributing to overall wellbeing, access to a mental health platform, reimbursement for medical travel, and an annual learning & development stipend.
-
Flexible Time Off:
Flexible PTO so you can rest, recharge, and take care of life outside of work.
-
Future Membership:
Enjoy our platform for free!