Salary: unspecified
Remote Location: Worldwide
Employment Type: contract
Posted: 1 month ago
1 month ago - LILT (Production) is hiring a remote AI Benchmark Engineer | Native Language Specialist. Salary: unspecified. Location: Worldwide.
Role Description
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encoding edge cases in terminal workflows.
We are seeking experienced native-speaking software engineers to design, build, and validate these benchmarks. You will create high-signal, high-quality tasks that genuinely test a model's ability to handle multilingual environments without relying on English translation crutches.
Note that this is a remote, freelance opportunity.
What You'll Deliver
Qualifications
Benefits
How to join our expert community
Be aware of the location restriction for this remote position: Worldwide