Role Description
We are seeking bilingual English–Hebrew Generalists to support the evaluation and improvement of conversational AI systems. In this role, you will review AI-generated responses, assess their accuracy and clarity, and provide structured feedback to enhance overall model quality and user experience.
-
Review and evaluate AI-generated responses in Hebrew and English.
-
Check outputs for factual accuracy, clarity, and relevance.
-
Provide structured, high-quality written feedback on response quality.
-
Identify and flag reasoning errors, inconsistencies, or misleading information.
-
Follow defined evaluation guidelines to ensure consistency and reliability in assessments.
Qualifications
-
Bachelor’s degree (completed or currently pursuing).
-
Native or fluent proficiency in Hebrew and English.
-
Strong written communication skills.
-
High attention to detail and analytical thinking.
-
Ability to review and evaluate content across a variety of topics.
Requirements
-
Experience using AI tools or large language models (LLMs) - Nice to Have.
-
Background in content review, analysis, or quality evaluation - Nice to Have.
Benefits
-
Contribute directly to improving AI systems used by millions of users.
-
Gain experience in AI evaluation and human-in-the-loop workflows.
-
Flexible, remote contract role with self-paced working hours.
Contract & Payment Terms
-
Engagement on an independent contractor basis.
-
Fully remote with flexible scheduling.
-
Project duration may vary based on performance and business needs.
-
Work involves only publicly available information, with no access to confidential data.
-
Payments processed weekly via Stripe or Wise based on completed work.
-
Candidates requiring H1-B or STEM OPT sponsorship are not eligible.
Core Skills
-
LLM Evaluation
-
Content Review
-
Bilingual Communication (English & Hebrew)