Role Description
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
What this opportunity involves:
-
Create challenging coding test cases that push AI coding systems to their limits.
-
Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources.
-
Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks.
-
Craft βfair but hardβ challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required).
-
Analyze AI failures to understand what the model struggles with vs. what it masters.
-
Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria.
Qualifications
-
Degree in Computer Science, Software Engineering or related fields.
-
5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations).
-
Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems.
-
Experience writing tests (functional, integration β not just running them).
-
Docker containers (running evaluations locally in containers).
-
CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results).
-
English proficiency - B2.
Requirements
-
This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects.
Benefits
-
On this project, contributors can earn up to $50 per hour equivalent, depending on their level and pace of contribution.
-
Compensation varies across projects depending on scope, complexity, and required expertise.
-
Please note that other projects on the platform may offer different earning levels based on their requirements.
How it works
-
Apply β Pass qualification(s) β Join a project β Complete tasks β Get paid.
Effort estimate
-
Tasks for this project are estimated to take 20 hours to complete, depending on complexity.
-
This is an estimate and not a schedule requirement; you choose when and how to work.
-
Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.