We're looking for a Freelance Agent Evaluation Engineer to evaluate AI coding agents and design challenging tasks for them to solve. This is a part-time, non-permanent project that requires experience in software development, testing, and full-stack work.
Requirements
- Degree in Computer Science, Software Engineering, or related fields
- 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations)
- Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems
- Experience writing functional and integration tests, not just running them
- Experience with Docker containers and familiarity with infrastructure tools (Postgres, Kafka, Redis)
- Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results)
- English proficiency at B2 level
Benefits
- Opportunity to work on AI-related projects
- Earnings of up to the equivalent of $30 per hour
- Flexible schedule and part-time work
Originally posted on Himalayas