All jobs

QA Engineer (AI Applications) (Remote)

100% Remote Full-time Open now

• Job Title: QA Engineer (AI Applications) (Remote)

  • Location

: Remote (United States, Canada, United Kingdom, Australia)

  • Work Mode:

Fully Remote Role Overview Help design and evaluate autonomous AI agents across multiple LLMs, spanning health, education, daily life, and other real-world domains (all coding work). Shape the future of agentic AI systems by providing expert human feedback to leading AI organisations. Help train Large Language Models (LLMs) for complex, multi-step architectural workflows.

Key Responsibilities

AI Agent Evaluation

  • Write evaluation rubrics with objective pass/fail criteria
  • Debug agent traces to identify failure patterns
  • Stress test agents against edge cases, prompt injection, and tool misuse

Technical Assessment

  • Assess production-grade modular software architecture
  • Analyse multi-turn system interactions and behaviours
  • Provide high-density technical feedback for LLM training

Project Workflow

  • Create an account and upload a resume/ID
  • Complete the onboarding assessment
  • Start earning through flexible task assignments

Qualifications

  • Experience in backend engineering, AI automation, or complex systems integration
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting)
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions

Preferred (Nice to Have)

  • Experience integrating agents with live tools such as Supabase, Gmail, and other APIs
  • Familiarity with persistent state and session-tracking patterns
  • Experience identifying privacy leaks, authority escalation, or indirect prompt injection vulnerabilities

Compensation

  • Hourly compensation ranges from USD $30–$50, depending on experience and task complexity
  • Payments are issued weekly via supported payout platforms (e.g., PayPal or AirTM)
  • Full compensation details are provided prior to task acceptance

Equal Opportunity Statement Selection decisions are based solely on skills, qualifications, and project requirements. We are committed to inclusive and fair engagement practices and consider all qualified applicants without regard to legally protected characteristics. Apply Now! Apply tot his job Apply To this Job

You might also like

Prin Supplier QA Engineer (Remote/Southern California)

100% Remote Full-time

QA Engineer, Platform and Ops Tooling

100% Remote Full-time

Senior Threat Intelligence Specialist (Supply Chain & Geopolitical Security)

100% Remote Full-time

Cyber Threat Intelligence Analyst – SkillBridge Internship

100% Remote Full-time

Windows QA Engineer (IT Systems & Endpoint Management) - Remote

100% Remote Full-time

Software Engineer in Test II (Remote)

100% Remote Full-time

Lead QA Engineer - NJ or Chicago 100% Remote

100% Remote Full-time

Sr. Cyber Threat Intelligence Analyst - Security Operations

100% Remote Full-time

Sr. Threat Intel Analyst (Remote)

100% Remote Full-time

Senior Threat Intelligence Analyst, Crypto

100% Remote Full-time

Senior Software Engineer (Data Systems, Python)

100% Remote Full-time

Experienced Remote Customer Service Representative – Automotive Industry Expertise

100% Remote Full-time

Remote Customer Service Executive (Night Shift) at arenaflex

100% Remote Full-time

Spanish-English Interpreter (US Resident)

100% Remote Full-time

Remote Online Chat Support Agent – Entry‑Level Customer Service & Financial Solutions Specialist

100% Remote Full-time

Remote Administrative Support Specialist – Flexible Focus Group & Consumer Research Participant

100% Remote Full-time

Junior Software Engineer, Full-Stack

100% Remote Full-time

Software Engineer

100% Remote Full-time

Experienced Full Stack Customer Support Specialist – Remote Apple Home Advisor

100% Remote Full-time

[Remote] Staff Software Engineer | Semantic Data Team

100% Remote Full-time