All jobs

Copy of AI Code Reviewer & Systems Evaluation Engineer 1

100% Remote Full-time Open now

An enterprise client is currently seeking experienced software engineers to contribute to improving advanced AI systems through human feedback. This work supports leading AI organizations in training large language models to better understand software development practices, debugging, and code quality. This is part of a cutting-edge initiative focused on enhancing how AI systems write, review, and optimize code in real-world scenarios. You’ll play a key role in shaping how AI models evaluate performance, detect issues, and generate reliable outputs.

Job Description

This opportunity is ideal for engineers who enjoy analyzing systems, improving code quality, and working on complex technical challenges. You will contribute to AI training projects by evaluating outputs, refining logic, and identifying potential vulnerabilities. What You'll Do:

  • Develop objective, verifiable evaluation criteria (rubrics) for system performance
  • Review system logs and execution paths to improve reliability and code quality
  • Refactor code and optimize system behavior toward ideal outcomes
  • Test systems for vulnerabilities, including data exposure and edge-case failures
  • Provide detailed, high-quality feedback on system performance and outputs

Qualifications

Requirements:

  • 2+ years of experience in backend engineering, AI automation, or systems integration
  • Strong proficiency in at least two programming languages (e.g., Python, JavaScript, Go, Java)
  • Experience working with SQL databases
  • Proven ability to build and maintain production-grade systems
  • Experience working in live (non-mocked) environments with multi-step interactions
  • Strong analytical skills and attention to detail

Nice to Haves:

  • Experience with multi-stage system workflows and coordination tasks
  • Familiarity with integrating tools such as APIs, databases, or external platforms
  • Understanding of system vulnerabilities (e.g., privacy leaks, prompt injection, access escalation)
  • Experience working with AI systems or agent-based workflows
  • Comfort working with persistent state tracking or similar frameworks

Additional Information

  • Fully remote and flexible work schedule
  • Project-based engagement with no guaranteed hours
  • Work on tasks based on availability and project assignment
  • Payment is based on completed tasks only
  • Must accept project invitations before beginning work
  • Freelancers may accept or decline tasks depending on availability
  • No guaranteed workload; volume may vary weekly

Apply To This Job

You might also like

Principal AI Engineer - Servicing Solutions - (Remote - USà

100% Remote Full-time

[Remote] Machine Learning Engineer- Remote

100% Remote Full-time

Lead Machine Learning Engineer – AI/ML (Remote Work Option)

100% Remote Full-time

Immediate Hiring: (Remote) Machine Learning Engineer Entry Level

100% Remote Full-time

Lead Machine Learning Engineer - LMTS

100% Remote Full-time

AI / Machine Learning Engineer Leaders and Decision-Makers for Workflow Management Tools - USA

100% Remote Full-time

[Remote] Machine Learning Engineer, Agentic AI

100% Remote Full-time

ML Engineer - Malvern, PA

100% Remote Full-time

Machine Learning Engineer - Remote - USA

100% Remote Full-time

[Remote] AI/Prompt Engineer – Intern/Entry Level

100% Remote Full-time

Delivery driver | Gulfport, FL

100% Remote Full-time

Senior Full Stack Software Engineer for CRM Customer Acquisitions and Digital Transformation - Remote Opportunity

100% Remote Full-time

Experienced Customer Service Representative – Remote Opportunity for Career Growth and Flexibility

100% Remote Full-time

Experienced Remote Data Entry Specialist – Flexible Work Arrangement at arenaflex

100% Remote Full-time

Join Today: Virtual Medical Assistant & Receptionist US ONLY

100% Remote Full-time

Customer Services Contractor

100% Remote Full-time

Nurse Practitioner (PRN) - In-Home Health Assessments

100% Remote Full-time

[Remote] Senior Manager, Clinical Trial Study Start Up

100% Remote Full-time

LQA Game Tester (Ukrainian) – Freelance Remote

100% Remote Full-time

Walmart Data Entry Jobs Work From Home Job – Entry Level

100% Remote Full-time