All roles

Professional Evaluator - Fully Remote | Upto $35/hr Hourly

Remote · USA Full-time New today

About The Job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey. Position AI Model Evaluation Contractor Type Contract Compensation $25–$35/hour Commitment 20 hours/week Role Responsibilities

  • Write realistic prompts reflecting professional and consumer domain-specific guidance.
  • Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness.
  • Identify fabricated claims, incorrect references, or misleading reasoning in model outputs.
  • Score and rank multiple model responses using structured rubrics across dimensions.
  • Provide written justifications with specific evidence for each evaluation.

Qualifications

Must-Have

  • Professional experience applying domain expertise in a practitioner or advisory capacity.
  • Familiarity with industry-specific standards, regulations, or clinical guidelines.
  • Strong written communication and critical reasoning skills. Application Process (Takes 20–30 mins to complete)
  • Submit your resume to begin.
  • Complete the Model Response Evaluation assessment. Resources & Support
  • For details about the interview process and platform information, please check https//talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to [email protected] PS Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. Apply tot his job Apply To this Job

Apply To This Job

Related roles

Hospitality Evaluator - Fully Remote | Upto $120/hr

Remote · USA Full-time

Provider Network Evaluator I-HCBS (Full-time Remote, North Carolina Based)

Remote · USA Full-time

Casualty Specialist, Evaluator (Remote)

Remote · USA Full-time

MPW Evaluator PRN position

Remote · USA Full-time

Lead Evaluator (Part Time)

Remote · USA Full-time

Educational Technology AI Rater & Evaluator

Remote · USA Full-time

Senior Product Owner Required ( USA only ) Remote - Contract to Hire

Remote · USA Full-time

Remote Product Owner

Remote · USA Full-time

Product Owner, Growth

Remote · USA Full-time

Project Manager - Group Health - REMOTE

Remote · USA Full-time

Sales Engineer

Remote · USA Full-time

Experienced Remote Data Entry Specialist – No Experience Needed – arenaflex

Remote · USA Full-time

Experienced Customer Support Professional – Delivering Exceptional Remote Service Experience

Remote · USA Full-time

Experienced Quality Control Data Entry Specialist – Healthcare and Clinical Research

Remote · USA Full-time

Experienced Nurse Practitioner/Physician Assistant – Comprehensive Primary Care & Retail Health Clinic Services

Remote · USA Full-time

Experienced Customer Care Representative – Remote Customer Service – arenaflex Pharmacy – Work From Home $16-$35/hr

Remote · USA Full-time

Experienced Phone and Chat Specialist with Bonus Opportunity at arenaflex

Remote · USA Full-time

Creative Producer - Questing (Project-Based Role)

Remote · USA Full-time

Hadoop Developer

Remote · USA Full-time

[Remote] Principal Product Manager – Risk & Compliance

Remote · USA Full-time