All roles

Data Annotation Specialist | $22/hr PT

Remote · USA Full-time New today

• *About The Job

  • *Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

  • *Benchmark**

,

  • *General Catalyst**

,

  • *Peter Thiel**

,

  • *Adam D'Angelo**

,

  • *Larry Summers**

, and

  • *Jack Dorsey**

.

  • *Position:**

Language Model Evaluator

  • *Type:
  • *Full-time or Part-time Contract Work
  • Compensation:
  • $23/hour
  • *Location:
  • *Geography restricted to Egypt, Saudi Arabia, UAE, USA
  • *Role Responsibilities
  • Evaluate LLM-generated responses on their ability to effectively answer user queries.
  • Conduct fact-checking using trusted public sources and external tools.
  • Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
  • Assess reasoning quality, clarity, tone, and completeness of responses.
  • Ensure model responses align with expected conversational behavior and system guidelines.
  • Apply consistent annotations by following clear taxonomies, benchmarks, and detailed evaluation guidelines.
  • *Qualifications
  • *Must-Have
  • Bachelor’s degree
  • Native speaker or ILR 5/primary fluency (C2 on the CEFR scale) in Arabic
  • Significant experience using large language models (LLMs)
  • Excellent writing skills
  • Strong attention to detail
  • Adaptable and comfortable moving across topics, domains, and customer requirements
  • Background or experience in domains requiring structured analytical thinking
  • Excellent college-level mathematics skills
  • *Preferred
  • Prior experience with RLHF, model evaluation, or data annotation work
  • Experience writing or editing high-quality written content
  • Experience comparing multiple outputs and making fine-grained qualitative judgments
  • Familiarity with evaluation rubrics, benchmarks, or quality scoring systems
  • *Application Process (Takes 20–30 mins to complete)
  • Upload resume
  • AI interview based on your resume
  • Submit form
  • Resources & Support
  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]
  • PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Apply tot his job Apply To this Job

Related roles

Software Developer - AI Trainer

Remote · USA Full-time

[Remote] Data Annotation Specialist

Remote · USA Full-time

Data Processing AI Analyst - Geospatial exp

Remote · USA Full-time

AI Training Data Labeler – Work from Home, No Coding Needed

Remote · USA Full-time

[Remote] AI Trainer – Visual & Graphic Design Expert (Remote) - Los Angeles

Remote · USA Full-time

Native English Voice Actors or Linguists for AI Training

Remote · USA Full-time

Exciting Remote Sales Opportunity! FinTech & Construction Tech - NC, AZ, UT

Remote · USA Full-time

Senior Payments & Fintech Manager

Remote · USA Full-time

AI Trainer â?? GNU Image Manipulation Program Users (Remote)

Remote · USA Full-time

Sr. Software Engineer

Remote · USA Full-time

Senior Software Test Engineer - II

Remote · USA Full-time

Experienced Customer Service Representative - Remote Opportunity with arenaflex

Remote · USA Full-time

Translation Validator | Malayalam

Remote · USA Full-time

Experienced Full Stack Program Manager – Product Innovation & Development

Remote · USA Full-time

Experienced Full Stack Software Engineer – Home Automation and Security Solutions at arenaflex

Remote · USA Full-time

Experienced Remote Live Chat Representative – Customer Service Expert for arenaflex

Remote · USA Full-time

IAM Engineer

Remote · USA Full-time

Regulatory Affairs SME

Remote · USA Full-time

Klaviyo Email Marketer Needed for Full Funnel Lifecycle Flows (Design + Build + Rollout)

Remote · USA Full-time

Utilization Management Nurse Consultant - Medical Review (Remote)

Remote · USA Full-time