All roles

Data Annotation Specialist | $22/hr PT

Remote · USA Full-time New today

• About The Job

  • Mercor
  • connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
  • Benchmark**

• ,

  • General Catalyst**

• ,

  • Peter Thiel**

• ,

  • Adam D'Angelo**

• ,

  • Larry Summers**

• , and

  • Jack Dorsey**

• .

  • Position:**

• Language Model Evaluator

  • Type:
  • Full-time or Part-time Contract Work
  • Compensation:
  • $23/hour
  • Location:
  • Geography restricted to Egypt, Saudi Arabia, UAE, USA
  • Role Responsibilities
  • Evaluate LLM-generated responses on their ability to effectively answer user queries.
  • Conduct fact-checking using trusted public sources and external tools.
  • Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
  • Assess reasoning quality, clarity, tone, and completeness of responses.
  • Ensure model responses align with expected conversational behavior and system guidelines.
  • Apply consistent annotations by following clear taxonomies, benchmarks, and detailed evaluation guidelines.
  • Qualifications
  • Must-Have
  • Bachelor’s degree
  • Native speaker or ILR 5/primary fluency (C2 on the CEFR scale) in Arabic
  • Significant experience using large language models (LLMs)
  • Excellent writing skills
  • Strong attention to detail
  • Adaptable and comfortable moving across topics, domains, and customer requirements
  • Background or experience in domains requiring structured analytical thinking
  • Excellent college-level mathematics skills
  • Preferred
  • Prior experience with RLHF, model evaluation, or data annotation work
  • Experience writing or editing high-quality written content
  • Experience comparing multiple outputs and making fine-grained qualitative judgments
  • Familiarity with evaluation rubrics, benchmarks, or quality scoring systems
  • Application Process (Takes 20–30 mins to complete)
  • Upload resume
  • AI interview based on your resume
  • Submit form
  • Resources & Support
  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]
  • PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Apply tot his job Apply To this Job

Related roles

Product Manager, Fintech

Remote · USA Full-time

Remote Coding Expertise for AI Training

Remote · USA Full-time

Investment Banker - AI Trainer

Remote · USA Full-time

Enterprise Sales Executive – Philanthropic Fintech

Remote · USA Full-time

Content Reviewer - AI Trainer

Remote · USA Full-time

AI Trainer - Advanced Mathematicians US (PST)

Remote · USA Full-time

Digital Content Editor - AI Trainer

Remote · USA Full-time

VP Fintech Sales Executive (Remote)

Remote · USA Full-time

AI Training Scenario Designer

Remote · USA Full-time

Remote Visual Evaluation Specialist AI Training

Remote · USA Full-time

eBilling Analyst - Remote (Legal Services)

Remote · USA Full-time

Experienced Data Entry Specialist – Remote Opportunity at arenaflex

Remote · USA Full-time

Experienced Entry Level Data Entry Specialist – Web & Cloud Application Development at arenaflex

Remote · USA Full-time

Customer Service & Benefits Specialist – Remote / Work From Home Opportunity at arenaflex

Remote · USA Full-time

Bilingual Nurse Case Manager

Remote · USA Full-time

Lead Machine Learning Engineer - Merchandising AI (ML Ops)

Remote · USA Full-time

Software Engineer, iOS Core Product - Fresno, CA, USA

Remote · USA Full-time

Experienced Customer Care Specialist - Onsite Training and Remote Work Opportunity at arenaflex

Remote · USA Full-time

Experienced Data Entry Clerk – Entry Level Remote Position at arenaflex

Remote · USA Full-time

Virtual Customer Service Representative – Remote Customer Support Specialist (Work From Home)

Remote · USA Full-time