All roles

Remote Bilingual Italian Generalist Evaluator Expert

Remote · USA Full-time New today

Mercor is seeking native Italian speakers from Switzerland or Italy with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Italian / English prompt–golden answer pairs that train and evaluate advanced language models. Job Details

  • Multilingual Prompt Design & Optimization: Create detailed prompts in Italian and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Italian-speaking users in Switzerland and Italy contexts.
  • Define and Document Evaluation Standards: Establish high-level expectations for correct responses in Switzerland and Italy consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.
  • Model Testing and Grading (Bilingual): Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Italian, comparing results against English where needed.
  • Benchmarking & Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor—maintaining consistency and reliability across Italian-language benchmarks before integration into official evaluations.

Minimum Qualifications

  • Native-level fluency in Italian (written), specific to Switzerland or Italy usage, with strong reading/writing ability in English.
  • Must be native to Switzerland or Italy and have lived in or spent significant time in-country, with deep cultural and linguistic familiarity.
  • BS or BA from a reputable institution (completed or in progress).
  • Strong writing and critical thinking skills.
  • Ability to work independently and meet deadlines.
  • Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
  • Based in Switzerland or Italy (or able to reliably produce Switzerland- or Italy-specific, culturally accurate Italian).

Preferred Qualifications

  • Experience in teaching, research, editing, or academic writing.
  • Experience creating evaluation criteria, rubrics, or grading guidelines.
  • Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

Application & Onboarding Process

  • Complete an AI-led interview (about 15 minutes).
  • If approved, complete a paid assessment focused on writing and rubric creation.
  • Then, if selected, you will be invited to work on the project.

More Details About This Role

  • Expect to contribute at least 20 hours per week.
  • Expect a commitment of approximately 2–4 months.
  • You’ll be working in a structured project environment with clear goals and tools.
  • We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Apply tot his job Apply To this Job

Related roles

Bilingual Insurance Evaluator Arabic/English

Remote · USA Full-time

STEM Master’s/Ph.D. Research Report Evaluator

Remote · USA Full-time

ARMENIAN (WESTERN) TESTING EVALUATOR

Remote · USA Full-time

[Remote] Technology Product Owner – Entry Filing for T01/T11

Remote · USA Full-time

CRM Product Owner

Remote · USA Full-time

Product Manager | 2 openings | MN or Telecommute

Remote · USA Full-time

Sr. Technical Product Manager- Microsoft

Remote · USA Full-time

Sr. Technical Product Owner – SAP Order to Cash & Commerce Cloud – Strategy & Ma

Remote · USA Full-time

Senior Full Stack Developer, Product Owner – Real-Time Intelligent Communication Systems

Remote · USA Full-time

Associate Product Owner - Provider Services (Open to hiring at the Product Owner level)

Remote · USA Full-time

Experienced Customer Chat Support Specialist – Remote Opportunity for Career Growth and Development

Remote · USA Full-time

Remote Sr Manager, eCommerce Experience & Content, Kay

Remote · USA Full-time

Senior Software Engineer, Applied AI Services

Remote · USA Full-time

Experienced Full Stack Customer Service Representative – Remote Support for arenaflex Clients

Remote · USA Full-time

Experienced Live Chat Support Specialist – Delivering Exceptional Customer Experiences at arenaflex

Remote · USA Full-time

Sr. Pharma Direct Sales Director

Remote · USA Full-time

Sr. Data Scientist, Payments

Remote · USA Full-time

Experienced Part-Time Remote Data Entry Specialist – Supporting arenaflex's Operations with Precision and Efficiency

Remote · USA Full-time

Manager, Virtual Advice - Calgary, AB

Remote · USA Full-time

Senior Software Engineer, Guest & Host (Partner Integrations)

Remote · USA Full-time