Mercor

Bilingual LLM Evaluation Analyst

Job Description:

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position

AI Model Evaluator

Type

Full-time or Part-time Contract Work

Compensation

$40/hour

Location

Geography restricted to Europe, USA

Role Responsibilities

  • Evaluate LLM-generated responses on their ability to effectively answer user queries.
  • Conduct fact-checking using trusted public sources and external tools.
  • Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
  • Assess reasoning quality, clarity, tone, and completeness of responses.
  • Ensure model responses align with expected conversational behavior and system guidelines.
  • Apply consistent annotations by following clear taxonomies, benchmarks, and detailed evaluation guidelines.

Qualifications

Must-Have

  • Bachelor’s degree
  • Native speaker or ILR 5/primary fluency (C2 on the CEFR scale) in German
  • Significant experience using large language models (LLMs)
  • Excellent writing skills
  • Strong attention to detail
  • Adaptable and comfortable moving across topics, domains, and customer requirements
  • Background or experience in domains requiring structured analytical thinking
  • Excellent college-level mathematics skills

Preferred

  • Prior experience with RLHF, model evaluation, or data annotation work
  • Experience writing or editing high-quality written content
  • Experience comparing multiple outputs and making fine-grained qualitative judgments
  • Familiarity with evaluation rubrics, benchmarks, or quality scoring systems

Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

  • For details about the interview process and platform information, please check:
  • For any help or support, reach out to:

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.


Job Information

  • Publication date: 24 Mar 2026
  • Location: Work from home
    Place of work: Los Angeles
  • Type: Full-time
  • Work model: On-site
  • Category:
  • Experience: 2+ years
  • Employment relationship: Employee
