Mercor

Data Scientist | Remote Work

Mercor, Work from Home

Job Description:

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Task Evaluation & Statistical Analysis Specialist

Type: Contract

Compensation: $100–$120/hour

Location: Remote

Role Responsibilities

  • Conduct comprehensive statistical failure analysis to identify patterns in AI agent failures across task components such as prompts, rubrics, and templates.
  • Perform root cause analysis to determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations.
  • Analyze performance variations across finance sub‑domains, file types, and task categories to enhance understanding of AI model performance.
  • Create dashboards and reports to highlight failure clusters, edge cases, and improvement opportunities.
  • Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings.
  • Present insights to data labeling experts and technical teams to foster collaboration and drive improvements.

Qualifications

Must‑Have

  • Statistical Expertise: Strong foundation in statistical analysis, hypothesis testing, and pattern recognition.
  • Programming: Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis.
  • Data Analysis: Experience with exploratory data analysis and creating actionable insights from complex datasets.
  • AI/ML Familiarity: Understanding of LLM evaluation methods and quality metrics.
  • Tools: Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL.

Preferred

  • Experience with AI/ML model evaluation or quality assurance.
  • Background in finance or willingness to learn finance domain concepts.
  • Experience with multi‑dimensional failure analysis.
  • Familiarity with benchmark datasets and evaluation frameworks.
  • 2–4 years of relevant experience.

Application Process (takes 20–30 minutes to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

  • For details about the interview process and platform information, please check:
  • For any help or support, reach out to:

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.


Job Information

  • Publication date: 10 Dec 2025
  • Location: Work from home
  • Type: Full-time
  • Work model: On-site
  • Category:
  • Experience: 2+ years
  • Employment status: Employee
