Mercor

Data Scientist | Work-from-Home

Mercor Berlin

Job description:

About the job

Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Task Evaluation & Statistical Analysis Specialist
Type: Contract
Compensation: $100–$120/hour
Location: Remote

Role responsibilities

  • Conduct comprehensive statistical failure analysis to identify patterns in AI agent failures across task components such as prompts, rubrics, and templates.
  • Perform root cause analysis to determine whether failures are due to task design, rubric clarity, file complexity, or agent limitations.
  • Analyze performance variations across finance sub-domains, file types, and task categories to enhance understanding of AI model performance.
  • Create dashboards and reports to highlight failure clusters, edge cases, and improvement opportunities.
  • Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings.
  • Present insights to data labeling experts and technical teams to foster collaboration and drive improvements.

Qualifications

Must-have

  • Statistical expertise: Strong foundation in statistical analysis, hypothesis testing, and pattern recognition.
  • Programming: Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis.
  • Data analysis: Experience with exploratory data analysis and creating actionable insights from complex datasets.
  • AI/ML familiarity: Understanding of LLM evaluation methods and quality metrics.
  • Tools: Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL.

Preferred

  • Experience with AI/ML model evaluation or quality assurance.
  • Background in finance or willingness to learn finance domain concepts.
  • Experience with multi-dimensional failure analysis.
  • Familiarity with benchmark datasets and evaluation frameworks.
  • 2–4 years of relevant experience.

Application process (takes 20–30 minutes to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

NOTE:
Please mention Fuchsjobs as the source of your application.

Job information

  • Publication date: 01 Dec 2025
  • Location: Berlin
  • Type: Part-time
  • Work model: Remote
  • Category: Development & IT
  • Experience: Experienced
  • Employment relationship: Employed
