Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Collaborate with an AI research lab to benchmark and improve AI model capabilities. Design prompts and evaluation sets for large language models (LLMs).
Responsibilities
Design and review consulting-style prompts, structured answers, and evaluation criteria.
Benchmark AI-generated responses against consulting frameworks and real-world standards.
Provide structured feedback on logic, clarity, and business rigor.
Conduct online research and synthesize insights from diverse sources to support evaluation.
Collaborate with AI research teams to refine model outputs and training data.
Requirements / Qualifications
Must-Have Qualifications
2+ years of experience at McKinsey, Bain, BCG, or a similarly competitive consulting firm
Strong online research and analytical skills
Ability to synthesize insights from diverse sources and data sets
Excellent written communication and attention to detail
Engagement Details
Compensation: $90–$110 USD/hr
Bonus: Weekly incentives ranging from $20 to $100/hr
Contract Type: Independent contractor
Payment: Daily via Stripe Connect
Schedule: Fully remote and asynchronous
Application Process (takes 20–30 minutes to complete)
Upload resume
AI interview: A short, 15-minute conversational session to understand your background, experience, and interest in the role