Generalist Evaluator Expert

Remote Full-time
Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### **Job Details:**

- **Design and Optimize Prompts**: Create detailed prompts with multiple constraints and instructions.
- **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubrics.
- **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations.
- **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability before integration into official benchmarks.

### **Minimum Qualifications:**

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making or hobbies / general interests.
- US or Canada based.

### **Preferred Qualifications:**

- Experience in teaching or research.

### **Application & Onboarding Process:**

- Complete an AI-led interview, which should take around 15 minutes.
- Complete a 45-minute written assessment that will guide you through writing rubrics.
- If selected, you will be invited to work on the project.

### **More Details About This Role:**

- This is a **remote and asynchronous** role: work on your own schedule.
- Expect to contribute at least **20 hours per week**.
- Expect a commitment of around 1 month.
- You’ll be working in a structured project environment with clear goals and tools.

* * *

### **About Mercor**

- Our team is based in San Francisco, CA.
- We specialize in recruiting experts for top AI labs.
- Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey.
