Bilingual Spanish Generalist Evaluator Expert

Mercor

We use AI to understand human ability and match talent with the opportunities they're best suited for

Remote

Chile

Mexico

Spain

Bilingual Spanish Generalist Evaluator Expert

Mercor is seeking native Spanish speakers from the United States, Spain, Chile, or Mexico with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Spanish/English prompt–golden answer pairs that train and evaluate advanced language models.

This role is strictly limited to candidates who are native to the United States, Spain, Chile, or Mexico and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context (including distinctions such as U.S. Spanish, Peninsular Spanish, Chilean Spanish, and Mexican Spanish conventions).

This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded Spanish text while maintaining technical precision in English.

Job Details

Multilingual Prompt Design & Optimization:

Create detailed prompts in Spanish and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Spanish-speaking users in the United States, Spain, Chile, and Mexico contexts.

Define and Document Evaluation Standards:

Establish high-level expectations for correct responses in United States, Spain, Chile, and Mexico consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.

Model Testing and Grading (Bilingual):

Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Spanish, comparing results against English where needed.

Benchmarking & Quality Assurance:

Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor—maintaining consistency and reliability across Spanish-language benchmarks before integration into official evaluations.

Minimum Qualifications

Native-level fluency in Spanish (written), specific to United States, Spain, Chile, or Mexico usage, with strong reading/writing ability in English.
Must be native to the United States, Spain, Chile, or Mexico and have lived in or spent significant time in-country, with deep cultural and linguistic familiarity.
BS or BA from a reputable institution (completed or in progress).
Strong writing and critical thinking skills.
Ability to work independently and meet deadlines.
Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
Based in the United States, Spain, Chile, or Mexico (or able to reliably produce country-specific, culturally accurate Spanish aligned with one of these regions).

Preferred Qualifications

Experience in teaching, research, editing, or academic writing.
Experience creating evaluation criteria, rubrics, or grading guidelines.
Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

Application & Onboarding Process

Complete an AI-led interview (about 15 minutes).
If approved, complete a paid assessment focused on writing and rubric creation
Then, if selected, you will be invited to work on the project.

More Details About This Role

Expect to contribute at least 20 hours per week.
Expect a commitment of approximately 2–4 months.
You’ll be working in a structured project environment with clear goals and tools.

We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Contract and Payment Terms

You will be engaged as an independent contractor.
This is a fully remote role that can be completed on your own schedule.
Projects can be extended, shortened, or concluded early depending on needs and performance.
Your work at Mercor will not involve access to confidential or proprietary information from any employer, client, or institution.
Payments are weekly on Stripe or Wise based on services rendered.
Please note: We are unable to support H1-B or STEM OPT candidates at this time.

About Mercor

Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will work on projects that focus on training and enhancing AI systems. You will be paid competitively, collaborate with leading researchers, and help shape the next generation of AI systems in your area of expertise.

Apply for this job

Please let Mercor know you found this position on NoDesk as a way to support us so we can keep providing you with quality remote jobs.

Bilingual Spanish Generalist Evaluator Expert

Mercor

We use AI to understand human ability and match talent with the opportunities they're best suited for

Bilingual Spanish Generalist Evaluator Expert

Job Details

Minimum Qualifications

Preferred Qualifications

Application & Onboarding Process

More Details About This Role

Contract and Payment Terms

About Mercor

About Mercor

Company profile

View more jobs 6

People also viewed

AI Vibe Coding Web Designer

B12

Bilingual German Generalist Evaluator Expert

Mercor

Bilingual French Generalist Evaluator Expert

Mercor

Search Generalist

Mercor

Scout Search Quality Rater - English (United States)

Welocalize

Survey - LifePoints (US)

InnovateMR

Business Operations Manager

Eight Sleep

Generalist Expert

Mercor

WFH Entry-Level Hotel Coordinator

Caribbean and Cruise Experience

Digital Marketing & Client Engagement Partner (Remote)

Following Your Heart

Remote Work Starts Here

Get the best new remote jobs and remote work stories straight to your inbox.