
[Remote] Research Intern (LLM)

Remote, USA | Full-time | Posted 2025-11-24

Note: This is a remote position open to candidates in the USA. The 2077AI Open Source Foundation is looking for a Research & Evaluation Intern to help build advanced QA datasets and evaluate large language models. This role is ideal for students passionate about LLMs, evaluation science, and the intersection of research and applied data work.


Responsibilities

  • Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers
  • Evaluate large language models on reasoning, factuality, and problem-solving benchmarks
  • Develop review pipelines and quality-control criteria for expert-level question generation
  • Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers
  • Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases

Skills

  • Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems
  • 1+ years of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass)
  • Excellent written and verbal English skills and analytical reasoning
  • Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes
  • Experience with formal methods, chain-of-thought evaluation, or curriculum generation
  • Relevant publications in top conferences

Company Overview

  • The 2077AI Foundation is at the forefront of AI data standardization and progression. It is headquartered in Singapore, with a workforce of 51-200 employees. Its website is https://www.2077ai.com/.

