What are the responsibilities and job description for the Research Intern (LLM) position at abakaai?

Our recent related work: SuperGPQA (NeurIPS '25) and ACADREASON

Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers.
Evaluate large language models on reasoning, factuality, and problem-solving benchmarks.
Develop review pipelines and quality-control criteria for expert-level question generation.
Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers.
Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases.

Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems.
1 year of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass).
Excellent written and verbal English skills and strong analytical reasoning.
Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes.
(Preferred) Experience with formal methods, chain-of-thought evaluation, or curriculum generation.
(Preferred) Relevant publications in top conferences.

