Demo

Data Operations Engineer

Abaka AI
Mountain View, CA Full Time
POSTED ON 6/12/2026
AVAILABLE BEFORE 7/18/2026
About Abaka AI

Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Generative AI, Embodied AI, and Automotive AI rely on us to power their data pipelines. With our headquarters in Silicon Valley—and teams in Paris, Singapore, and Tokyo—we support global partners with fast, reliable, and scalable data solutions.

Our offerings include a diverse catalog of off-the-shelf datasets (image, video, multimodal, reasoning, 3D, and beyond) as well as comprehensive data collection and annotation services. Whether teams need raw data, curated datasets, or full-cycle data engineering, Abaka AI provides the foundation for building high-performance AI systems.

About The Role

We are hiring a Data Operations Engineer to own and operate Abaka AI’s internal dataset library. This role will serve as the central point of knowledge for all datasets across the company, working closely with engineering, product, and business teams to ensure fast, accurate, and scalable access to data.

You will develop a deep understanding of our dataset inventory, including structure, quality, and use cases, and act as the primary point of contact for internal data-related questions. You will translate ambiguous requests into clear solutions, validate dataset quality, and coordinate across global teams to resolve issues efficiently.

This role is highly cross-functional and requires strong problem-solving ability, technical fluency, and a high level of ownership. You will play a critical role in improving how datasets are organized, accessed, and utilized across the company.

Responsibilities

  • Develop and maintain a comprehensive understanding of Abaka AI’s dataset library, including data structure, quality, and applicable use cases across modalities (text, image, video, audio, 3D).
  • Serve as the internal point of contact for dataset-related inquiries, providing clear and timely responses to questions from engineering, product, and business teams.
  • Translate ambiguous or high-level requests into concrete dataset solutions, identifying appropriate data sources or gaps.
  • Inspect and validate datasets for quality, completeness, and consistency using SQL, Python, or other tools as needed.
  • Coordinate with global data teams, including teams in China, to resolve data issues, clarify requirements, and ensure timely delivery without unnecessary escalation.
  • Maintain and improve internal documentation, organization, and accessibility of datasets.
  • Identify inefficiencies in current workflows and propose improvements to systems, tooling, and processes that support dataset management and usage.
  • Support cross-functional initiatives by providing dataset insights, technical context, and operational guidance.

Qualifications

  • Bachelor’s degree in Computer Science, Data Engineering, or a related field, or equivalent practical experience.
  • 1–4 years of experience in data operations, data engineering, or a related role involving direct interaction with datasets.
  • Professional proficiency in Mandarin Chinese and English is required, as this role involves frequent collaboration with China-based vendors and external partners
  • Strong problem-solving skills and ability to operate effectively in ambiguous, fast-paced environments.
  • Proficiency in SQL and/or Python for data inspection, validation, and basic analysis.
  • Experience working with real-world datasets, including handling data quality issues, inconsistencies, and edge cases.
  • Strong communication skills, with the ability to work across technical and non-technical teams.
  • High level of ownership and accountability, with the ability to manage multiple requests and priorities simultaneously.

Preferred Qualifications

  • Experience with multimodal datasets (text, image, video, audio, or 3D).
  • Familiarity with data annotation, labeling workflows, or dataset preparation for machine learning.
  • Experience working with international teams, particularly in cross-border environments.
  • Exposure to AI/ML workflows, including training, fine-tuning, or evaluation datasets.

Compensation & Benefits

The base salary range for this position is $110,000 - $160,000 USD annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work at Abaka AI. This role is eligible for equity, as well as a comprehensive benefits package (health, dental, vision, PTO, flexible work schedule).

Salary : $110,000 - $160,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Data Operations Engineer?

Sign up to receive alerts about other jobs on the Data Operations Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$103,228 - $139,671
Income Estimation: 
$116,726 - $151,072
Income Estimation: 
$124,724 - $161,246
Income Estimation: 
$71,122 - $96,652
Income Estimation: 
$92,929 - $122,443
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Abaka AI

  • Abaka AI Mountain View, CA
  • About Abaka AI Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Gene... more
  • 13 Days Ago

  • Abaka AI Mountain View, CA
  • About Abaka AI Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Gene... more
  • 13 Days Ago

  • Abaka AI Palo Alto, CA
  • About Abaka AI Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Gene... more
  • 1 Day Ago

  • Abaka AI Palo Alto, CA
  • About Abaka AI Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Gene... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Data Operations Engineer jobs in the Mountain View, CA area that may be a better fit.

  • Data Capital Incorporation Sunnyvale, CA
  • Senior Data Engineer (Spark Streaming & GCP) Job Summary: We are seeking a highly skilled Senior Data Engineer with strong expertise in Apache Spark, Strea... more
  • 2 Days Ago

  • VAST Data Campbell, CA
  • VAST Data is seeking a Finance Manager to join our Operations Finance team as a core contributor in our mission to move from a reactive state to a stable, ... more
  • 23 Days Ago

AI Assistant is available now!

Feel free to start your new journey!