Demo

Senior AI Data Engineer

OP
Menlo Park, CA Contractor
POSTED ON 6/3/2026
AVAILABLE BEFORE 11/29/2026
If you're passionate about pushing the boundaries of data engineering and machine learning, we'd love to hear from you. We are on the hunt for a highly skilled Senior AI Data Engineer to operate at the forefront of Data Engineering and Machine Learning Systems. In this critical role, you will craft and oversee sophisticated end-to-end data pipelines that do more than just move and transform data; they enrich it through remote model inference, handling complex asynchronous processes, capacity management, retry and fallback strategies, and throughput optimization.

This isn t your typical ETL role. It requires hands-on experience with distributed inference infrastructure and a deep understanding of system architecture. You will be instrumental in building solutions that power cutting-edge image generation models, evaluating and enhancing visual quality, prompt adherence, identity preservation, and naturalness, driving innovation in AI-driven content creation.

Responsibilities

  • AI-Augmented Data Pipelines: Design and maintain AI-augmented, large-scale data pipelines (billions of images) integrating traditional transformations with ML models (classifiers, embeddings, LLMs) for cleaning and annotation.
  • Remote Inference Orchestration: Own the systems for remote ML model inference orchestration within pipelines, managing batching, retries, async jobs, and ensuring graceful degradation.
  • Feature Pipelines: Build and maintain scalable pipelines for generating, storing, and serving vector embeddings, including nearest-neighbor index management and quality validation.
  • Data Curation at Scale: Source, filter, and curate training datasets using a combination of SQL and model-derived signals (e.g., aesthetic scores, NSFW classifiers), owning the end-to-end data flow and maintaining governance, quality, and compliance.
  • Additional Responsibilities
  • LLM-Assisted Annotation: Design and operate pipelines that use LLMs and vision models for automated annotation of training data, including auditing workflows to measure and improve annotation model performance.
  • Tooling & Frameworks: Contribute to shared tooling and frameworks that make it easier for the broader team to build AI-augmented data pipelines, e.g., reusable operators for model invocation, standard patterns for async job management.

Required Skills

  • Advanced SQL & data pipeline expertise. Complex queries, query optimization, pipeline orchestration frameworks (Airflow, Dataswarm, or equivalent).
  • Experience integrating ML models into data pipelines. Calling inference endpoints, managing model versions, batching requests, and handling inference failures at scale.
  • Proficiency with AI-assisted coding agents (e.g., Copilot, Cursor, Codex). Expected to leverage AI tools as a force multiplier for writing, debugging, and reviewing code, building pipelines faster, and accelerating day-to-day engineering workflows
  • Strong verbal and written communication skills, problem-solving ability, and cross-functional collaboration.

Preferred Skills

  • Working knowledge of embeddings and vector representations, like generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).
  • Familiarity with content-understanding models like image classifiers, object detection, OCR, NSFW detection, and aesthetic scoring.
  • Experience with LLMs for data tasks like prompt engineering for annotation, data cleaning, or evaluation using LLM APIs.
  • Knowledge of generative AI like diffusion models, image generation, and evaluation metrics (FID, CLIP score, etc.).

Education / Experience

  • Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.
  • Minimum 5 years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.
  • Demonstrated track record of building and operating production data pipelines that invoke ML models at scale.

Benefits

  • 401(k).
  • Dental Insurance.
  • Health insurance.
  • Vision insurance.
  • We are an equal-opportunity employer and value diversity, equality, inclusion, and respect for people.
  • The salary will be determined based on several factors, including, but not limited to, location, relevant education, qualifications, experience, technical skills, and business needs.

Additional Responsibilities

  • Participate in OP monthly team meetings and participate in team-building efforts.
  • Contribute to OP technical discussions, peer reviews, etc.
  • Contribute content and collaborate via the OP-Wiki/Knowledge Base.
  • Provide status reports to OP Account Management as requested.

About Us

At OP, we help you harness the power of technology for maximum impact. A technology consulting and solutions company, we offer advisory and managed services, innovative platforms, and staffing solutions across a wide range of fields including AI, cyber security, enterprise architecture, and beyond. For nearly two decades, we ve been challenging the status quo of the consulting industry, serving up fresh, ingenious thinking through a radically lean structure. Together, this strategy delivers unprecedented performance at an unparalleled pace for faster results that propel your business forward.

Salary : $62 - $68

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior AI Data Engineer?

Sign up to receive alerts about other jobs on the Senior AI Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$149,493 - $192,976
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at OP

  • OP Sunnyvale, CA
  • As a Product Configuration Analyst, you will take on a critical role in bringing AR/VR to our customers. You will work side-by-side with engineering and bu... more
  • Just Posted

  • OP Beverly Hills, CA
  • We are looking for a Principal Product Manager to lead product strategy and execution for large-scale eCommerce web and mobile app platforms serving millio... more
  • Just Posted

  • OP Beverly Hills, CA
  • An experienced Technical Program Manager (TPM) is needed to lead a large-scale Oracle Fusion ERP implementation integrated with Oracle Retail Merchandising... more
  • Just Posted

  • OP Redmond, WA
  • We're looking for a Technical Program Manager to drive program execution and manage stakeholders and dependencies of our uLED backplane silicon programs, f... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Senior AI Data Engineer jobs in the Menlo Park, CA area that may be a better fit.

  • NVIDIA AI Santa Clara, CA
  • Job Requisition ID JR2017502 Job Category Engineering Time Type Full time We are looking for outstanding Machine Learning Engineers to join our Physical AI... more
  • 21 Days Ago

  • Microsoft AI Mountain View, CA
  • Overview Microsoft AI (MAI) is seeking an experienced Senior Data Privacy & Governance Engineer to help build mission applications and platform components ... more
  • 26 Days Ago

AI Assistant is available now!

Feel free to start your new journey!