Demo

AI Data Engineer – Scientific Data Platforms (Remote)

Astrix
South San Francisco, CA Remote Full Time
POSTED ON 6/16/2026
AVAILABLE BEFORE 6/30/2027
Our client is a leading global biotechnology and pharmaceutical organization driven by a mission to innovate, continuously advance science, and ensure everyone has access to the healthcare they need.

Title: AI Data Engineer – Scientific Data Platforms

Location: Remote, Must work PST

Pay rate: $37-45/hr (Depends on experience level)

Schedule: Full-time (40 hours/week)

Duration: 1-year contract, (Plus benefits)

Position Overview

This role addresses a critical need in scaling our AI models for drug discovery by building largely automated, scalable, agent-driven data ingestion and curation pipelines for genomics data. This includes metadata inference, constructing performant query architectures, and transforming high-dimensional datasets (e.g., single-cell omics, clinical trials) into AI-ready training formats.

Key Responsibilities

  • Build an agentic data ingestion pipeline and move beyond bespoke steps toward agents that teams can reliably use as a shared, deployed service.
  • Triage and prioritize incoming requests to ingest specific datasets. Clean and organize data, building the first-pass cleaning and organization steps into the agentic flow.
  • Validate cross-modal linkage. Add automated checks that catch when ingested data does not connect correctly and flag low-quality or mismatched records.
  • Version every dataset, retaining and making prior versions addressable. Preserve raw data and provenance, ensuring agent workflows log validation and transformation steps so lineage is fully traceable.
  • Partner with AI, software engineering, and computational biology groups to co-define data standards and conventions.

Qualifications & Requirements

  • Demonstrated experience building multi-agent workflows or LLM workflows using tools/frameworks such as LangGraph or LlamaIndex, including tool/function calling and asynchronous task execution.
  • Strong Python skills for data manipulation, working with APIs and databases, and handling heterogeneous data formats.
  • Familiarity with dataset versioning approaches (e.g., DVC, lakeFS, or equivalent).
  • Comfortable with or showing a strong willingness to learn common omics data formats like AnnData, H5AD, and TileDB.
  • No deep bioinformatics expertise required; just a basic conceptual understanding of different modalities (e.g., RNA-seq vs. scRNA-seq vs. WES; genomics vs. transcriptomics vs. proteomics vs. metabolomics).
  • Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible.
  • Degree in a technical field or equivalent practical experience.
  • Must be Authorized to work in the United States without Sponsorship.

Nice to Have

  • Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints).
  • Exposure to cloud platforms (AWS, GCP) and containerization (Docker).
  • Familiarity with scientific workflow managers such as Nextflow or Snakemake.

INDBH

Salary : $37 - $45

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a AI Data Engineer – Scientific Data Platforms (Remote)?

Sign up to receive alerts about other jobs on the AI Data Engineer – Scientific Data Platforms (Remote) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Astrix

  • Astrix Boston, MA
  • At Astrix, we're expanding our team to support a diverse range of clients across various industries. We're seeking talented Medical Science Liaisons to joi... more
  • Just Posted

  • Astrix Fayetteville, GA
  • Pay Rate Low: 27 | Pay Rate High: 30 A well-established, premier food manufacturing company is seeking an experienced Maintenance Mechanic Technician to jo... more
  • Just Posted

  • Astrix Fayetteville, GA
  • Pay Rate Low: 38 | Pay Rate High: 42 At this time, Astrix cannot transfer nor sponsor a work Visa for this position. Relocation assistance is not available... more
  • Just Posted

  • Astrix Mundelein, IL
  • We’re hiring a Manufacturing Site Supervisor to join a well-established company with over 50 years of industry experience and a growing international prese... more
  • Just Posted


Not the job you're looking for? Here are some other AI Data Engineer – Scientific Data Platforms (Remote) jobs in the South San Francisco, CA area that may be a better fit.

  • BioSpace San Francisco, CA
  • At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, ... more
  • 23 Days Ago

  • Eli Lilly and Company San Francisco, CA
  • At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, ... more
  • 3 Days Ago

AI Assistant is available now!

Feel free to start your new journey!