Demo

Data Scientist, Knowledge Graphs

Mithrl
San Francisco, CA Full Time
POSTED ON 12/3/2025
AVAILABLE BEFORE 5/31/2026

ABOUT MITHRL


We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.


Mithrl is building the world’s first commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists ask questions in natural language, and Mithrl responds with real analysis, novel targets, hypotheses, and patent-ready reports.

No coding. No waiting. No bioinformatics bottlenecks.


We are one of the fastest growing tech bio companies in the Bay Area with 12x year over year revenue growth. Our platform is used across three continents by leading biotechs and big pharmas. We power breakthroughs from early target discovery to mechanism-of-action. And we are just getting started.



ABOUT THE ROLE


We are hiring a Data Scientist, Knowledge Graphs to build and scale the biological knowledge layer that powers the Mithrl AI Co-Scientist. This role focuses on ingesting and harmonizing the world’s most important biological data sources and curating the relationships that allow our system to reason across pathways, targets, diseases, compounds, and multimodal datasets.


You will ingest data from public consortia and well maintained peer reviewed sources and unify them into a coherent, versioned knowledge graph. You will identify new node types, define relationship schemas, harmonize variable IDs, and ensure metadata remains consistent across all integrated sources. You will also build automated curation pipelines that expand and refine the knowledge graph using both data driven methods and domain logic.


Beyond ingestion and curation, you will create the tools and frameworks that allow users to interact with the knowledge graph and even build their own custom graphs based on the results they generate inside Mithrl. Your work will form the foundation for pathway reasoning, target scoring, evidence aggregation, and multimodal interpretation inside the AI Co-Scientist.



WHAT YOU WILL DO


  • Ingest, harmonize, and version high value public biological datasets such as CellxGene, Gemma, ARCHS4, ENCODE, GTEx, TCGA, etc.
  • Ingest well maintained peer reviewed knowledgebases including OpenTargets, HPA, and similar resources
  • Build automated pipelines to curate and expand relationships inside the knowledge graph
  • Define and evolve schemas for node types, relationships, metadata rules, and ontology alignment
  • Harmonize variable IDs and metadata fields across all imported sources to create a unified knowledge layer
  • Build and maintain versioning, change tracking, and provenance systems for all data and relationships
  • Develop the framework that allows users to build custom knowledge graphs from the analyses they run inside Mithrl
  • Build features that allow users to explore, query, and interact with their graphs
  • Work closely with ML engineers, bioinformatics teams, and discovery application teams to ensure the knowledge graph supports downstream reasoning and analysis
  • Validate the correctness, completeness, and integrity of the knowledge graph across releases



WHAT YOU BRING


Required Qualifications


  • Strong experience in data science, bioinformatics, computational biology, or a related field
  • Experience working with biological knowledgebases, public datasets, or ontology driven systems
  • Familiarity with graph data structures, relationship modeling, and knowledge graph concepts
  • Experience harmonizing heterogeneous biological datasets and mapping variable IDs across sources
  • Proficiency in Python and scientific computing libraries
  • Ability to build ingestion pipelines for structured or semi structured biological data
  • Strong understanding of metadata standards, biological ontologies, and domain logic
  • Ability to translate complex biological information into structured, machine readable representations
  • Excellent communication skills and comfort collaborating across engineering and scientific teams


Nice to Have


  • Experience with graph databases or graph query languages
  • Experience with KG curation, link prediction, relationship extraction, or graph based ML
  • Familiarity with multi modal data integration
  • Previous work on biological or chemical knowledge graphs
  • Experience with public consortia such as ENCODE, GTEx, TCGA, or ChEMBL, etc.
  • Prior experience in a tech bio startup or scientific software environment



WHAT YOU WILL LOVE AT MITHRL


  • You will build the core knowledge layer that the AI Co-Scientist uses to reason about biology
  • Team: Join a tight-knit, talent-dense team of engineers, scientists, and builders
  • Culture: We value consistency, clarity, and hard work. We solve hard problems through focused daily execution
  • Speed: We ship fast (2x/week) and improve continuously based on real user feedback
  • Location: Beautiful SF office with a high-energy, in-person culture
  • Benefits: Comprehensive PPO health coverage through Anthem (medical, dental, and vision) 401(k) with top-tier plans

Salary : $150,000 - $240,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Data Scientist, Knowledge Graphs?

Sign up to receive alerts about other jobs on the Data Scientist, Knowledge Graphs career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$116,765 - $144,626
Income Estimation: 
$142,836 - $179,016
Income Estimation: 
$177,911 - $222,488
Income Estimation: 
$159,877 - $204,987
Income Estimation: 
$73,798 - $89,311
Income Estimation: 
$90,112 - $113,166
Income Estimation: 
$90,112 - $113,166
Income Estimation: 
$116,765 - $144,626
Income Estimation: 
$116,765 - $144,626
Income Estimation: 
$142,836 - $179,016
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Mithrl

  • Mithrl San Francisco, CA
  • ABOUT MITHRLWe imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.M... more
  • 15 Days Ago

  • Mithrl San Francisco, CA
  • ABOUT MITHRLWe imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.M... more
  • 6 Days Ago

  • Mithrl San Francisco, CA
  • About Mithrl We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.... more
  • 13 Days Ago


Not the job you're looking for? Here are some other Data Scientist, Knowledge Graphs jobs in the San Francisco, CA area that may be a better fit.

  • CyberCoders San Mateo, CA
  • Technical Lead - Knowledge Graph Engineer Hybrid ~2x a week in San Mateo, CA $250k-$350k base (may have some flex here if needed) equity We just secured $4... more
  • 10 Days Ago

  • Vanguard-IP San Francisco, CA
  • REQUIREMENTS - Candidates must have a JD with strong academic credentials and be admitted to practice and be in good standing in the state in which they wi... more
  • 22 Days Ago

AI Assistant is available now!

Feel free to start your new journey!