What are the responsibilities and job description for the Principal Data Engineer position at Curative AI, Inc.?
About Curative AI, Inc.
Curative AI, Inc. is an ambitious innovative early-stage startup revolutionizing the healthcare industry through cutting-edge AI-powered SaaS solutions. We are currently delivering exceptional value to our customers in Revenue Cycle Management (RCM) and Clinical Operations, empowering them with industry-transforming AI technology, intelligent automation, and deep data insights. Unlike most tech startups, we have achieved financial break-even within our first year, with signed customer contracts for $30 M in 2025 projected revenues through our AI healthcare solutions deliveries. Headquartered in Bellevue/Seattle, we have built a sizable top-notch tech team in the US, and a large talented offshore team. We now enter a rapid growth phase, hiring aggressively to scale the US-based team to 100 employees in 2025. Our vision is bold: to achieve a valuation of over $1 billion within the next 18 months by continuing to deliver exceptional solutions and expanding our market presence. This is an exceptional opportunity to join an early-stage AI Healthcare tech company with a proven product, established customer base, solid revenue streams, and explosive growth potential.
The Opportunity
Curative AI, Inc. is seeking a highly skilled and experienced Principal Data Engineer for our rapidly growing company. Our cutting-edge AI platform and tools will transform healthcare management, leading with RCM solutions for streamlined documentation, faster claims processing, enhanced clinical decision support, and much more. You will play a pivotal role in designing, building, and maintaining the data infrastructure that supports our data-driven initiatives. You will work closely with cross-functional teams to identify and prioritize data infrastructure needs, develop and implement data pipelines, and ensure the quality and reliability of our data. You are humble, a dedicated team player, and excited for the road ahead. Come work with a CEO renowned in the AI field with a proven record of building high performing teams, fostering career growth, and creating a positive work culture. Let's make healthcare smarter together.
Responsibilities
Curative AI, Inc. is an ambitious innovative early-stage startup revolutionizing the healthcare industry through cutting-edge AI-powered SaaS solutions. We are currently delivering exceptional value to our customers in Revenue Cycle Management (RCM) and Clinical Operations, empowering them with industry-transforming AI technology, intelligent automation, and deep data insights. Unlike most tech startups, we have achieved financial break-even within our first year, with signed customer contracts for $30 M in 2025 projected revenues through our AI healthcare solutions deliveries. Headquartered in Bellevue/Seattle, we have built a sizable top-notch tech team in the US, and a large talented offshore team. We now enter a rapid growth phase, hiring aggressively to scale the US-based team to 100 employees in 2025. Our vision is bold: to achieve a valuation of over $1 billion within the next 18 months by continuing to deliver exceptional solutions and expanding our market presence. This is an exceptional opportunity to join an early-stage AI Healthcare tech company with a proven product, established customer base, solid revenue streams, and explosive growth potential.
The Opportunity
Curative AI, Inc. is seeking a highly skilled and experienced Principal Data Engineer for our rapidly growing company. Our cutting-edge AI platform and tools will transform healthcare management, leading with RCM solutions for streamlined documentation, faster claims processing, enhanced clinical decision support, and much more. You will play a pivotal role in designing, building, and maintaining the data infrastructure that supports our data-driven initiatives. You will work closely with cross-functional teams to identify and prioritize data infrastructure needs, develop and implement data pipelines, and ensure the quality and reliability of our data. You are humble, a dedicated team player, and excited for the road ahead. Come work with a CEO renowned in the AI field with a proven record of building high performing teams, fostering career growth, and creating a positive work culture. Let's make healthcare smarter together.
Responsibilities
- Build & Own Data Pipelines: Design, implement, and optimize scalable data ingestion and ETL pipelines in Azure Databricks to integrate diverse data sources (EHRs, billing/claims, CRM, HRIS, scheduling, etc.).
- Healthcare Data Integration: Work with APIs, HL7, FHIR, X12/EDI, and other healthcare data standards to connect with platforms like CollaborateMD, Availity, Salesforce Health Cloud, Ensora Health, and EMRs.
- Data Platform Innovation: Contribute to the design of our AI-first data platform, supporting real-time data flows, vector search, embeddings, and LLM integrations.
- Data Quality & Governance: Implement robust monitoring, error handling, observability, and governance for sensitive PHI/PII data in compliance with HIPAA.
- AI Enablement: Partner with data scientists and ML engineers to make high-quality, structured, and unstructured data available for training, inference, and real-time AI agents.
- Performance at Scale: Optimize pipelines and storage for high throughput, low latency, and cost efficiency.
- Innovation Mindset: Rapidly prototype solutions for complex data challenges — doing things no one has done before in AI-driven healthcare RCM and clinical operations.
- You must currently be located in the Seattle Metro Region and able to work hybrid on-site a minimum of three days at our Bellevue location
- Bachelor’s degree in Computer Science, Data Engineering, or a related field
- Core Engineering Skills
- 7 years professional experience as a Data Engineer (or equivalent).
- Expertise with Azure Databricks, Spark, Delta Lake, and Azure Data Lake.
- Strong in Python, PySpark, SQL, and API integrations (REST, GraphQL).
- Proven experience with real-time data pipelines (Kafka, Event Hubs, streaming).
- Healthcare Domain (Preferred but Not Required)
- Knowledge of EHRs, HL7, FHIR, X12/EDI, RCM, EMR data models.
- Familiarity with payer/provider workflows, claims, and clinical documentation.
- AI/Next-Gen Skills (Big Plus)
- Experience enabling LLM/AI pipelines (vector databases, embeddings, LangChain, RAG).
- Familiarity with agentic AI workflows and real-time orchestration.
- Interest in integrating unstructured data (clinical notes, PDFs, images) into structured pipelines.
- Mindset & Traits
- Entrepreneurial, resourceful, and fast-learning.
- Thrives in ambiguity and “greenfield” challenges.
- Excited to push boundaries in AI-powered healthcare data platforms.
- Opportunity to build a first-of-its-kind AI healthcare data platform used in real-world clinical and RCM workflows.
- A team that values innovation, boldness, and velocity.
- Competitive compensation, equity, and benefits.
- A culture of ownership, learning, and impact.
- Base Salary Range: $185,000 - $220,000 per year (commensurate with experience and qualifications)
- Target Annual Performance Bonus
- Equity Package: Generous equity participation in the company's future success
- Comprehensive benefits package including medical, dental, vision, Life and AD&D insurance. Paid time off and holidays
Salary : $185,000 - $220,000