What are the responsibilities and job description for the Data Engineer position at Underdog.io -Apply to top tech jobs in 60 seconds. A place where companies apply to you?
We are a mission-driven team on a mission to modernize the foundational infrastructure of healthcare. We believe that high-value care starts with connectivity—specifically, the ability to connect patients with the right providers. Currently, the healthcare industry relies on fragmented, outdated data and manual processes to build the networks that dictate patient access.
We are building a purpose-built software platform that combines best-in-class data with intelligent workflow tools. Our goal is to empower healthcare organizations to move beyond static, administrative network management and toward dynamic, value-driven network design. By solving this critical but overlooked problem, we are helping payers and providers ensure patients can find the right doctor for their medical and personal needs.
Backed by top-tier investors including Tiger Global and Primary Ventures, we are a collaborative group of healthcare operators, data experts, and technologists. We are growing thoughtfully, prioritizing a strong culture and real business impact over hyper-growth at any cost. We are looking for team members who want ownership, agency, and the chance to shape a product from the ground up.
The Role: The First Data Engineer
We are seeking a talented and entrepreneurial Data Engineer to become the founding member of our data engineering practice. This is a high-impact role for someone who thrives in a collaborative environment and wants to build infrastructure from the ground up. You will be responsible for architecting, building, and optimizing the production-grade data pipelines that power our analytics and our application.
Your primary mandate is to transform how we handle data—making it faster, more reliable, and more accessible. You will be the bridge that turns massive, raw healthcare datasets into actionable intelligence for our customers and core features for our platform.
Success in this role will be defined by:
- Productizing Insights: Transitioning custom, one-off reporting workflows into scalable, automated features within the core application.
- Architecting for Scale: Designing high-volume ingestion pipelines capable of processing the massive datasets required by our newest enterprise partners.
- Accelerating Time-to-Value: Drastically reducing the latency between raw data ingestion and tangible customer impact.
- Ensuring Trust: Implementing robust data observability, quality gates, and testing frameworks to guarantee data integrity.
What You’ll Do
- Architect, design, develop, and maintain scalable data architecture to support the extraction, transformation, and storage of complex healthcare data.
- Partner closely with data analysts, scientists, and software engineers to understand their needs and empower them with clean, reliable data.
- Build and optimize batch processing workflows to handle large-scale data volumes efficiently.
- Establish and enforce best practices for data observability, quality, and lineage to ensure trust throughout the pipeline.
- Tune and optimize existing pipelines and database queries for performance and reliability as we scale.
- Collaborate with the engineering team to integrate data engineering best practices with a Python-based application codebase.
Who You Are
- You have 3-5 years of professional data engineering experience, preferably in a fast-paced startup or SaaS environment.
- You possess deep, production-level experience with both columnar data warehouses (e.g., Snowflake, BigQuery, Redshift) and transactional databases (e.g., Postgres), with the expertise to design schemas and tune queries for both.
- You are highly proficient in dbt and have strong opinions on how to use it for transformation and testing.
- You have extensive experience with batch processing of large datasets.
- You are familiar with modern orchestration tools such as Dagster or Airflow .
- You write clean, maintainable Python code and have strong opinions on how to leverage modern tooling, including LLM-assisted development, to improve engineering velocity.
- You thrive in small, cross-functional teams where collaboration, clear communication, and a "no-ego" attitude are paramount.
- Bonus: Experience working with healthcare data (claims, provider directories, FHIR standards) is a plus, but a passion for solving hard problems is non-negotiable.
Why Join Us?
- Impact: As the first dedicated data engineer, you will define the data architecture and set the standard for years to come.
- Location: This role is based in New York City, with an expectation to work from the office three days per week to foster collaboration. (Exceptional candidates outside of NYC willing to travel quarterly may be considered).
- Compensation: Competitive salary ($150,000 – $175,000) commensurate with experience, plus significant equity ownership.
- Benefits: Comprehensive health insurance, commuter benefits, and a stipend for home office equipment.
- Time Off: Unlimited PTO—we trust you to manage your time and recharge when needed.
- Culture: Join a world-class team of proven, humble entrepreneurs. We are building a business for the long haul, and we prioritize psychological safety, curiosity, and fun.
Salary : $150,000 - $175,000