Demo

Sr. Data Engineer

Greylock Partners
York, NY Full Time
POSTED ON 5/23/2026
AVAILABLE BEFORE 8/29/2026

Summary:

Growth-stage company is building a new class of AI-native platform where data is the product. The system processes millions of structured and unstructured outputs daily across a rapidly growing set of external sources, and the underlying data infrastructure directly determines product reliability, correctness, and user trust.


The company is forming its first dedicated data function. This role will define how ingestion, processing, and data quality systems are architected and operated at scale. The decisions made here will shape the data platform for years, with direct impact on product velocity and customer experience.


What You’ll Do:

  • Own the architecture and reliability of a multi-source ingestion platform operating at high throughput and increasing scale
  • Design and evolve distributed data systems that support asynchronous processing, fault tolerance, and horizontal scalability
  • Define and enforce data contracts across ingestion, transformation, and serving layers to ensure end-to-end correctness
  • Establish best practices for pipeline observability, including monitoring, alerting, and system health tracking
  • Drive performance improvements across ingestion and processing layers, ensuring throughput stays ahead of product demand
  • Partner with product and engineering leadership to ensure data systems are designed proactively for upcoming features
  • Set technical direction for the data platform and mentor engineers as the team grows


Where You’ll Work in the Stack:

  • Ingestion: resilient, multi-provider data collection and normalization
  • Processing: distributed pipelines, async job orchestration, and transformation logic
  • Storage: columnar systems and query optimization for large-scale analytics
  • Serving: tight integration with application systems and product features
  • Observability: system-wide monitoring, alerting, and debugging infrastructure


What We’re Looking For:

  • Deep experience designing and operating production data systems at scale, including ingestion, ETL, and distributed processing
  • Strong expertise with modern data infrastructure, including columnar databases (e.g., ClickHouse or similar)
  • Proven track record owning system reliability, data quality, and observability in production environments
  • Experience working with external data sources, APIs, or scraping systems at scale
  • Comfort operating close to the application layer and understanding how data systems power user-facing features
  • Strong engineering fundamentals (Python or similar; experience with distributed systems required)
  • Ability to lead technical direction and influence system design across teams


Signals We’re Especially Excited About:

  • You’ve owned data platforms where correctness and latency directly impacted customers
  • You’ve built high-throughput systems with strict reliability and completeness requirements
  • You’ve designed observability systems that provide real-time insight into system health
  • You’ve operated in early-stage environments and built systems from first principles
  • You’ve led or mentored engineers while remaining deeply hands-on


Why This Role:

  • Define the data foundation for a rapidly scaling AI-native product
  • High ownership across architecture, reliability, and long-term system design
  • Direct impact on customer-facing product quality and trust
  • Opportunity to build and shape a data function from the ground up
  • Close collaboration with experienced founders and engineering leadership


About Greylock

Greylock is a leading early-stage venture capital firm that partners with exceptional founders building category-defining companies. Our portfolio includes Figma, Anthropic, Ramp, Abnormal Security, Rubrik, Airbnb, LinkedIn, Roblox, Dropbox, and Coinbase.


About the Greylock Recruiting Team

As full-time employees of Greylock, our team provides candidate referrals and introductions to our portfolio companies. We work closely with founders to build exceptional teams and bring deep experience across startups and large-scale technology companies.

Salary.com Estimation for Sr. Data Engineer in York, NY
$118,859 to $149,336
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Sr. Data Engineer?

Sign up to receive alerts about other jobs on the Sr. Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Greylock Partners

  • Greylock Partners San Francisco, CA
  • Greylock’s latest seed investment is building the next interface for how enterprise employees interact with computers. Founded by two second-time founders,... more
  • 4 Days Ago

  • Greylock Partners Redwood, CA
  • Early-stage, cybersecurity investment (valued over $100M at Seed), founded by a successful serial entrepreneur, is looking to hire a Founding MLE with a st... more
  • 4 Days Ago

  • Greylock Partners San Francisco, CA
  • Summary: Seed startup with 100 employees (company valued $XXXM already) is looking to continue to build out their ML team (already staffed with multiple AR... more
  • 6 Days Ago

  • Greylock Partners San Francisco, CA
  • One of our early-stage investments is looking to bring on a Senior AI Engineer to join the founding technical team. We’re looking for someone who has built... more
  • 6 Days Ago


Not the job you're looking for? Here are some other Sr. Data Engineer jobs in the York, NY area that may be a better fit.

  • Veracity Software Inc York, NY
  • Role: Sr. Fabric Data Engineer Engagement Type: Contract-to-Hire (C2H - 3 Months) Work Location: Remote (EST time zone) Start: Immediate / ASAP We are seek... more
  • 2 Months Ago

  • Circle York, NY
  • Circle (NYSE: CRCL) is one of the world’s leading internet financial platform companies, building the foundation of a more open, global economy through dig... more
  • 6 Days Ago

AI Assistant is available now!

Feel free to start your new journey!