Demo

Infra Engineer - API

Medal
York, NY Full Time
POSTED ON 6/15/2026
AVAILABLE BEFORE 7/21/2026
About General Intuition

We are the frontier research lab dedicated to building foundation models for environments that require deep spatial and temporal reasoning. For the past year, we've been pushing the forefront of AI across agents capable of navigating space and time, world models that provide training environments for those agents, and video understanding models with a focus on transfer to the real world.

We raised a seed round of $133M from General Catalyst and Khosla to discover the next generation of intelligence.

The Role

We're hiring an Infra Engineer to own General Intuition's API.

Our research team builds frontier models — agents that reason about space and time, world models, video understanding. Your job is to turn those models into a production API that developers love: low-latency, highly available, billing-grade reliable, and able to scale from our first hundred users to tens of thousands of concurrent ones.

You'll work directly with the founding team. You'll own the API end to end: the client libraries developers integrate with, how we receive frames from clients and stream actions back, how requests route to the right GPU, how sessions spin up and tear down, how k8s clusters get stood up in new regions, and how our GPU fleet scales.

This is a true generalist infrastructure role. We are not looking for a pure API person or a pure GPU person — we are looking for someone who is exceptional at both, and who wants to own the entire surface end-to-end.

Key Responsibilities

  • Own the video streaming protocol. Orchestrating how we receive frames from clients and route them to servers as efficiently as possible.
  • Own the runtime layer of our API. Stateful request routing, GPU session lifecycle, inference orchestration — the whole runtime stack.
  • Scale our k8s footprint across regions. Lead new regional deployments.
  • Own the GPU hosting strategy. Move us from dozens of GPUs today to potentially thousands (and beyond) without breaking the bank or the latency budget.
  • Drive latency and throughput. Own the inference-performance backlog
  • Partner with product engineering. Work closely on developer-facing reliability, observability, metering, and billing-grade uptime.

Qualifications

You almost certainly have:

  • A track record of personally scaling a high-traffic, low-latency API in production, whether at a gaming company, a video streaming company, a payments company, or a hyperscaler.
  • Deep k8s experience, including multi-region deployments.
  • Comfort with SLOs and capacity planning.
  • Strong ownership instinct — you've taken systems end-to-end, not just contributed to them.

Bonus Points For Any Of

  • Experience deploying streaming video or audio inference models (the dream hire).
  • Experience with low-latency game streaming or video streaming infra.
  • Experience scaling GPU fleets across providers (GCP, Coreweave, Lambda, etc.).
  • Experience with frontier model inference (LLMs, world models, multimodal).
  • Experience with on-device / edge inference (ExecuTorch, Core ML, etc.).

Our stack

  • GPUs: GCP today, Coreweave as we scale.
  • Orchestration: Kubernetes, multi-region.
  • Models: In-house frontier research — agents, world models, video understanding.
  • API surface: Client libraries in TypeScript, Python, Rust, and C
  • In-office, 5 days/week. NYC, Stockholm, London, Paris, or Geneva.

Benefits

  • Competitive salary and meaningful equity
  • Comprehensive medical, dental, and vision coverage
  • 401(k)
  • Wellhub membership for fitness and wellness
  • Mental health support through Spring Health and Headspace
  • Fertility and maternal health benefits
  • Paid parental leave
  • Generous PTO, 11 paid company holidays, and paid sick time
  • Daily meals and commuter benefits at our NYC HQ
  • Learning and development stipend

Benefits vary by country and employment type.

Compensation Range: $250K - $400K

Salary : $250,000 - $400,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Infra Engineer - API?

Sign up to receive alerts about other jobs on the Infra Engineer - API career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$85,996 - $102,718
Income Estimation: 
$111,859 - $131,446
Income Estimation: 
$110,457 - $133,106
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$122,763 - $145,698
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Medal

  • Medal York, NY
  • About General Intuition General Intuition is the frontier research lab for acting in space and time. We build large action models that can perceive, predic... more
  • 1 Day Ago

  • Medal York, NY
  • The Company Medal Medal is the world’s largest and fastest-growing platform for gaming clips, where millions of gamers capture, share, and relive their bes... more
  • 3 Days Ago

  • Medal York, NY
  • Medal Medal is the world’s largest and fastest-growing platform for gaming clips, where millions of gamers capture, share, and relive their best moments. E... more
  • 3 Days Ago

  • Medal York, NY
  • About General Intuition The most powerful foundation models are trained on written words. But human intelligence far exceeds language. Truly intelligent ma... more
  • 9 Days Ago


Not the job you're looking for? Here are some other Infra Engineer - API jobs in the York, NY area that may be a better fit.

  • Merge API York, NY
  • Merge is the leading provider of agentic tools and customer-facing integrations for frontier LLMs, Fortune 500 organizations, and B2B SaaS companies. Our p... more
  • 11 Days Ago

  • Net2Source (N2S) York, NY
  • Systems Engineer – Enterprise IoT & Infrastructure (L2–L3) NYC, NY (Onsite – physical device handling required) What are the top 3 skills required for this... more
  • 22 Days Ago

AI Assistant is available now!

Feel free to start your new journey!