What are the responsibilities and job description for the LLM Infrastructure & Backend Engineer (4518) position at HIRECLOUT?
Job Title: LLM Infrastructure & Backend Engineer
Role Overview
Join a mission-driven AI company focused on building secure, scalable platforms for enterprise-grade machine learning products. Backed by one of the world’s most powerful large language models, this organization is transforming conversational AI into real-world business applications. You’ll be part of a collaborative, fast-paced environment that values kindness, innovation, and impact.
This role is ideal for backend engineers who are passionate about ML systems and excited to shape the technical foundation of a cutting-edge enterprise AI product.
Key Responsibilities
- Design and build backend services to support LLM integration, inference orchestration, and data flow.
- Develop clean, production-grade Python code for experimentation, model serving, and system integration.
- Partner closely with ML researchers to rapidly prototype and deploy new product features.
- Build infrastructure capable of handling scalable inference workloads and complex enterprise use cases.
- Take ownership of backend components—ensuring reliability, observability, and maintainability.
Qualifications
- Proficient in backend development using Python, TypeScript, or Node.js.
- Experienced in building and deploying production PyTorch models, including inference pipelines and checkpoint handling.
- Strong foundation in developing scalable, secure APIs and backend services.
- Familiar with FastAPI, Postgres, Redis, Kubernetes, and React.
- Comfortable working in a high-velocity startup environment with evolving goals and systems.
Preferred Qualifications
- Experience bridging ML systems and backend engineering at the infrastructure level.
- Hands-on work in real-time inference orchestration or ML platform tooling.
- Familiarity with startup or early-stage product development environments.
Why Join
- Impact: Directly influence the next generation of ML-powered enterprise applications.
- Team: Join a kind, experienced, and collaborative team of engineers and researchers.
- Innovation: Help shape infrastructure for one of the most advanced LLMs in the world.
- Career Growth: Opportunity to take on leadership roles as the company scales.
Compensation and Benefits
- Competitive salary ($200k-350k depending on experience).
- Diverse medical, dental and vision options
- 401k matching program
- Unlimited paid time off
- Parental leave and flexibility for all parents and caregivers
- Support for country-specific visa needs for international employees living in the Bay Area
Salary: $200,000 - $350,000