What are the responsibilities and job description for the Senior Infrastructure Engineer position at Jobright.ai?
Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust.
Job Summary:
Groq is building a custom cloud from the ground up, focusing on creating production-ready Kubernetes clusters for AI workloads. The Infrastructure Engineer will play a key role in provisioning, automation, and collaborating with data center and networking teams to scale infrastructure efforts.
Responsibilities:
• Support the provisioning and deployment of Kubernetes clusters on bare metal servers.
• Help build and maintain tooling for bare metal provisioning — including DHCP, DNS, PXE/iPXE/HTTPBoot, and Talos Linux Machine Configuration.
• Write and maintain scripts and services (Go, Python, Bash) to automate deployment workflows across new and existing sites.
• Partner with data center operations and networking teams to ensure hardware is correctly configured, connected, and ready for use.
• Manage infrastructure configuration using tools like Git, Flux, and Terraform.
• Contribute to system documentation, runbooks, and tooling that makes our infrastructure reliable and repeatable.
Qualifications:
Required:
• Experience with Linux / Kubernetes systems and comfort working in a terminal.
• Familiarity with infrastructure-as-code and Git-based workflows (e.g., Terraform, Flux, Kustomize).
• Ability to write and maintain basic tooling in Go, Python, or Bash.
• Understanding of networking fundamentals (IPAM, VLANs, DHCP, DNS).
• Working knowledge of storage concepts (block vs object, NFS, RAID, etc.).
• Strong sense of ownership and a willingness to dive into hardware, firmware, or low-level provisioning issues.
Preferred:
• Experience provisioning physical machines in a data center environment.
• Exposure to Talos Linux, Kubernetes bootstrapping, or Kubernetes platform engineering.
• Previous collaboration with facilities, hardware, or network teams in an operational role.
Company:
Groq radically simplifies compute to accelerate workloads in artificial intelligence, machine learning, and high-performance computing. Founded in 2016, the company is headquartered in Mountain View, California, USA, with a team of 201-500 employees. The company is currently Late Stage. Groq has a track record of offering H1B sponsorships.