What are the responsibilities and job description for the Senior Infrastructure Software Engineer position at Voltage Park?
Voltage Park is your enterprise AI factory. We offer scalable compute power and on-demand and reserved bare-metal AI infrastructure using NVIDIA GPUs, with world-class service, performance, and value. Founded with the mission of making AI computing accessible to all, our flexible, affordable GPU solutions power everyone from builders to enterprises.
As a part of these efforts, we are seeking to add a Senior Infrastructure Software Engineer to our Infrastructure Engineering team. Our team is responsible for building automation, tooling, and API-driven systems to bridge the gap between our physical infrastructure and the systems that our customers depend on for AI/ML training, inference, and HPC workloads at scale. In this role, you’ll design and implement systems that enable humans and software to interact programmatically with thousands of bare-metal servers, storage clusters, and high-performance networks. You will work closely with teams across Voltage Park to drive new infrastructure rollouts and improve the lifecycle management of existing resources.
This is an on-site role in our Redmond, Washington office. We are not able to provide visa sponsorship for this position at this time.
What You Will Do
- Design, build and maintain tools, APIs, and automation frameworks to manage physical infrastructure at scale.
- Build and extend systems for server lifecycle management.
- Implement observability, telemetry, and logging systems that enable visibility and insights into the health of our hardware.
- Collaborate with our Network, Infrastructure Operations, Platform Engineering, and Customer Experience teams to define requirements for and build new tools.
- Participate in architectural discussions to help define the direction of infrastructure engineering at Voltage Park.
- Write clear design documents and technical documentation.
Required Qualifications
- 8 years of professional experience in software engineering, infrastructure engineering, or related fields.
- Strong experience with Linux in production environments.
- Proficiency in Python or similar object-oriented programming languages.
- Familiarity with containerization and orchestration concepts.
- Understanding of HPC infrastructure fundamentals, bare-metal provisioning and out-of-band management.
- Experience balancing pragmatic shipping with good long-term architecture.
- Comfortable navigating ambiguity.
- Strong written and verbal communication skills.
- Experience with bare-metal hardware troubleshooting and provisioning; extra points for working with Dell hardware.
- Experience with GPU servers, either in bare-metal form or under virtualization.
- Deep experience with network switches, routers, and firewalls, particularly SONiC switches, Palo Alto firewalls, and Juniper Networks equipment.
- Experience with VAST storage systems.
- You enjoy collaborating with a growing, motivated team focused on execution.
- You are comfortable operating with a high degree of autonomy and able to independently prioritize tasks aligning with company objectives.
- You possess a breadth of knowledge in your domain while also embracing the opportunity to take on diverse responsibilities.
- You value the importance of clear communication and documentation in driving success.