What are the responsibilities and job description for the Manager, HPC Solutions Architecture position at GTN Technical Staffing?
Manager, HPC Solutions Architecture
Location: Dallas, TX (Hybrid)
Type: Direct Hire
• Competitive base salary plus performance bonus
• 100% company-paid benefits
• Relocation available
Overview
We are seeking a Manager, HPC Solutions Architecture to lead a team of domain-specialist architects responsible for designing and delivering advanced compute platforms across HPC, AI/ML workloads, and next-generation CaaS / GPUaaS environments.
This is a highly customer-facing leadership role that spans the full solution lifecycle—from early engagement and requirements discovery through architecture design, proof-of-concept, deployment, and ongoing optimization. The role serves as a strategic bridge between customers, engineering, and product teams, ensuring solutions are scalable, secure, and aligned with both technical and business objectives.
You will lead the design and delivery of multi-tenant, GPU-accelerated platforms, enabling GPU-as-a-Service (GPUaaS) and Container-as-a-Service (CaaS) offerings across complex, distributed infrastructure environments.
The ideal candidate brings a strong blend of technical depth, leadership capability, and customer engagement experience, with a proven ability to drive architectural excellence across large-scale HPC and AI platforms.
Key Responsibilities
Leadership & Team Development
- Lead, mentor, and grow a high-performing team of Solutions Architects across compute, storage, networking, Kubernetes, and security
- Foster a culture of technical excellence, accountability, and continuous improvement
- Provide architectural guidance across complex, multi-domain solution design
Customer Engagement & Strategic Advisory
- Build trusted advisor relationships with customers, aligning solutions to business objectives and technical requirements
- Guide customers through the full lifecycle, including discovery, design, proof-of-concept, deployment, and optimization
- Lead executive-level architecture discussions, workshops, and technical deep-dives
Solution Architecture & Platform Design
- Oversee the design of scalable, resilient, and secure architectures across HPC and CaaS / GPUaaS platforms
- Define architectures supporting multi-tenant GPU environments, workload isolation, and high-performance compute delivery
- Conduct design reviews and workload assessments to optimize performance, scalability, and cost efficiency
Delivery & Execution Excellence
- Guide proof-of-concept initiatives to validate performance and accelerate customer adoption
- Establish reusable reference architectures, design patterns, and best practices
- Ensure consistency and quality across solution delivery and documentation
Product & Engineering Collaboration
- Act as a strategic partner to product and engineering teams, translating field insights into platform capabilities
- Influence roadmap development for HPC, AI infrastructure, and GPUaaS / CaaS platform evolution
- Contribute to the development of reusable frameworks and architectural standards
Innovation & Emerging Technologies
- Stay at the forefront of emerging technologies across GPUs, accelerators, interconnects, distributed storage, and orchestration
- Drive innovation across AI/ML workloads, containerized HPC, and next-generation compute platforms
- Translate emerging technologies into scalable, production-ready solutions
Required Experience
- 10+ years of experience in HPC, Solutions Architecture, or large-scale distributed systems design
- 3+ years of experience leading architecture or engineering teams
- Proven experience delivering complex, multi-domain solutions across compute, storage, networking, Kubernetes, and security
- Strong experience designing or supporting CaaS, GPUaaS, or multi-tenant compute platforms
- Deep expertise in HPC technologies including:
  - GPU acceleration (NVIDIA ecosystem)
  - Workload schedulers (Slurm, Kubernetes)
  - Distributed storage (VAST, Lustre, GPFS, object storage)
- Experience designing secure and compliant architectures (identity, encryption, regulatory considerations)
- Strong customer-facing experience with the ability to translate business requirements into scalable technical solutions
- Ability to communicate complex architectures to both technical and executive audiences
Technical & Domain Expertise
- Experience supporting AI/ML, simulation, scientific computing, or large-scale data workloads
- Familiarity with data platforms such as Kafka, Spark, or similar within HPC workflows
- Experience with automation and DevOps practices including CI/CD and Infrastructure-as-Code (Terraform, Ansible)
- Knowledge of high-performance networking (InfiniBand, RDMA, RoCE) and containerized HPC environments
Preferred Experience
- Experience leading customer workshops, architecture reviews, and technical presentations
- Exposure to cloud platforms (AWS, Azure, GCP) and hybrid HPC architectures
- Advanced degree in Computer Science, Engineering, or related field
- Relevant certifications (AWS, Azure, GCP, Kubernetes, networking, Linux)
Additional Requirements
- This position requires applicants to be currently authorized to work in the U.S. without employer sponsorship.
- We are unable to sponsor or take over sponsorship of employment visas at this time.