What are the responsibilities and job description for the Service Engineer position at Yochana?
Role: Service engineer
Location: Redmond. WA ( Hybrid)
Job Title: Service Engineer – High Performance Computing (HPC)
Overview
We are seeking a highly skilled Service Engineer with 7–10 years of experience in infrastructure management and a deep understanding of High Performance Computing (HPC) environments. The ideal candidate will have proven expertise in managing complex systems, integrating third-party applications, and ensuring seamless operations across distributed computing platforms. This role demands exceptional problem-solving skills, strong communication abilities, and a proactive approach to onboarding and supporting HPC workloads.
Required Skills & Qualifications
- Development Experience: Minimum 3 years in software development using PowerShell, Azure Bash, Go, or Python.
- Administration Experience: 7 years in system administration with a focus on PLM and HPC infrastructure.
- Communication & Collaboration: Excellent written and oral communication skills; ability to drive initiatives and collaborate effectively.
- Backlog Management: Proactively manage backlog and deliver efficiencies through solution design and development.
Preferred Attributes
- Passion for HPC technologies and infrastructure optimization.
- Ability to learn and adapt to new tools and frameworks quickly.
- Strong analytical and problem-solving mindset.
- Collaborative approach with cross-functional teams.
Key Responsibilities
- Drive integration and operational support for third-party applications on HPC and PLM infrastructure.
- Manage and maintain Windows Server and Linux OS environments, including VM and VMSS.
- Configure and optimize HPC schedulers such as Windows HPC Pack, OpenPBS, Slurm, or similar.
- Support distributed computing workloads, including MPI-based applications.
- Troubleshoot and optimize networking components (TCP/IP, name resolution, RDMA).
- Implement and manage Azure CycleCloud and Azure Batch for HPC workload orchestration.
- Collaborate with application owners to understand infrastructure dependencies and optimize performance.
- Ensure compliance with security and operational best practices across all systems.
Salary : $105,000 - $110,000