What are the responsibilities and job description for the HPC System Engineer position at Hallmark Global Solutions Ltd?
Title: HPC System Engineer
Location: Atlanta, GA 30339 (Remote)
Essential Skills: AWS Terraform | DevOps | HPC profile
Skills: Digital: Amazon Web Services (AWS) Cloud Computing; Digital: Terraform; High Performance Computing Architecture
Experience Required: 6-8
Full Job Description
Strong experience with AWS services: EC2, VPC, IAM, S3, FSx, EBS, CloudWatch.
Hands-on experience with AWS ParallelCluster.
Solid understanding of Linux system administration (RHEL).
Experience with HPC schedulers (Slurm preferred).
Familiarity with GPU computing (NVIDIA drivers, CUDA, NCCL).
Scripting skills in Bash and Python.
DevOps and automation (GitHub Actions / AWS CodeBuild); Infrastructure as Code using Terraform / CloudFormation.
Monitoring and performance tuning tools.
Experience with AWS Batch and containerized HPC workloads.
Understanding of hybrid HPC (on-premises and AWS).
AWS Certifications (Solutions Architect, SysOps, or Specialty).
Experience in HPC CFD & FEA applications.
Roles & Responsibilities
Primary Skills: AWS, Linux Administration, HPC Applications and Administration, Terraform, CloudFormation for automation.
Secondary Skills: Shell scripting, Windows Administration
Generic Managerial Skills, if any: Yes
Keywords to Search in Resume
Primary Skills: AWS, Linux Administration, HPC Applications and Administration, CloudFormation for automation.
Secondary Skills: Shell scripting, Windows Administration