Demo

HPC on AWS Lead /Specialist/ SME- REMOTE

Simple Solutions
Jacksonville, FL Remote Full Time
POSTED ON 4/7/2026
AVAILABLE BEFORE 5/7/2026

Job Title: HPC on AWS Lead /Specialist/ SME- REMOTE

  • MUST HAVE : a resource with US Citizenship and Active Secret Service Clearence( SCI )
  • Skill Set:  DevOps/HPC tooling resource, exp with infra, landing zone etc.  for a project for Secret Deployment/AWS GovCloud
Overview:
The AWS HPC LEAD & SME is responsible for designing, implementing, and optimizing high-performance computing solutions on the AWS Cloud platform. This role combines deep technical expertise in distributed computing, data-intensive workflows, and AWS HPC services with the ability to lead architecture design sessions, define best practices, and ensure scalability, performance, and cost efficiency across enterprise or research workloads.


Key Responsibilities:

  • Lead the Design & Build: Develop scalable, high-performance architectures leveraging AWS HPC services such as AWS ParallelCluster, FSx for Lustre, EFA (Elastic Fabric Adapter), AWS Batch, and EC2 HPC instances.

  • Solution Implementation: Deploy, automate, and optimize HPC clusters and data pipelines for compute- and memory-intensive workloads, including modeling, simulation, genomics, CFD, AI/ML training, and financial risk analysis.

  • Performance Optimization: Benchmark, tune, and monitor system performance for compute, storage, and networking components to achieve optimal throughput and cost efficiency.

  • Infrastructure as Code (IaC): Implement reproducible environments using Terraform, AWS CDK, or CloudFormation to streamline provisioning, CI/CD, and configuration management.

  • Data and Storage Management: Design high-throughput parallel storage solutions using S3, FSx for Lustre, EBS, and EFS; integrate with hybrid and on-prem HPC environments.

  • Security and Compliance: Apply AWS Well-Architected Framework and HPC security best practices to ensure compliance with enterprise, academic, or government standards.

  • Collaboration and Leadership: Partner with application scientists, DevOps teams, and business stakeholders to translate workload requirements into optimized HPC architectures. Provide mentoring and technical leadership across multidisciplinary teams.

  • Documentation and Knowledge Sharing: Develop architecture diagrams, reference implementations, and technical playbooks to support ongoing HPC adoption and operations.

Required Skills & Experience:

  • 8-10 years of experience in high-performance computing, distributed systems, or cloud architecture.

  • Proven expertise in AWS HPC services (EC2 HPC, ParallelCluster, Batch, FSx for Lustre, EFA).

  • Strong knowledge of Linux systems administration, networking (Infiniband, EFA, MPI), and job schedulers (Slurm, Torque, PBS Pro).

  • Hands-on experience with automation and IaC (Terraform, Ansible, CloudFormation).

  • Scripting and development proficiency (Python, Bash, or similar).

  • Experience with monitoring tools (CloudWatch, Grafana, Prometheus) and cost-optimization strategies.

  • AWS Certified Solutions Architect – Professional or AWS Certified Advanced Networking preferred.

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related technical field.

Preferred Attributes:

  • Experience with GPU workloads, containerized HPC (ECS/EKS with ParallelCluster), or hybrid/on-prem to cloud HPC migrations.

  • Strong communication and presentation skills for executive and technical audiences.

  • Demonstrated thought leadership in HPC strategy, performance benchmarking, and AWS innovation.



Salary : $95

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Simple Solutions

  • Simple Solutions York, NY
  • Job Title: Senior Software Backend Engineer - Remote or Hybrid 2-3 days a week Location: NYC, NY or SF, CA for hybrid BUT also open to Remote anywhere in t... more
  • 11 Days Ago

  • Simple Solutions Jacksonville, FL
  • GenAI Senior Data Scientist There will be 2 Interviews and then a 2-3 hour take home coding assignment that they must have run the code before an interview... more
  • 11 Days Ago

  • Simple Solutions Jacksonville, FL
  • Job Title: Dedicated Support Engineer (DSE) / Rocky Linux Consultant Job Summary: SME for Architecture in Linux Red Hat Role: Dedicated Support Engineer – ... more
  • 11 Days Ago

  • Simple Solutions Santa Clara, CA
  • Wireless Network Engineer Skills 5 to 8 years experience with enterprise Wi-Fi including but not limited to 802.11 standards, encryption,, 802.1x, RADIUS, ... more
  • 11 Days Ago


Not the job you're looking for? Here are some other HPC on AWS Lead /Specialist/ SME- REMOTE jobs in the Jacksonville, FL area that may be a better fit.

  • CitiusTech Jacksonville, FL
  • Who we are: - At CitiusTech, we constantly strive to solve the industry's greatest challenges with technology, creativity, and agility. With over 8,500 hea... more
  • 1 Day Ago

  • Simple Solutions Jacksonville, FL
  • Job Title: Senior Platform Engineer- Websphere & DevSec Ops SME - REMOTE Team : Middleware Engineering Role :Senior Platform Engineer Overview: Summary The... more
  • 1 Month Ago

AI Assistant is available now!

Feel free to start your new journey!