HPC Infrastructure Platform Engineer

Oak Ridge National Laboratory
Oak Ridge, TN Full Time
POSTED ON 6/3/2026
AVAILABLE BEFORE 7/4/2026
Requisition Id 16521

Overview:

The High-Performance Computing Systems Section within the National Center for Computational Sciences (NCCS) is seeking an HPC Infrastructure Platform Engineer to join the HPC Infrastructure group. The preferred candidate will possess commensurate knowledge, skills and abilities in addition to relevant education, certifications, experience and demonstrated ability to work as a member of a team.

NCCS provides state-of-the-art computational and data science infrastructure, coupled with dedicated technical and scientific professionals, to tackle large-scale problems across a broad range of scientific domains and accelerate scientific discovery and engineering advances. NCCS hosts the Oak Ridge Leadership Computing Facility (OLCF), one of the Department of Energy's (DOE) National User Facilities, which operates Frontier, the nation's first exascale supercomputer.

Major Duties/Responsibilities:

Linux Administration:
  • Deploy, configure and manage HPC-scale services in a Linux environment, primarily Red Hat and Rocky
  • Perform regular patches, updates and backups
  • Monitor systems using tools like Nagios and Grafana
  • Respond to and assist in troubleshooting issues

Kubernetes Administration:
  • Build and maintain foundational internal platforms and tools to enable the HPC Infrastructure team to reliably deploy, monitor and scale applications
  • Design standardized and automated workflow patterns, build and maintain CI/CD pipelines
  • Offer self-service, excellent documentation and assistance to HPC Infrastructure group members for efficient consumption of platform services
  • Develop, maintain and review high quality code for internal tools using programming languages such as Python, Golang, or Rust

Identity Management and Security:
  • Deploy, configure and support identity and access management services using LDAP and PingFederate
  • Maintain and enable secure access for human users and automated workloads in Kubernetes

Virtualization and Automation:
  • Deploy and manage resources in the NCCS VMware environment
  • Identify potential automation targets and lead efforts to automate processes
  • Define policies and procedures for automation and configuration management for the team and organization as a whole

Project Management and Leadership:
  • Lead small Infrastructure projects through the project lifecycle
  • Mentor and train junior staff, creating training documentation, holding knowledge sharing sessions, and fostering skill growth throughout the team
  • Propose and implement improvements to existing Infrastructure systems as well as new systems, processes and procedures

Basic Qualifications:
  • Bachelor's degree in computer science or closely related field and a minimum of 5 years of experience in Linux systems and Kubernetes platform administration, or a master's degree and a minimum of 4 years of experience in Linux systems and Kubernetes platform administration
  • An equivalent combination of education and experience will be considered

Preferred Qualifications:
  • Excellent interpersonal/communication skills and the ability to work within a team
  • Strong experience designing, building and maintaining Kubernetes platform tools
  • Strong working knowledge of Linux system fundamentals and common network protocols
  • Programming and scripting skills in common languages such as Python and bash
  • Understanding of versioning and code review tools like GitHub and GitLab
  • Experience implementing and supporting highly-available systems and services
  • Experience with configuration management tools such as Puppet or Ansible
  • Experience deploying and maintaining virtual environments using VMware
  • Experience deploying, maintaining and troubleshooting a variety of infrastructure services such as OpenLDAP, DNS, DHCP, etc.
  • Ability to plan, prioritize and complete assigned projects with minimal supervision

Special Requirements:
  • This position requires the ability to obtain and maintain a clearance from the Department of Energy. As such, this position is a Workplace Substance Abuse Program (WSAP) testing designated position. WSAP positions require passing a pre-placement drug test and participation in an ongoing random drug testing program

About ORNL:

As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an impressive 80-year legacy of addressing the nation's most pressing challenges. Our team is made up of over 7,000 dedicated and innovative individuals! Our goal is to create an environment where a variety of perspectives and backgrounds are valued, ensuring ORNL is known as a top choice for employment. These principles are essential for supporting our broader mission to drive scientific breakthroughs and translate them into solutions for energy, environmental, and security challenges facing the nation.

ORNL offers competitive pay and benefits programs to attract and retain individuals who demonstrate exceptional work behaviors. The laboratory provides a range of employee benefits, including medical and retirement plans and flexible work hours, to support the well-being of you and your family. Employee amenities such as on-site fitness, banking, and cafeteria facilities are also available for added convenience.

Other benefits include the following: Prescription Drug Plan, Dental Plan, Vision Plan, 401(k) Retirement Plan, Contributory Pension Plan, Life Insurance, Disability Benefits, Generous Vacation and Holidays, Parental Leave, Legal Insurance with Identity Theft Protection, Employee Assistance Plan, Flexible Spending Accounts, Health Savings Accounts, Wellness Programs, Educational Assistance, Relocation Assistance, and Employee Discounts.

If you have difficulty using the online application system or need an accommodation to apply due to a disability, please email:

This position will remain open for a minimum of 5 days, after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.

If you have trouble applying for a position, please email

ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.

Salary.com Estimation for HPC Infrastructure Platform Engineer in Oak Ridge, TN
$100,831 to $125,420


