What are the responsibilities and job description for the Network Platform Engineer (Senior) position at Berkeley Lab?
Lawrence Berkeley National Lab’s National Energy Research Scientific Computing Center (NERSC) is seeking a Network Platform Engineer (Senior). Join us in accelerating scientific discovery for the DOE Office of Science by advancing data movement, computation, and evolving data management workflows.
As a Network Platform Engineer (Senior), you will bring your skills in automation, software development and AI-driven networking to shape our High-Performance Computing (HPC) and Artificial Intelligence (AI) network technology, improving performance and driving optimization of our HPC and Data Center infrastructure. Our group manages the 1 Tb/s border bandwidth to ESnet and an 800G/400G data center network backbone which support the NERSC-9 & NERSC-10 supercomputers, multi-tier storage, archive and edge services. We enable real-time scientific computation for our >10,000 users and your expertise will contribute to our efforts in enhancing performance, scalability, automation, and reliability of scientific workflows
.
We’re here for the same mission, to bring science solutions to the world. We welcome candidates from all backgrounds, including those with non-traditional paths. We value a growth mindset and believe skills and experience are transferable. If you’re eager to learn and meet the minimum qualifications below, we encourage you to apply. Join our team where your work can have a high impact for an organization associated with 17 Nobel Prizes… and countin
g.
Network Engineer Level 3 wi
- ll:Manage software, automation, and observability solutions for network provisioning, configuration, monitoring, and troubleshooting in HPC and Data Center environmen
- ts.Contribute to Data Center modernization efforts and NERSC’s Smart Facility initiat
- iveSupport efforts to design and deliver network services to address emerging needs (e.g., American Science Cloud, new Edge service
- s).Continuously monitor and optimize network performance, focusing on reducing latency, maximizing throughput, and improving fault toleran
- ce.Create and maintain comprehensive network documentation, including physical & logical topological diagra
- ms.Collaborate with the Security Group to implement security measures for data integrity and privacy, ensuring high availability and reliability through redundancy and failover mechanis
- ms.Share on-call rotation with colleagues and serve as an escalation contact for service inciden
- ts.Work on and resolve complex issues where analysis of situations or data requires an in-depth evaluation of multiple variabl
- es.Exercise judgment in selecting methods, techniques and evaluation criteria for obtaining resul
- ts.Network with key contacts outside your own area of experti
s
e. In addition to the above, the Senior Network Engineer Level 4 wil
- l: Develop software, automation, and observability solutions for network provisioning, configuration, monitoring, and troubleshooting in HPC and Data Center environmen
- ts.Lead Data Center modernization efforts in support of NERSC’s Smart Facility initiative and emerging needs e.g., American ScienceCl
- oudDesign, develop, and maintain automation frameworks, infrastructure-as-code, and software solutions to manage, optimize, and self-heal the HPC and data center netwo
- rk.Build and integrate AI/ML-driven observability, predictive analytics, and automated remediation capabilities into network operatio
- ns.Work on and resolve significant and/or unique issues where analysis of situations or data requires an evaluation of intangibl
- es.Determine methods and procedures on new assignments and may coordinate activities of other personn
- el.Exercise independent judgment in methods, techniques and evaluation criteria for obtaining resul
ts.
Network Engineer Level 3 will h
- ave:Typically requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or equivalent experie
- nce.5 years of IP networking experience in Data Center or LAN environme
- nts.2 years of network automation and optimization experie
- nce.Project management experience in developing technical project scope, schedule and bud
- get.Strong written, verbal, listening, and presentation skills; effective collaboration skills with technical peers, vendors, and custom
- ers.Ability to resolve complex issues in creative and effective w
- ays.Ability to network and collaborate with key contacts outside their own area of expert
- ise.Demonstrated ability to work effectively as part of a cross-disciplinary t
eam.
Senior Network Engineer Level 4 will h
- ave: Typically requires a minimum of 12 years of related experience with a Bachelor’s degree; or 8 years and a Master’s degree; or equivalent experi
- ence.10 years of IP networking experience in Data Center or LAN environm
- ents.4 years of network automation and optimization experi
- ence.Demonstrated project management expertise, specifically, in leading the development of technical project scope, schedule and bu
- dget.Excellent written, verbal, listening, and presentation skills; excellent communication and collaboration skills with technical peers, vendors, and custo
mers.
Desired skills/know
ledge:(At Level 3, at least 3 of the following; at Level 4, at least 5 of the follo
- wing):Expert-level capabilities in configuring, troubleshooting, and using IPv4/IPv6 routing protocols, preferably BGP, VXLAN, L3VPN, OSPF, and/or ISIS in a WAN or LAN enviro
- nment.Strong software development skills in Python or Go and experience with network automation tools (Ansible, Terraform, eAPI,
- etc.).Demonstrated experience with AI-driven networking, intent-based networking, or ML-based network optimiz
- ation.Demonstrated experience in leading network technology migrations or upgrades in high-performance computing environ
- ments.Demonstrated experience in automating the network’s daily operational, deployment and monitoring
- tasks.Demonstrated experience with RDMA (Remote Direct Memory Access), Infiniband, HPC Protocol, RoCE (RDMA over Converged Ethe
- rnet).Demonstrated experience working with Network Management platforms like Arista CloudVision, Nvidia UFM, Del
- l SFM.Demonstrated experience in designing and implementing network security architectures, including segmentation (e.g., VRFs, micro-segmentation), access control (ACLs), encryption (IPsec/MACsec), and threat detection/mitigation in large-scale enterprise or HPC environ
- ments.Demonstrated experience in hybrid cloud networking, including connectivity models (VPN, Direct Connect/ExpressRoute), multi-cloud architectures, and integrating on-prem HPC or data center networks with public cloud environ