Demo

Senior Benchmark & Performance Engineer - AI & Storage Systems

DataDirect Networks
Los Angeles, CA Full Time
POSTED ON 12/20/2025
AVAILABLE BEFORE 1/19/2026
Senior Benchmark & Performance Engineer - AI & Storage Systems
Job Locations US-Remote
Job ID 2025-5400 Name Linked Remote: US Country United States City Remote Worker Type Regular Full-Time Employee
Overview

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

"DDN's A3I solutions are transforming the landscape of AI infrastructure." - IDC

"The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments" - Marc Hamilton, VP, Solutions Architecture & Engineering | NVIDIA

DDN is the global leader in AI and multi-cloud data management at scale. Our cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.

Job Description

We are seeking an experienced Senior Benchmark Engineer with deep expertise in AI workloads, parallel applications, and storage systems. You will be responsible for designing, executing, and analyzing complex benchmarks to evaluate and optimize performance across a range of infrastructure stacks - including AI inference, training, NVIDIA NIMs, RAG pipelines, and MPI-based HPC codes.

This role involves compiling and debugging large-scale distributed applications, creating automated benchmark pipelines, writing up detailed technical reports, and working closely with both engineering and field teams to communicate findings and architectural advantages.

Key Responsibilities:

    Design and execute performance benchmarks across AI, HPC, and storage platforms.
  • Run and tune AI inference workloads using frameworks such as PyTorch, TensorFlow, Triton, NVIDIA NIMs, and vector databases.
  • Benchmark large-scale RAG pipelines including data ingestion, retrieval, and inference performance.
  • Profile and optimize MPI and multi-node distributed applications.
  • Compile and debug C/C , Python, and CUDA-based codes across heterogeneous systems.
  • Generate automated test scripts and benchmarking workflows (e.g., with Bash, Python, or Slurm job scripts).
  • Analyze and visualize results using Excel, Jupyter, or reporting tools; create comparison graphs and KPIs.
  • Write clear, concise performance reports for both technical and non-technical stakeholders.
  • Present findings internally and externally, translating results into architectural guidance for field engineers and sales teams.
  • Collaborate with system engineers, product managers, and partners to tune and improve software/hardware stack performance.
  • Validate and tune performance on storage systems including parallel file systems (e.g., Lustre, GPFS), object storage, and NVMe over Fabrics.
  • Contribute to internal tooling to automate test cycles and performance regression tracking.

Required Qualifications:

  • 7 years of experience in performance engineering, benchmarking, or HPC/AI systems.
  • Deep experience with AI/ML and deep learning frameworks (PyTorch, TensorFlow, ONNX, Triton).
  • Familiarity with NVIDIA NIMs and containerized model serving stacks.
  • Proven expertise with MPI, OpenMP, Slurm or similar schedulers in large-scale compute environments.
  • Solid understanding of file and storage systems (e.g., POSIX, Lustre, S3, NVMe-oF).
  • Strong Linux skills (debugging, tuning, networking, storage stack).
  • Proficiency in scripting (e.g., Bash, Python) for job orchestration and result parsing.
  • Ability to create clear Excel graphs and presentations from raw benchmark data.
  • Strong communication skills - able to convey technical results and trade-offs to engineering and customer-facing teams.

Preferred Skills:

  • Experience with RAG pipelines, vector databases (e.g., FAISS, Milvus, Qdrant).
  • Familiarity with Kubernetes and CSI-based persistent volume systems.
  • Understanding of GPU profiling tools (Nsight, nvprof, PyTorch Profiler).
  • Knowledge of telemetry and monitoring frameworks (e.g., Prometheus, Grafana).
  • Prior work publishing or presenting technical performance results.

Personal Attributes:

  • Self-driven, resourceful, and capable of independent problem-solving.
  • Able to context-switch between deep technical work and high-level communication.
  • Comfortable working across distributed teams and time zones.
DDN

DDN has a very strong orientation towards these 4 characteristics and any successful employee will demonstrate these capabilities:

Self-Starter - Takes independent action to identify and solve problems. Seeks out relevant information needed to make decisions. Gets involved with new initiatives.

Success/Achievement Orientation - Delivers quality results consistently. Targets, achieves (or exceeds) measurable results. Sets challenging goals, focuses on critical priorities, and is accountable.

Problem Solving - Recognizes problems and responds with a systematic assessment that identifies and addresses cause of issue. Practical, realistic, and resourceful.

Innovative - Builds and improves key business processes that enhance the effectiveness of DDN. Generates new ideas, challenges the status quo, and solves problems creatively.

DataDirect Networks, Inc. is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

#LI-Remote

 

Salary.com Estimation for Senior Benchmark & Performance Engineer - AI & Storage Systems in Los Angeles, CA
$93,498 to $126,108
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior Benchmark & Performance Engineer - AI & Storage Systems?

Sign up to receive alerts about other jobs on the Senior Benchmark & Performance Engineer - AI & Storage Systems career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$51,050 - $68,081
Income Estimation: 
$59,001 - $77,833
Income Estimation: 
$76,886 - $129,770
Income Estimation: 
$112,685 - $163,282
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at DataDirect Networks

  • DataDirect Networks Columbia, MD
  • Staff Performance Engineer Job Locations US-MD-Columbia | US-CO-Colorado Springs Job ID 2025-5301 Name Linked Office: Columbia, MD Country United States Ci... more
  • 13 Days Ago

  • DataDirect Networks Columbia, MD
  • Staff Software Engineer Job Locations US-MD-Columbia Job ID 2025-5403 Name Linked Office: Columbia, MD Country United States City Columbia Worker Type Regu... more
  • 15 Days Ago

  • DataDirect Networks Los Angeles, CA
  • Implementation Architect Job Locations US-Remote Job ID 2025-5471 Name Linked Remote: US Country United States City Remote Worker Type Regular Full-Time Em... more
  • 6 Days Ago

  • DataDirect Networks Columbia, MD
  • Software Systems Engineer Job Locations US-MD-Columbia Job ID 2025-5404 Name Linked Office: Columbia, MD Country United States City Columbia Worker Type Re... more
  • 11 Days Ago


Not the job you're looking for? Here are some other Senior Benchmark & Performance Engineer - AI & Storage Systems jobs in the Los Angeles, CA area that may be a better fit.

  • Salt Ai Malibu, CA
  • Our MissionSalt AI is founded by industry veterans in high-performance computing (HPC), artificial intelligence and life sciences. Salt AI is dedicated to ... more
  • 2 Months Ago

  • Art Logic Pasadena, CA
  • Senior Embedded AI / Video Systems Engineer(Remote – Contract through Art Logic)Join us to solve real problems with real people, in a remote-first, flexibl... more
  • 2 Months Ago

AI Assistant is available now!

Feel free to start your new journey!