Demo

Kubernetes Big Data Engineer

Apptad
Raleigh, NC Contractor
POSTED ON 5/20/2026 CLOSED ON 5/24/2026

What are the responsibilities and job description for the Kubernetes Big Data Engineer position at Apptad?

Role: Kubernetes Big Data Engineer

Location: Rockville, MD/F2F Interview (3 days onsite & 2 days remote)

Duration: 6 months with long-term extensions

Job Description Summary

We are seeking a highly skilled and experienced Big Data Engineer to design, develop, and optimize large-scale data processing systems. In this role, you will work closely with cross-functional teams to architect data pipelines, implement data integration solutions, and ensure the performance, scalability, and reliability of big data platforms. The ideal candidate will have deep expertise in distributed systems, cloud platforms, and modern big data technologies such as Hadoop, Spark, and Kubernetes-based orchestration.

Responsibilities:

· Design, develop, and maintain large-scale data processing pipelines using Big Data technologies (e.g., Hadoop, Spark, Python, Scala).

· Architect and deploy containerized big data workloads on Amazon EMR on EKS (Elastic Kubernetes Service).

· Design and implement Kubernetes-based infrastructure for running Spark applications at scale.

· Implement data ingestion, storage, transformation, and analysis solutions that are scalable, efficient, and reliable.

· Stay current with industry trends and emerging Big Data technologies to continuously improve the data architecture.

· Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.

· Optimize and enhance existing data pipelines for performance, scalability, and reliability.

· Develop automated testing frameworks and implement continuous testing for data quality assurance.

· Conduct unit, integration, and system testing to ensure the robustness and accuracy of data pipelines.

· Work with data scientists and analysts to support data-driven decision-making across the organization.

· Ability to write and maintain automated unit, integration, and end-to-end tests.

· Monitor and troubleshoot data pipelines in production environments to identify and resolve issues.

· Manage Kubernetes clusters, pods, services, and deployments for big data workloads.

Essential Technical Skills:

AI Tool Proficiency:

· Hands-on experience with AI development tools (GitHub Copilot, Q Developer, ChatGPT, Claude, etc.)

Big Data Technologies:

· Experience with Big data technologies such as Hadoop, Spark, Hive & Trino

· Understanding of common issues like data skew and strategies to mitigate it, working with massive data volumes in PetaBytes, and troubleshooting job failures due to resource limitations, bad data, and scalability challenges.

· Real-world experience with debugging and mitigation strategies.

Container Orchestration & Kubernetes:

· Strong experience with Kubernetes architecture, concepts, and operations (pods, services, deployments, namespaces, ConfigMaps, Secrets)

· Hands-on experience with Amazon EMR on EKS (Kubernetes) for running Apache Spark workloads

· Experience with Kubernetes resource management, scheduling, and auto-scaling

· Knowledge of Helm charts for deploying and managing applications on Kubernetes

· Understanding of Kubernetes networking, storage (PVs, PVCs), and security best practices

· Experience with kubectl and Kubernetes YAML manifests

· Ability to troubleshoot Kubernetes cluster issues, pod failures, and resource constraints

· Experience integrating Spark with Kubernetes operators and dynamic allocation

AI Skills:

· Prompt Engineering: Proficiency in crafting effective prompts for AI coding assistants and analysis tools

· AI Workflow Design: Experience redesigning development processes to leverage AI capabilities

· Data Analysis: Ability to interpret AI-generated insights and translate them into actionable team improvements

· Change Management: Experience leading teams through AI adoption and workflow transformation

Apache Spark (Development, Internals & Tuning):

· Deep understanding of Spark's core architecture - executors, tasks, stages, DAG

· Expertise in Spark performance tuning techniques: partitioning, caching, broadcast joins, etc.

· Experience troubleshooting slow running/stuck jobs or resource issues in Spark

· Proven ability to optimize Spark jobs for large-scale datasets

· Experience running Spark on Kubernetes and understanding Spark-on-K8s architecture

Cloud Technologies:

· Experience with AWS services like S3, EMR, EMR on EKS, Glue, Lambda, Athena, etc.

· Hands-on experience using S3 with Spark (e.g., dealing with file formats, consistency issues)

· Strong experience with Amazon EKS (Elastic Kubernetes Service) architecture and best practices

· Experience with AWS IAM roles for service accounts (IRSA) for Kubernetes workloads

· Knowledge of AWS networking for EKS (VPC, subnets, security groups)

· Experience with AWS monitoring and logging tools (CloudWatch, CloudTrail) for Kubernetes workloads

· Serverless knowledge (Lambda, Fargate)

Programming - Python or Scala:

· Ability to write clean, modular, and perform code

· Experience with functional programming concepts (e.g., immutability, higher-order functions)

· Real-world use cases where scalable data processing code was implemented

· Strong understanding of collections, concurrency, and memory management

SQL Skills (Window Functions, Joins, Complex Queries):

· Proficiency with SQL window functions, multi-table joins, and aggregations

· Ability to write and optimize complex SQL queries

· Experience handling edge cases like NULLs, duplicates, and ordering

Good to have:

· Experience with managing production data pipelines/ETL systems

· Experience with CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions, ArgoCD)

· Experience with Infrastructure as Code (Terraform, CloudFormation) for provisioning EKS clusters and EMR on EKS

· Experience writing comprehensive test cases and test automation

· Experience with Docker and container image optimization

· Knowledge of service mesh technologies (Istio, Linkerd)

· Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack)

· AWS certifications (AI practitioner, Solutions Architect, Big Data Specialty, or Kubernetes certifications like CKA/CKAD)

· Experience with GitOps practices for Kubernetes deployments


Skillset:

  • Manager called this a Kubernetes Engineer
  • Transitioning from EMR on EC2 over to EMR on EKS
  • Experience with Helm charts, Spark on K8’s, Kubectl, etc…
  • Must also have Spark experience
  • Must be willing to create Pyspark data pipelines
  • Certifications are preferred but not mandatory (CKAD, CKA)


Education/Experience Requirements:

· Bachelor's degree in Computer Science, Information Systems or related discipline with at least five (5) years of related experience, or equivalent training and/or work experience; Master's degree and past Financial Services industry experience preferred.

· Demonstrated technical expertise in Object Oriented and database technologies/concepts which resulted in deployment of enterprise quality solutions.

· Extensive knowledge of industry leading software engineering approaches including Test Automation, Build Automation and Configuration Management frameworks.

· Strong written and verbal technical communication skills.

· Demonstrated ability to develop effective working relationships that improved the quality of work products..

· Ability to maintain focus and develop proficiency in new skills rapidly.

· Ability to work in a fast paced environment.

Hourly Wage Estimation for Kubernetes Big Data Engineer in Raleigh, NC
$58.00 to $73.00
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Kubernetes Big Data Engineer?

Sign up to receive alerts about other jobs on the Kubernetes Big Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$124,724 - $161,246
Income Estimation: 
$147,901 - $186,323
This job has expired.
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Apptad

  • Apptad Pittsburgh, PA
  • Job Title: Oracle HCM Payroll Configurator & Integration Lead Job Location: Pittsburgh, PA Onsite Day 1 in office Job Duration: Contract Job Description: S... more
  • 2 Days Ago

  • Apptad Pittsburgh, PA
  • Job Title: Oracle HCM Cloud Configuration Consultant Job Location: Pittsburgh, PA Onsite Day 1 in office Job Duration: Contract Position Summary: We are se... more
  • 2 Days Ago

  • Apptad Milwaukee, WI
  • Title : Desktop support Technician Location : Milwaukee, Wisconsin 53212 - (onsite) About Job role As a member of the Field Service Operations team the can... more
  • 3 Days Ago

  • Apptad West Roxbury, MA
  • Job Title: IT Field Service Support Location: West Roxbury, MA 02132 - Onsite Job responsibilities: Ability to travel to remote locations Have own vehicle ... more
  • 3 Days Ago


Not the job you're looking for? Here are some other Kubernetes Big Data Engineer jobs in the Raleigh, NC area that may be a better fit.

  • Infosys Raleigh, NC
  • Overview The Strategic Technology Group (STG) unit at Infosys is designed for power programmers and senior technologists who want to go beyond traditional ... more
  • 18 Days Ago

  • Amazon Data Services, Inc. Raleigh, NC
  • AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we?re the people who keep ... more
  • 1 Month Ago

AI Assistant is available now!

Feel free to start your new journey!