Demo

Data Quality Engineer (Databricks, Kafka, AWS)

Plugins Inc
Dallas, TX Full Time
POSTED ON 6/3/2026
AVAILABLE BEFORE 7/3/2026
We are looking for a Data Quality Engineer to own validation across batch and streaming data pipelines. This role focuses on ensuring data correctness, reliability, and performance across platforms built on Databricks, Kafka, AWS, SQL, and Python.
This is a hands-on role focused on building scalable data validation frameworks and ensuring production-grade data systems.

Key Responsibilities
End-to-End Data Validation
* Validate data pipelines for accuracy, completeness, consistency, and timeliness
* Build SQL-based validations for business rules and transformations
* Implement reconciliation between source and downstream systems
* Ensure data lineage and traceability

ETL / ELT & Spark Testing
* Test pipelines built on AWS (Glue, Lambda, EMR, Step Functions)
* Validate transformations using SQL and Python
* Test ingestion, transformation, aggregation, and serving layers
* Handle backfills, reprocessing, and historical data loads
* Validate Spark pipelines (PySpark/Scala) on Databricks

Streaming (Kafka)
* Validate data integrity, ordering, and delivery guarantees
* Test producer and consumer logic and serialization formats (Avro, JSON, Protobuf)
* Validate topics, partitions, offsets, retention, and schema evolution
* Simulate late events, duplicates, and failure scenarios

Automation & Frameworks
* Build Python-based data testing frameworks
* Develop reusable validation utilities and synthetic datasets
* Integrate data tests into CI/CD pipelines
* Enable automated alerts for data quality issues

Performance & Reliability
* Validate throughput, latency, and concurrency at scale
* Test retry logic, idempotency, and recovery mechanisms
* Perform regression, soak, and failover testing

Observability
* Validate logs, metrics, and alerts using tools such as CloudWatch, Prometheus, and Grafana
* Define and monitor data SLAs and SLOs
* Support incident response, root cause analysis, and postmortems

Required Qualifications & Experience
* 7 years of total experience in QA, SDET, or Data Quality Engineering
* Minimum 4–6 years of hands-on experience working with data platforms, data pipelines, or data engineering ecosystems
* 3 years of hands-on experience with Databricks and Apache Spark
* Strong SQL skills for data validation, reconciliation, and complex analysis
* Proficiency in Python for automation and data validation
* Experience testing ETL/ELT pipelines (batch and streaming)
* Hands-on experience with Kafka or similar streaming platforms
* Strong understanding of AWS data services (S3, Glue, Lambda, Redshift, Athena)
* Experience working with large-scale distributed data systems
* Strong debugging, analytical, and problem-solving skills

Nice to Have
* Experience with data quality or observability tools such as Great Expectations or Monte Carlo
* Knowledge of schema registry and data contracts
* Experience with CI/CD tools such as GitHub Actions or Jenkins

Salary.com Estimation for Data Quality Engineer (Databricks, Kafka, AWS) in Dallas, TX
$91,392 to $118,604
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Data Quality Engineer (Databricks, Kafka, AWS)?

Sign up to receive alerts about other jobs on the Data Quality Engineer (Databricks, Kafka, AWS) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Income Estimation: 
$122,860 - $148,594
Income Estimation: 
$159,276 - $189,136
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Plugins Inc

  • Plugins Inc Cleveland, OH
  • Network Engineer πŸ“ Locations: Cleveland, OH | Middletown, OH | Butler, PA πŸ“„ Duration: 6 Months Contract-to-Hire 🏒 Work Type: 100% Onsite 🎯 Interview Pr... more
  • 7 Days Ago

  • Plugins Inc Houston, TX
  • Network Architect Greetings from Plugins Inc.!! We have an urgent requirement for a Network Architect position with our client. Position: Network Architect... more
  • 11 Days Ago


Not the job you're looking for? Here are some other Data Quality Engineer (Databricks, Kafka, AWS) jobs in the Dallas, TX area that may be a better fit.

  • Lennar Irving, TX
  • Job Description Data Quality Engineer We are Lennar Lennar is one of the nation's leading homebuilders, dedicated to making an impact and creating an extra... more
  • 23 Days Ago

  • CX Data Labs Mc Kinney, TX
  • Hi Title: Data Engineer with SAP BW Location: McKinney, TX, Onsite Type: Full Time About CX Data Labs: At CX Data Labs, we believe that systematically unde... more
  • 8 Days Ago

AI Assistant is available now!

Feel free to start your new journey!