Demo

Data Engineer

Rapsys Technologies
Boston, MA Contractor
POSTED ON 4/23/2026
AVAILABLE BEFORE 5/23/2026

Data Engineer - : Apache Airflow, Apache NiFi
• Orchestration: Apache Airflow, Apache NiFi.
• Programming: Java (Core), Python (for Airflow), Unix Shell Scripting.
• Big Data/Storage: Apache Spark, MinIO, AWS S3.
• Security: SSL/TLS, Certificate Management, IAM, Java Keystores.
• OS: Linux/Unix (RHEL/Ubuntu).

Key Responsibilities:
• Pipeline Orchestration: Design and develop complex, reusable DAGs in Apache Airflow to automate data workflows, scheduling, and monitoring across the enterprise.
• Data Ingestion & Flow: Create and optimize high-volume data streams using Apache NiFi, leveraging custom processors and controller services for diverse data sources and sinks.
• Custom Development: Utilize Java to develop custom NiFi processors or troubleshoot the core NiFi framework, ensuring the platform meets specific architectural requirements.
• Object Storage Management: Implement and manage data storage solutions using MinIO and AWS S3, ensuring high availability and efficient data retrieval patterns.
• Large-Scale Processing: Develop and maintain Apache Spark jobs for heavy-duty data transformations, integrating them into NiFi and Airflow orchestration layers.
• Security & Certificate Management: Secure NiFi clusters and data flows using TLS/mTLS, managing Java KeyStores (JKS), TrustStores, and SSL certificate lifecycles to ensure data-in-motion security.
• System Administration: Perform environment setup, performance tuning, and troubleshooting within Unix/Linux environments, including shell scripting for task automation.
• Cloud Integration: Deploy and manage data infrastructure components on AWS, utilizing IAM for access control and integrating cloud-native services with hybrid data pipelines.
• Monitoring & Optimization: Establish robust logging and alerting for data pipelines to proactively identify bottlenecks, ensuring 99.9% reliability of data delivery.



Role Descriptions: Key ResponsibilitiesPipeline Orchestration Design and develop complex| reusable DAGs in Apache Airflow to automate data workflows| scheduling| and monitoring across the enterprise.Data Ingestion Flow Create and optimize high-volume data streams using Apache NiFi| leveraging custom processors and controller services for diverse data sources and sinks.Custom Development Utilize Java to develop custom NiFi processors or troubleshoot the core NiFi framework| ensuring the platform meets specific architectural requirements.Object Storage Management Implement and manage data storage solutions using MinIO and AWS S3| ensuring high availability and efficient data retrieval patterns.Large-Scale Processing Develop and maintain Apache Spark jobs for heavy-duty data transformations| integrating them into NiFi and Airflow orchestration layers.Security Certificate Management Secure NiFi clusters and data flows using TLSmTLS| managing Java KeyStores (JKS)| TrustStores| and SSL certificate lifecycles to ensure data-in-motion security.System Administration Perform environment setup| performance tuning| and troubleshooting within UnixLinux environments| including shell scripting for task automation.Cloud Integration Deploy and manage data infrastructure components on AWS| utilizing IAM for access control and integrating cloud-native services with hybrid data pipelines.Monitoring Optimization Establish robust logging and alerting for data pipelines to proactively identify bottlenecks| ensuring 99.9 reliability of data delivery.
Essential Skills: Key ResponsibilitiesPipeline Orchestration Design and develop complex| reusable DAGs in Apache Airflow to automate data workflows| scheduling| and monitoring across the enterprise.Data Ingestion Flow Create and optimize high-volume data streams using Apache NiFi| leveraging custom processors and controller services for diverse data sources and sinks.Custom Development Utilize Java to develop custom NiFi processors or troubleshoot the core NiFi framework| ensuring the platform meets specific architectural requirements.Object Storage Management Implement and manage data storage solutions using MinIO and AWS S3| ensuring high availability and efficient data retrieval patterns.Large-Scale Processing Develop and maintain Apache Spark jobs for heavy-duty data transformations| integrating them into NiFi and Airflow orchestration layers.Security Certificate Management Secure NiFi clusters and data flows using TLSmTLS| managing Java KeyStores (JKS)| TrustStores| and SSL certificate lifecycles to ensure data-in-motion security.System Administration Perform environment setup| performance tuning| and troubleshooting within UnixLinux environments| including shell scripting for task automation.Cloud Integration Deploy and manage data infrastructure components on AWS| utilizing IAM for access control and integrating cloud-native services with hybrid data pipelines.Monitoring Optimization Establish robust logging and alerting for data pipelines to proactively identify bottlenecks| ensuring 99.9 reliability of data delivery.
Desirable Skills:
Keyword:
Skills: Digital : Python~Digital : Apache Spark~Digital : Databricks~Core Java~Unix / Linux Basics and Commands
Experience Required: 6-8

Salary : $55 - $60

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Data Engineer?

Sign up to receive alerts about other jobs on the Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$92,929 - $122,443
Income Estimation: 
$122,257 - $154,284
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Rapsys Technologies

  • Rapsys Technologies Biloxi, MS
  • (Core Banking – Episys Platform) 1. Technical Core Skills Symitar / Core Banking · Strong hands-on experience with Symitar Episys Core platform · Advanced ... more
  • Just Posted

  • Rapsys Technologies Stamford, CT
  • 1. 12 years of hands-on development experience in ServiceNow platform. 2. 5 years of experience specifically in Security Incident Response (SIR) and Vulner... more
  • Just Posted

  • Rapsys Technologies Morrisville, NC
  • Sr. IOS Developer (Swift UI, iOS, MVVM / Clean Architecture / Modularization Strong expertise in Swift and iOS SDKs Deep understanding of UIKit and/or Swif... more
  • Just Posted

  • Rapsys Technologies Cupertino, CA
  • Device Automation Test Engineer ***Must include Linkedin profile in submission*** Technical Skills: • Execute E2E testing across devices, backend services,... more
  • Just Posted


Not the job you're looking for? Here are some other Data Engineer jobs in the Boston, MA area that may be a better fit.

  • Tiger Data (creators of TimescaleDB) Boston, MA
  • At Tiger Data, formerly Timescale, we empower developers and businesses with the fastest PostgreSQL platform designed for transactional, analytical, and ag... more
  • 16 Days Ago

  • Connexions Data Inc Boston, MA
  • Hybrid role : AWS Test Automation Engineer 1 year option years Hybrid Near Boston Preferred, Active Secret Clearance needed Education and Years of Experien... more
  • 2 Days Ago

AI Assistant is available now!

Feel free to start your new journey!