What are the responsibilities and job description for the Data Automation Engineer position at Aptonet?
Data Automation Engineer
Location: Remote (Quarterly travel to Washington, DC as needed)
Position Type: Remote
Contract Duration: 6 Months (Extension Possible)
Target Start Date: 06/22/2026
Program: ATF
Clearance Requirement: Public Trust
Citizenship Requirement: U.S. Citizen Required
Conversion Potential: Possible Full-Time Opportunity
Role Summary
We are seeking a Data Automation Engineer to design and implement innovative data automation solutions supporting an enterprise-scale Microsoft Azure-based analytics and reporting platform. This role will work closely with customers, data engineers, developers, and subject matter experts to build scalable data pipelines, automate workflows, support AI/ML initiatives, and optimize cloud-based data environments.
The ideal candidate will have hands-on experience with Azure data services, ETL development, data engineering, scripting, automation, and cloud-based analytics platforms while supporting mission-critical federal programs.
Key Responsibilities
- Design, develop, and maintain high-performance data pipelines using:
- Azure Data Factory
- Azure Synapse Pipelines
- Apache Spark Notebooks
- Python
- SQL
- Stored Procedures
- Translate business requirements into scalable data engineering and AI-driven solutions
- Continuously improve automation tools for reliability, scalability, and adaptability
- Research and implement AI/ML and Generative AI solutions to automate data processes and eliminate workflow bottlenecks
- Collaborate with implementation specialists, engineering teams, and customers to develop data-driven solutions
- Design and implement data ingestion, transformation, integration, and processing solutions
- Support advanced analytics, reporting, visualization, and AI/ML initiatives
- Implement:
- Data migration
- Data quality
- Data integrity
- Metadata management
- Data security functions
- Monitor, troubleshoot, and optimize data pipeline performance
- Execute ETL performance testing and validate benchmark results
- Analyze:
- Pipeline runtime
- Throughput
- Latency
- Resource utilization
- Participate in performance testing for:
- Azure Data Factory (ADF)
- Azure Synapse
- Databricks
- Support performance tuning activities including:
- Query optimization
- Partitioning
- Indexing
- Validate data consistency and completeness after performance testing
- Collaborate with DevOps and infrastructure teams on compute, memory, and scaling optimization
- Document test results, findings, and recommendations
- Support Agile DevOps processes including Program Increment planning
- Maintain strict versioning and configuration control processes
Required Technical Skills
- 2 years of experience with two or more of the following:
- SQL
- T-SQL
- MDX/DAX
- Python
- PySpark
- Experience designing and building ETL and data engineering solutions
- Experience with:
- Azure Data Lake Services
- Azure Synapse Analytics
- Azure Data Factory
- Integration Runtime
- Experience with Microsoft data and BI technologies including:
- SQL Server
- Stored Procedures
- SSIS
- SSRS
- SSAS (Cubes)
- Power BI
- Experience automating data processes using:
- Azure CLI
- AWS CLI
- Bash
- PowerShell
- Experience with:
- Azure DevOps Repos
- GitHub
- Pipeline versioning
- Release management
- Experience supporting:
- Production environments
- Development environments
- Testing environments
- Integration environments
- Knowledge of Agile development methodologies
- Strong analytical, troubleshooting, and problem-solving skills
- Ability to support multiple projects simultaneously
- Strong communication and collaboration skills
Preferred / Nice-to-Have Skills
- Generative AI development experience
- Generative AI for Data Analytics experience
- Microsoft certifications including:
- Azure Fundamentals
- Azure Data Engineer
- Power BI
- Azure AI
- AWS Certified Data Engineer certification
- Experience with:
- Databricks
- REST APIs
- Docker
- Enterprise ETL toolsets
- Performance tuning experience including:
- Indexing
- Execution plans
- Query analytics
- Data profiling
- Knowledge of:
- Data encryption
- Cloud virtual networks
- Routing
- Firewalls
- Log Analytics
- Monitoring tools
- Experience with:
- ARM templates
- Bicep templates
- RBAC access controls
- Data lineage and impact analysis experience using:
- Microsoft Purview
- Synapse Pipeline Tracing
Qualifications & Experience
- Bachelor's degree in:
- Computer Science
- Related technical field
- 2 years of relevant professional experience
- U.S. Citizenship required
- Ability to successfully obtain and maintain a Public Trust clearance
- Demonstrated commitment to continuous learning and professional development
About the Team / Company
This role supports a federal ATF program focused on modernizing enterprise data analytics and automation capabilities through cloud-native technologies, AI-driven solutions, and scalable data engineering practices. Team members work collaboratively to deliver secure, high-performance data solutions that improve operational effectiveness and decision-making.
Salary : $40 - $47