What are the responsibilities and job description for the Data Engineer :: 4 days a week onsite in Bentonville, AR :: W2 position at HYR Global Source Inc?
Position Name: Data Engineer
Location: 4 days a week onsite in Bentonville, AR
Position Type: W2/Full-time
Key Responsibilities
Design and build scalable ETL/ELT pipelines using Apache Airflow, Apache Spark, and GCP Dataflow
Develop and maintain BigQuery data models, schemas, and performance-optimized SQL queries
Build and maintain data pipelines feeding AI/ML feature stores and forecasting models
Collaborate with AI Developers to ensure high-quality, low-latency data access for model training
Manage and optimize Cloud Composer DAGs and pipeline orchestration
Implement data quality monitoring, alerting, and lineage tracking
Participate in data platform architecture decisions and documentation
Required Qualifications
3 years (Intermediate) or 5 years (Specialist) of data engineering experience
Hands-on experience with Apache Airflow for pipeline orchestration
Proficiency in Apache Spark for large-scale data processing
Strong SQL skills including complex query optimization and BigQuery-specific capabilities
Experience with GCP data services: BigQuery, Cloud Storage, Pub/Sub, Dataflow
Solid understanding of ETL/ELT patterns and data warehousing principles
Preferred Qualifications
GCP Professional Data Engineer certification
Experience supporting ML/AI data infrastructure (feature engineering, training datasets)
Familiarity with real-time streaming (Kafka, Dataflow/Flink)
Retail or large-scale consumer data experience
https://www.linkedin.com/company/hyr-global-source-inc