What are the responsibilities and job description for the Data Engineer (Python, Google Cloud Platform, SQL) w2 position at ExecutivePlacements.com?
Overview
Job Title: Data Engineer (Python, Google Cloud Platform, SQL)
Location: Dallas, TX (Hybrid 2-3 days onsite)
Duration: Long-term Contract
Visa: OPT, (No CPT)
Experience: 8 Years
Screening Questions
Job Title: Data Engineer (Python, Google Cloud Platform, SQL)
Location: Dallas, TX (Hybrid 2-3 days onsite)
Duration: Long-term Contract
Visa: OPT, (No CPT)
Experience: 8 Years
Screening Questions
- Python Google Cloud Platform Pipeline Design How would you design a pipeline in Python to load raw JSON data from Google Cloud Storage into BigQuery? Mention the exact Google Cloud Platform services and Python libraries youd use.
- SQL Hands-On Write a query to return the top 3 highest-paid employees in each department from an Employees table. How would you optimize this if the table had 50M rows?
- BigQuery Performance Debugging A BigQuery job normally takes 5 minutes but suddenly takes over an hour. What are the first three things youd check?
- Dataflow / Streaming Cisco wants near real-time reporting from IoT devices. Which Google Cloud Platform services will you use, and how will you ensure the pipeline can scale and not drop data?
- Hybrid Environment Challenge Suppose Cisco has data in an on-prem SQL Server and wants it available daily in BigQuery. What steps and tools would you propose for migration and daily sync?
- Python Data Cleaning Check If you have a dataset with duplicates, missing values, and inconsistent date formats, explain which Pandas or PySpark functions you would use to clean it.