What are the responsibilities and job description for the Data Engineer with Pyspark position at Hexplora?
Title: Data Engineer (PySpark Required)
Location: Rocky Hill, CT (Onsite)
Location: Rocky Hill, CT (Onsite)
Job Summary
We are seeking a skilled Data Engineer with strong experience in PySpark to join our team in Rocky Hill, CT. This is a fully onsite role where you will be responsible for designing, building, and optimizing scalable data pipelines and data processing systems. The ideal candidate is passionate about big data technologies, data architecture, and delivering high-quality data solutions that drive business insights.
Key Responsibilities
- Design, develop, and maintain scalable data pipelines using PySpark
- Build and optimize ETL/ELT workflows for large-scale data processing
- Work with structured and unstructured datasets from multiple sources
- Collaborate with data analysts, data scientists, and business stakeholders to understand data requirements
- Ensure data quality, integrity, and security across all data platforms
- Optimize data processing performance and troubleshoot data-related issues
- Implement best practices for data governance and data lifecycle management
- Maintain documentation for data processes, workflows, and systems
Required Qualifications
- Bachelor''s degree in Computer Science, Information Technology, or related field
- Hands on experience as a Data Engineer .
- Strong hands-on experience with PySpark (required)
- Proficiency in Python and SQL
- Experience working with big data technologies (e.g., Apache Spark, Hadoop ecosystem)
- Experience with cloud platforms such as AWS, Azure, or Google Cloud
- Familiarity with data warehousing concepts and tools
- Strong problem-solving and analytical skills
Infowave Systems is an equal opportunity employer that is committed to diversity and inclusion in the workplace.