What are the responsibilities and job description for the Data Engineer – OrcaWorks AI position at Orcaworks | Co-design Your Digital Coworker?
Location: Hybrid / Atlanta, GA
Experience Level: Entry-level (Master’s preferred)
About OrcaWorks AI
At OrcaWorks AI, we’re building next-generation AI systems that empower businesses to make data-driven decisions with intelligence and speed. We’re seeking passionate Data Engineers who love solving real-world data challenges and want to be part of a growing team building cutting-edge AI infrastructure.
Key Responsibilities
- Design, develop, and maintain data pipelines using tools like Airbyte and Prefect to feed AI and machine learning models.
- Integrate data from multiple structured and unstructured sources into unified and queryable layers using ElasticSearch or Vespa.
- Implement data validation, transformation, and storage solutions using modern ETL frameworks.
- Collaborate with AI, LLM, and data science teams to ensure reliable and optimized data flow for model training.
- Support database management, SQLModel, and data governance practices across services.
Required Skills & Qualifications
- Master’s degree (or Bachelor’s with equivalent experience) in Computer Science, Information Systems, or Data Engineering.
- Proficiency in Python and SQL; experience with PySpark or equivalent ETL frameworks.
- Hands-on experience with Airbyte, Prefect, and DBT.
- Familiarity with search and indexing systems like Vespa or ElasticSearch.
- Knowledge of cloud data platforms (AWS, GCP, or Azure) and API integration.
- Strong understanding of data security and applied AI workflows.