What are the responsibilities and job description for the Data Engineer (Data Pipelines & Modeling) position at Katalyst Healthcares and Lifesciences?
Responsibilities:
- Design and implement robust data ingestion pipelines from multiple sources (APIs, databases, files, streaming systems).
- Support C4C offline database migration, ensuring data accuracy and consistency.
- Integrate data from enterprise systems into centralized data platforms.
- Design and implement data models for Workforce planning.
- Service operations forecasting.
- Develop optimized schemas for reporting and analytics.
- Ensure data quality, integrity, and consistency across models.
- Strong experience in data engineering and pipeline development.
- Proficiency in Python / SQL.
- Hands-on experience with Apache Spark or similar big data tools.
- Strong understanding of ETL/ELT concepts and data warehousing.
- bility to work independently and in cross-functional teams.
- Bachelor's / Master's in Computer Science, IT, or related field.
- Exposure to CI/CD tools like Jenkins or GitHub Actions.
- Knowledge of cloud platforms (AWS / Azure / Google Cloud Platform).
- Experience in healthcare or regulated environments.