What are the responsibilities and job description for the CAS 2026 Data Engineer Summer Intern position at CAS?
CAS uses intuitive technology, unparalleled scientific content, and unmatched human expertise to help companies create groundbreaking innovations that benefit the world. As the scientific information solutions division of the American Chemical Society, CAS manages the largest curated reservoir of scientific knowledge and, for over 117 years, has helped innovators mine, assess, and apply that information to keep businesses thriving. The CAS team is global, diverse, and endlessly curious, and it strives to make scientific insights accessible to innovators worldwide.
CAS is currently seeking a Data Engineer Intern for Summer 2026. This position will be located in our headquarters in Columbus, Ohio.
Data Engineers at CAS are passionate about building scalable data pipelines, transforming raw data into usable formats, and enabling data-driven decision-making across the organization. They work closely with data scientists, analysts, and software engineers to ensure data is accessible, reliable, and optimized for performance. Data Engineers are skilled in designing and implementing robust data architectures, integrating diverse data sources, and applying best practices in data governance and engineering. The internship will run from May 18, 2026, to August 7, 2026.
Job Accountabilities
- Design, build, and maintain scalable data pipelines and ETL processes
- Collaborate with cross-functional teams to understand data requirements and deliver solutions
- Clean, transform, and organize data from structured and unstructured sources
- Support the development and optimization of data models and database schemas
- Monitor data quality and integrity across systems and proactively address issues
- Assist in the migration and integration of data across platforms and environments
- Document data workflows, processes, and technical specifications
- Contribute to the development of internal tools and automation scripts to improve data operations
- Perform other duties as assigned
Qualifications
- Pursuing a degree in Computer Science, Data Engineering, Information Systems, or a related field (Junior or Senior status as of Fall 2025)
- Experience with Python or Java, and SQL
- Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and data tools (e.g., Spark, Airflow, Kafka) is a plus
- Demonstrated problem-solving, analytical, and organizational skills
- Ability to work with large datasets and understand data structures and formats
- Strong written and verbal communication skills
- Ability to work effectively in an open, agile environment as well as independently
- Team-oriented mindset with a willingness to learn and contribute