What are the responsibilities and job description for the AWS Python Data Scientist position at Capgemini?
Your address must be within 100 miles of the Houston office, and you must work 2 days in that office.
As a Data Scientist, you will work within the Analytics team to develop models using Natural Language Processing along with many other techniques.
- Developing ML models that allow us to be prescriptive as opposed to reactive.
- Staying informed of next generation data and analytical tools.
- Converting ad hoc analyses into scalable, self-serve solutions and tools that can be used on demand.
- Helping reveal actionable insights for individuals within various business functions.
- Cleaning, prepping and verifying the integrity of data for analyses.
- Demonstrable knowledge of common coding languages (Python, SQL, R) with ability to learn new skills and apply skills in new environments.
- Experience in Developing and Deploying GenAI Solutions: Building and fine-tuning generative AI/LLM solutions using AWS Bedrock and available foundation models like Claude, Llama, and Titan.
- Building ML Workflows: Leveraging a wide range of AWS services (S3, Lambda, SageMaker, API Gateway, etc.) to build end-to-end machine learning workflows.
- Implementing RAG Architectures: Designing and building Retrieval Augmented Generation (RAG) pipelines and developing custom embeddings and vector search solutions.
- Prompt Engineering and Optimization: Creating effective prompt engineering strategies, evaluation frameworks, and model optimization techniques.
- Collaboration: Working with cross-functional teams, including data engineers and software engineers, to integrate AI applications into existing systems and translate business problems into scalable AI solutions.
- Monitoring and Governance: Ensuring data quality, security, and compliance, and setting up monitoring and governance for production-ready AI applications.
Requirements
- 6 years of experience in the industry
- Design and Development: Design, develop, and implement data pipelines using AWS services such as AWS Glue, Lambda, S3, Kinesis, and Snowflake to process large-scale data.
- ETL Processes: Build and maintain robust ETL processes for efficient data extraction, transformation, and loading, ensuring data quality and integrity across systems.
- Data Warehousing: Design and manage data warehousing solutions on AWS, particularly with Redshift, for optimized storage, querying, and analysis of structured and semi-structured data.
- Data Lake Management: Implement and manage scalable data lake solutions using AWS S3, Glue, and related services to support structured, unstructured, and streaming data.
- Collaboration: Work closely with data scientists, analysts, and business stakeholders to understand data needs and deliver data solutions aligned with business goals.
- Documentation: Create and maintain documentation for data infrastructure, data pipelines, and ETL processes to support internal knowledge sharing and compliance.
- Required: 2 years of hands-on experience with Python and SQL
Life at Capgemini
Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:
- Collaborating with teams of creative, fun, and driven colleagues
- Flexible work options enabling time and location-based flexibility
- Company-provided home office equipment
- Virtual collaboration and productivity tools to enable hybrid teams
- Comprehensive benefits program (Health, Welfare, Retirement and Paid time off)
- Other perks and wellness benefits like discount programs and gym/studio access
- Paid Parental Leave and coaching, baby welcome gift, and family care/illness days
- Back-up childcare/elder care, childcare discounts, and subsidized virtual tutoring
- Tuition assistance and weekly hot skill development opportunities
- Experiential, high-impact learning series events
- Access to mental health resources and mindfulness programs
- Access to join Capgemini Employee Resource Groups around communities of interest
About Capgemini
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With a strong heritage of more than 55 years, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, cloud and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2023 global revenues of €22.5 billion.