2 Data Engineer Jobs in Westport, CT

Bridgewater Associates
Westport, CT | Full Time
$129k-158k (estimate)
2 Months Ago
Catalytic Data Science
Westport, CT | Full Time
$129k-159k (estimate)
2 Weeks Ago
Data Engineer
$129k-159k (estimate)
Full Time | IT Outsourcing & Consulting 2 Weeks Ago

Catalytic Data Science is Hiring a Data Engineer Near Westport, CT

Data Engineer III (Large Language Models)

About Catalytic Data Science (CDS):

Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate volumes of scientific resources, data, and analytic tools while providing the ability to network with colleagues in one secure and scalable environment. By enabling R&D teams to work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy. Our customers are passionate about making the world a better place, and we are inspired by the opportunity to help them.

The Role

You are a Data Engineer with experience in processing terabytes of data and working with large language models (LLMs). You have experience in creating and automating scalable, fault-tolerant, and reproducible data pipelines for natural language processing (NLP) using Amazon AWS technologies. You will design and implement data ingestion, processing, and storage solutions that can handle massive amounts of text data from various sources. You are interested in helping to create a platform completely built on top of AWS. You are eager to join a team of Life Scientists and Software Engineers that believe the brightest minds in research should have the best tools to drive innovation.

What You’ll Do

  • Build, test, and operate automated Extract, Transform, and Load (ETL) pipelines that process terabytes of text data nightly
  • Develop service frontends around our various backend data stores (AWS Aurora, MySQL, Elasticsearch, S3)
  • Rapidly prototype, test, and deploy data pipelines for LLMs using AWS.
  • Collaborate with data scientists and NLP engineers to understand the data requirements and specifications for LLMs and related tasks such as text summarization, translation, and question answering.
  • Optimize the performance, reliability, and scalability of the data pipelines and LLMs by applying best practices and techniques such as data partitioning, caching, compression, and monitoring.
  • Ensure the quality, integrity, and security of the data by implementing data validation, cleaning, and governance policies and procedures.
  • Research and evaluate new technologies and methods for data engineering and LLMs and stay updated with the latest trends and developments in the field.
  • Participate in data architecture and engineering decisions, bringing your strong experience and knowledge to bear.

Qualifications

  • Bachelor's degree or higher in computer science, engineering, or a related field.
  • 3 years of experience in data engineering, preferably with large-scale text data and LLMs, and 6 years of software engineering experience overall (including data engineering).
  • Proficient in Python 3 or Java, preferably both.
  • Experience with data modeling, ETL, and data warehouse design and implementation.
  • Expertise with ETL schedulers such as Airflow, Prefect or similar frameworks.
  • Familiar with LLMs and NLP concepts and frameworks such as Transformers, BERT, GPT, PaLM, and LLaMA.
  • Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, and SNS
  • Experience extracting, processing, storing, and querying petabyte-scale datasets
  • Familiarity with building and using containers
  • Familiarity with event-based microservices
  • Strong communication, collaboration, and problem-solving skills.

Core Skills:

  1. ETL Processes
  2. Data Modeling and Database Design
  3. Proficiency in Large Language Models
  4. Data Pipeline Optimization
  5. Cross-functional Collaboration
  6. Problem-solving and Analytical Skills

Nice-to-Haves

  • Prior experience with Elasticsearch (custom development and/or administration) is a huge plus
  • Knowledge of Graph databases

What Do We Love in Team Members?

Your specialization is less important than your ability to learn fast and adapt to shifting technologies. We’re especially fond of people who:

  • Focus on customers’ needs and our company’s goals, not just writing code
  • Iterate until customers love what you’ve built
  • Self-start and initiate
  • Self-organize
  • Strive to grow personally and professionally, beyond just expanding technical abilities
  • Love to experiment with new technology and share knowledge with the team

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

Job Summary

JOB TYPE

Full Time

INDUSTRY

IT Outsourcing & Consulting

SALARY

$129k-159k (estimate)

POST DATE

04/24/2024

EXPIRATION DATE

06/23/2024

WEBSITE

catalyticds.com

HEADQUARTERS

WILTON, CT

SIZE

<25

FOUNDED

2010

CEO

SCOTT SACANE

REVENUE

<$5M

About Catalytic Data Science

We are a team of life scientists and software engineers who believe the brightest minds in science should have access to the best tools that are key to driving innovation. The Catalytic Platform is a new kind of R&D cloud built specifically for how life scientists work. By providing researchers with the best digital tools and networking them with colleagues, we're empowering R&D teams to generate novel insights, compress the time and money required to achieve key R&D milestones and produce knowledge that can be monetized to drive business forward.

Catalytic Data Science
Full Time
$87k-117k (estimate)
5 Months Ago
Catalytic Data Science
Full Time
$125k-159k (estimate)
6 Months Ago
Catalytic Data Science
Full Time
$92k-107k (estimate)
11 Months Ago

Skills required for a Data Engineer include Python, Data Engineering, AWS, Data Warehouse, Computer Science, and Java, among others. Having related skills and expertise gives you an advantage when applying for Data Engineer positions and can affect the salary you are offered. Below are job openings related to the skills required of a Data Engineer. Select any job title you are interested in to view its requirements.

For the skill of Python
STS Technical Services
Full Time
$147k-174k (estimate)
2 Weeks Ago
For the skill of Data Engineering
Bridgewater Associates
Full Time
$129k-158k (estimate)
2 Months Ago
For the skill of AWS
BestLogic Staffing
Full Time
$50k-65k (estimate)
1 Week Ago

The following career advancement route for Data Engineer positions can serve as a reference for future career planning. A Data Engineer can be promoted to a senior position such as Database Engineer IV, which is expected to handle more key tasks and is paid a higher salary than an ordinary Data Engineer. Explore the career advancement options for a Data Engineer below and select a title of interest to see hiring information.

Impact Solutions
Full Time
$127k-159k (estimate)
1 Month Ago
Mitsubishi HC Capital America Inc
Full Time
$164k-199k (estimate)
2 Months Ago

If you are interested in becoming a Data Engineer, you need to understand the job requirements and the detailed related responsibilities. Of course, a good educational background and an applicable major will also help in job hunting. Below are some tips on how to become a Data Engineer for your reference.

Step 1: Understand the job description and responsibilities of a Data Engineer.

Quotes from people on Data Engineer job description and responsibilities

The data engineer develops and maintains the enterprise data framework for continued use.

03/12/2022: Dothan, AL

A data engineer prepares data for analytical or operational uses.

03/03/2022: Boulder, CO

Data engineers simplify complex data structures and prevent the duplication of data.

03/28/2022: New Suffolk, NY

Data Engineers are the technical professionals who prepare data that can be used by data scientists for valuable decisions and strategies.

04/13/2022: Harrisburg, PA

Step 2: Knowing the best tips for becoming a Data Engineer can help you explore the needs of the position and prepare for the job-related knowledge well ahead of time.

Career tips from people on Data Engineer jobs

A data engineer should be aligned with a data scientist’s needs while creating a data system.

04/10/2022: Fayetteville, NC

Start with an entry-level position.

02/10/2022: Winston Salem, NC

Consider pursuing additional professional engineering or big data certifications.

03/09/2022: Saginaw, MI

Step 3: View the best colleges and universities for Data Engineers.

Butler University
Carroll College
Cooper Union
High Point University
Princeton University
Providence College