What are the responsibilities and job description for the Jr. Strategic Project Lead, DataFactory position at ChatGPT Jobs?
Job Description
Jr. Strategic Project Lead, DataFactory
Location: Santa Clara, CA / REMOTE
Duration: 12 months
Rate Range: $60-$100/hr
Company:
A global leader in AI and high-performance computing, powering innovations in gaming, robotics, and data science.
Job Description:
Seeking a seasoned, techno-functional leader to drive the development and execution of large-scale LLM training programs. This role is deeply technical and customer-facing, with significant emphasis on dataset creation, annotation workflows, and high-quality data pipelines for foundational model training. The ideal candidate combines deep technical understanding of the LLM training lifecycle, strong operational rigor, and exceptional cross-functional leadership.
Key Responsibilities:
- Lead the generation and delivery of high-quality, scalable LLM training datasets (SFT, RLHF, rubric-based evaluation, reasoning, and agentic workflows).
- Oversee the end-to-end data lifecycle from customer intake to delivery.
- Serve as the primary contact for large engagements, managing stakeholder expectations, requirements, and timelines.
- Collaborate with engineering, product, research, and delivery teams.
- Develop and refine best practices for prompt evaluation, data schema design, evaluation metrics, and human-in-the-loop QA.
- Manage and mentor leads, program managers, and annotators.
- Operate as a strategic business partner to customers.
- Build internal capability and contribute to improvements in data quality, tools, and processes.
Qualifications:
- 10 years of experience building and leading large-scale technical delivery teams (100+).
- Bachelor's degree in Engineering, Computer Science or equivalent practical experience.
- Demonstrated experience managing large-scale dataset generation or annotation for LLMs (RLHF or SFT pipelines preferred).
- Strong understanding of quality review mechanisms (prompt win rate, agreement metrics, inter-annotator consistency, preference modeling).
- Experience with human data generation across at least one modality (Text, Image, Video, Audio).
- Technical fluency in data platforms, machine learning concepts, and modern ML tooling (e.g., Hugging Face, LangChain, Weights & Biases).
- Strong communication skills with experience presenting to executives and managing client escalations.
Preferred Qualifications:
- Prior experience at AI data platform companies.
- Past experience with retrieval-augmented generation (RAG), fine-tuning, and human evaluation workflows.
- Familiarity with fine-tuning LLMs, prompt engineering at scale, and instruction dataset design.
- Technical fluency in Python and cloud infrastructure.
- Familiarity with benchmarks (SWE-bench, MMMLU, or equivalent).
Individual compensation is determined by skills, qualifications, experience, and location. Full-time roles are eligible for Medical, Dental, Vision, Commuter, and 401K benefits with company matching.