What are the responsibilities and job description for the Artificial Intelligence Researcher position at SoTalent?
Job Title: AI Researcher
Location: New York, NY
Type: Full-time
Our client is looking for an AI Researcher who is passionate about advancing AI and applying cutting-edge research to solve real-world problems, helping shape the next generation of intelligent systems.
What You’ll Do
- Conduct applied research to develop and optimize large-scale AI models, including foundation models and LLMs.
- Design, train, evaluate, and deploy models across language, vision, graphs, and sequential data.
- Explore state-of-the-art techniques in self-supervised learning, robustness, explainability, RLHF, and more.
- Work with advanced AI stacks such as PyTorch, Hugging Face, Lightning, AWS UltraClusters, and vector databases.
- Translate research insights into production-ready solutions that deliver measurable business impact.
What We’re Looking For
- PhD (or Master’s with research experience) in Computer Science, AI, Machine Learning, Mathematics, or a related field.
- Strong programming skills in Python, Go, Scala, or Java.
- Deep understanding of AI fundamentals and experience with large-scale deep learning models.
- Proven ability to publish or contribute to impactful research at top-tier conferences (e.g., NeurIPS, ICML, ICLR, ACL).
- Ability to define and execute a research agenda, from problem selection to implementation.
Preferred Expertise
- Large Language Models: Pretraining, fine-tuning, optimization, and scaling (10B parameters).
- Graph & Sequential Models: GNNs, time-series, recommender systems, and large-scale graph modeling.
- Optimization: Model sparsification, quantization, parallelism, gradient checkpointing, and compiler-level improvements.
- Contributions to open-source frameworks (e.g., PyTorch Geometric, DGL).
- Experience deploying research-driven models in production environments.