What are the responsibilities and job description for the Machine Learning Systems Engineer position at SageBeans RPO?
Job Title: Machine Learning Systems Engineer
Requirements:
· 2–5 years of experience in ML Systems Engineering, focused on infrastructure for model training and inference systems.
· Experience building and optimizing high-performance model serving systems.
· Strong experience with distributed systems and cloud platforms (AWS, GCP, or Azure).
· Proficiency in Python and at least one systems programming language (C , Rust, or Go).
Preferred:
· Experience with ML serving frameworks such as vLLM, TensorRT, or ONNX Runtime.
· Familiarity with distributed training techniques (data parallel, model parallel, pipeline parallel).
· Familiarity with high-performance computing and GPU programming (CUDA).
· Experience with MLOps practices and tooling.
Tech Stack:
· vLLM, TensorRT, ONNX Runtime, PyTorch, TensorFlow, CUDA, Docker,Kubernetes, Python, AWS, Azure, Kubeflow, SGLang