What are the responsibilities and job description for the Sr Full Stack Engineer – Generative AI & AWS, Python position at InfoVision, Inc.?
Job title: Sr Full Stack Engineer – Generative AI & AWS, Python
Location: Irving, TX
Duration: Long-term
Role Summary:
We are seeking a Senior Full Stack Engineer to design, build, and deploy scalable enterprise applications that combine Java/Python server-side development, AWS cloud services, and AI/LLM edge deployments.
Required Qualifications:
- 12 years of full stack development experience with strong server-side proficiency in Java (Spring Boot) and Python.
- Telecom industry experience is required.
- Hands-on experience building and deploying microservices on AWS, including services such as Lambda, ECS, API Gateway, and SageMaker.
- Demonstrated experience fine-tuning LLMs using Hugging Face Transformers, PEFT, or LoRA.
- Proven ability to deploy and optimize LLM inference on edge devices (CPU/edge GPU) using runtimes such as Ollama, llama.cpp, or ExecuTorch.
- Proficiency with containerization and orchestration tools including Docker and Kubernetes.
- Strong understanding of RESTful API design, event-driven architectures, and distributed microservices patterns.