[Remote] Sr Data Scientist - Gen AI ML - Tampa/Irving
Note: The job is a remote job and is open to candidates in USA. Photon is seeking a Generative AI Engineer to build, optimize, and scale production-ready AI applications. The role involves designing complex multi-agent systems, implementing advanced RAG pipelines, and managing the deployment of both frontier and local LLMs.
Responsibilities
- Develop and orchestrate sophisticated AI workflows using LangGraph and multi-agent architectures
- Build and maintain Advanced RAG systems utilizing LlamaIndex and vector databases for high-accuracy retrieval
- Integrate and swap diverse LLMs (commercial and open-source) based on performance and cost requirements
- Design and deploy high-performance, scalable backend services using FastAPI and Async Python
- Fine-tune large language models (LLMs) using PyTorch/TensorFlow to improve domain-specific performance
- Optimize GenAI workflows for latency, cost, and reliability using advanced prompt engineering and monitoring tools
- Containerize and deploy AI services via Docker to production environments
Skills
- 7+ years of experience; Hands-on experience building and deploying GenAI applications in a production setting
- Strong proficiency in Python and the modern AI library ecosystem (LangChain, LlamaIndex, etc.)
- Experience with vector search, embedding models, and advanced data retrieval patterns
- Knowledge of model fine-tuning techniques and local LLM quantization/hosting
- Familiarity with production-grade monitoring, API security, and CI/CD for ML
Benefits
- Medical, vision, and dental benefits
- 401k retirement plan
- Variable pay/incentives
- Paid time off
- Paid holidays
Company Overview
Company H1B Sponsorship