Proven experience with Azure AI services, including Azure Machine Learning, Cognitive Services, Copilot Studio, and AI Foundry (or similar ISP tools)
Working knowledge of machine learning frameworks such as TensorFlow, PyTorch, or scikit-learn
Deep understanding of Generative AI and LLMs, evaluation metrics, and theory behind LLMs & conversational AI
Full understanding of the Retrieval Augmented Generation (RAG) pattern and various tradeoffs
Working knowledge of MCP and A2A protocols for Agentic based flows
Working knowledge of AI-based AI frameworks such as LangChain, AutoGen, LangGraph, LlamaIndex, etc
Deep and working knowledge of IaC and automation
Well versed in Software Quality Management (SQM) concepts and procedures
Familiar with containerization i.e. Azure Kubernetes Service (AKS) for managing and scaling containerized AI applications
Bachelor’s degree in Computer Science or related field required
5+ years of experience in AI/ML development
Excellent problem-solving, analytical skills, and strong communication and collaboration abilities
Knowledge of Infrastructure as Code (IaC) experience in any language, Terraform preferred
Full stack Azure application development
Security related concepts and controls, particularly in the cloud environment: OAuth, virtual networks, subnets, private endpoints, authentication mechanisms, identities, and RBAC
Data protection and encryption techniques and tooling such as key vaults, encryption keys, masking, and concepts such as BYOK
DevOps including CI/CD techniques and tools particularly Azure based and GitHub repos
Running large scale (either in parallel and/or distributed) training and inference jobs on private or public cloud infrastructure
Responsibilities
Lead, mentor, and grow a team of AI developers, fostering a culture of innovation, collaboration, and continuous learning
Define and execute the technical strategy for AI solution development on Microsoft Azure, ensuring alignment with business objectives
Oversee the design and development of AI models and algorithms to solve complex business problems
Drive the adoption of best practices in MLOps & LLMOps, ensuring scalable, reliable, and efficient AI model deployment and management
Stay abreast of the latest advancements in AI/ML technologies, particularly within the Azure ecosystem, and evaluate their potential application
Conduct code reviews, provide technical guidance, and contribute to the architectural design of AI systems
Ensure the scalability, reliability, and security of AI solutions
Manage project timelines, budgets, and resources
Communicate project status and results to stakeholders
Comply with industry standards, best practices, and regulations for AI solutions
Work across multiple projects in a fluid environment where work is required across the full research lifecycle from forming a hypothesis, acquiring data, and developing ETL-style software to presenting findings
Provide guidance to other software development teams as prototypes and frameworks are engineered for full production environments