Machine Learning Engineer
SweedFull Time
Mid-level (3 to 4 years)
Key technologies and capabilities for this role
Common questions about this position
Previous experience working as a Data Scientist, Machine Learning Engineer, or as an Engineer working with ML models or GenAI applications in production is required.
Candidates need comfort working in public Cloud environments (AWS, Azure, GCP), knowledge of machine learning frameworks such as TensorFlow, PyTorch, or Scikit-learn, and knowledge of LLM/Agentic frameworks such as Llamaindex, LangGraph, and DSPy.
The engineering team is composed of industry veterans who have built deep learning infrastructure, autonomous drones, ridesharing marketplaces, ad tech, and much more, building systems that interact with some of the most complex software ever deployed in production.
This information is not specified in the job description.
A strong candidate is client-obsessed with entrepreneurial tendencies, even if they don't check every requirement, and has an understanding of ML/DS concepts, model evaluation strategies, lifecycle, and engineering considerations.
AI observability and model evaluation platform
Arize AI provides a platform focused on AI observability and evaluating language models. The platform allows companies to monitor, troubleshoot, and assess the performance of various machine learning models, including those used for natural language processing, computer vision, and recommendations. Users can access analytics and workflows that help identify and resolve issues within their AI systems, ensuring optimal performance. Key features include task-based evaluations for aspects like hallucination and relevance, as well as tools for visualizing query and knowledge base embeddings to enhance retrieval accuracy. Unlike many competitors, Arize AI specifically targets the needs of top AI companies, offering tailored solutions for continuous improvement of their models. The goal of Arize AI is to empower these companies to enhance their AI capabilities through effective monitoring and evaluation.