AI Engineer & Researcher, Inference
SpeechifyFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
Candidates need 4-8 years of experience in product management, AI infrastructure, or ML engineering roles, with at least 2 years focused on AI/ML workloads, plus a deep understanding of AI/ML workflows including model deployment, inference optimization, and data-access patterns.
Responsibilities include defining product roadmaps for AI-inference workflows, engaging with technical users to understand bottlenecks, partnering with engineering on features like GPU scheduling, working with customers to validate features, and staying current with AI infrastructure trends.
This information is not specified in the job description.
This information is not specified in the job description.
Alluxio has a world-class team of empathetic, enthusiastic, and creative people who work on tough big data problems.
Data management solutions for AI workloads
Alluxio.io focuses on optimizing data management for Artificial Intelligence (AI) and Machine Learning (ML) workloads. It offers two main products: Alluxio Enterprise Data and Alluxio Enterprise AI, which help businesses manage their data and AI tasks across various infrastructure setups. By providing a single interface, Alluxio simplifies the management of data silos, enhances performance, and reduces the complexity of handling different technology stacks. Its solutions can accelerate model training by 20 times and model serving by 10 times, while also maximizing the return on investment for infrastructure and achieving high GPU utilization. Alluxio's goal is to help businesses improve efficiency and performance in their AI and ML operations by eliminating data copies and enabling seamless data access.