Machine Learning Engineer, Ads Training Platform
RedditFull Time
Mid-level (3 to 4 years), Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
At least 2 years of relevant work experience is required.
A solid background in algorithms, data structures, system design, and experience in building scalable and fault-tolerant distributed systems are required. Knowledge of distributed model training and inference, as well as GPU programming, is preferred.
This information is not specified in the job description.
This information is not specified in the job description.
The Ray Core team develops and maintains the Ray C++ backend, including the distributed scheduler, language runtime integration, I/O and memory subsystems, focusing on reliability, scalability, performance, and supporting higher-level libraries.
Platform for scaling AI workloads
Anyscale provides a platform designed to scale and productionize artificial intelligence (AI) and machine learning (ML) workloads. Its main product, Ray, is an open-source framework that helps developers manage and scale AI applications across various fields, including Generative AI, Large Language Models (LLMs), and computer vision. Ray allows companies to enhance the performance, fault tolerance, and scalability of their AI systems, with some users reporting over 90% improvements in efficiency, latency, and cost-effectiveness. Anyscale primarily serves clients in the AI and ML sectors, including major companies like OpenAI and Ant Group, who rely on Ray for training large models. The company operates on a software-as-a-service (SaaS) model, charging clients a subscription fee for access to the Ray platform. Anyscale's goal is to empower organizations to effectively scale their AI workloads and optimize their operations.