Infrastructure Software Engineer
BasetenFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
Candidates should have a Bachelor's degree in Computer Science, Engineering, or equivalent practical experience, plus 3+ years of experience. A strong background in control plane and data plane development, expertise in Kubernetes, container orchestration, and cloud-native infrastructure is essential.
The team builds the scalable, secure, and robust backbone for Anyscale's cloud platform, handling both the control plane for cluster management, scheduling, and user access, and the data plane for high-performance execution of distributed workloads.
This information is not specified in the job description.
This information is not specified in the job description.
Projects include designing services to orchestrate Ray clusters across cloud and on-prem, optimizing control plane for AI/ML workloads, building scheduling systems, enhancing reliability and observability, supporting accelerators like GPUs and TPUs, and handling container image management.
Platform for scaling AI workloads
Anyscale provides a platform designed to scale and productionize artificial intelligence (AI) and machine learning (ML) workloads. Its main product, Ray, is an open-source framework that helps developers manage and scale AI applications across various fields, including Generative AI, Large Language Models (LLMs), and computer vision. Ray allows companies to enhance the performance, fault tolerance, and scalability of their AI systems, with some users reporting over 90% improvements in efficiency, latency, and cost-effectiveness. Anyscale primarily serves clients in the AI and ML sectors, including major companies like OpenAI and Ant Group, who rely on Ray for training large models. The company operates on a software-as-a-service (SaaS) model, charging clients a subscription fee for access to the Ray platform. Anyscale's goal is to empower organizations to effectively scale their AI workloads and optimize their operations.