AI/ML Engineer
BambooHR · Full Time
Mid-level (3 to 4 years)
Common questions about this position
Compensation packages include base salary, equity, and benefits. The salary range displayed on each job posting reflects the minimum and maximum target for new hires, determined by work location, skills, experience, interview performance, and education. Your recruiter can share more about the specific salary range for your preferred location.
Ideal candidates have 1-3 years of experience training LLMs in production; experience with post-training methods such as RLHF/RLVR and PPO/GRPO; experience with multi-node LLM training and inference; proficiency in CUDA, PyTorch, transformers, and FlashAttention; and strong software engineering skills. A PhD or Master's degree in Computer Science or a related field is preferred, along with a passion for system optimization and knowledge of GPU cluster architecture.
You'll collaborate with ML teams in the Enterprise ML Research Lab to accelerate their research and development, working cross-functionally with other MLREs and AAIs on the Enterprise AI team serving enterprise clients.
Strong candidates demonstrate 1-3+ years of production LLM training experience, expertise in post-training algorithms such as RLHF and PPO, proficiency with GPU clusters and tools such as PyTorch and CUDA, and a relevant advanced degree.
AI platform for data and models
Scale AI provides a platform that helps businesses develop AI applications by using their enterprise data to customize generative models. The platform includes tools for collecting, curating, and annotating data, as well as features for evaluating and optimizing models. Scale works with a variety of clients, including major tech companies like Microsoft and Meta, government agencies such as the U.S. Army and Air Force, and startups like Brex and OpenSea. What sets Scale apart from its competitors is its comprehensive suite of tools and services focused on safely unlocking the value of AI. The company's goal is to enhance the performance of advanced language models and generative models, making AI more accessible and effective for its clients.