Research Staff, LLMs
DeepgramFull Time
Expert & Leadership (9+ years)
Candidates should have 5+ years of hands-on experience in large language model, NLP, and Transformer modeling, with a track record of landing major research impacts in a fast-paced environment. Experience supporting and leading research teams, excellent written and verbal communication skills, and published research in major machine learning conferences or journals are required. Previous experience in a customer-facing role is also preferred.
The Tech Lead Manager will lead a team of research scientists and engineers focused on developing and implementing novel evaluation methodologies, metrics, and benchmarks for large language models. This includes conducting research on existing evaluation techniques, designing new benchmarks for instruction following, factuality, robustness, and fairness, and implementing scalable evaluation pipelines. The role also involves communicating and collaborating with clients and peer teams, refining metrics, creating standardized evaluation protocols, publishing research findings, and staying up-to-date on research trends.
AI platform for data and models
Scale AI provides a platform that helps businesses develop AI applications by utilizing their enterprise data to customize generative models. The platform includes tools for collecting, curating, and annotating data, as well as features for evaluating and optimizing models. Scale works with a variety of clients, including major tech companies like Microsoft and Meta, government agencies such as the U.S. Army and Airforce, and startups like Brex and OpenSea. What sets Scale apart from its competitors is its comprehensive suite of tools and services that focus on safely unlocking the value of AI. The company's goal is to enhance the performance of advanced language models and generative models, making AI more accessible and effective for its clients.