Machine Learning Engineer
- Full Time
- Senior (5 to 8 years)
Candidates must have more than 5 years of experience building production-grade distributed systems on AWS or a similar cloud platform. Exceptional programming skills in Go, Rust, Python, or C++ are required, with a focus on reliability and scale. Proficiency with machine learning and NLP frameworks such as scikit-learn, PyTorch, and TensorFlow is essential, along with hands-on experience building pipelines that use the latest LLMs. Strong teamwork skills and the ability to communicate effectively with both technical and non-technical team members are also required.
As a Machine Learning Engineer specializing in Memory, you will design and implement a scalable approach to personalize LLM outputs based on user interactions. You will develop a system for compacting, storing, and retrieving bots' memories to enhance real-time LLM inference. Additionally, you will train ML models to incorporate personalization signals using techniques like LoRA, Adapters, MoE, HyperNetworks, and prompt tuning, while leveraging industry and academic research to advance the state of the art.
Social gaming platform with AI interactions
Cantina.com offers a platform that merges social gaming with artificial intelligence, allowing users to create and interact with AI-driven bots in a virtual space called "The Cantina." Users can customize their experiences by adding bots with unique personalities, engaging in conversations, and creating AI art, making each interaction personalized. The company stands out by focusing on user-driven content and a dynamic experience, appealing to tech-savvy individuals and social gamers. Cantina operates on a subscription model, aiming to provide an engaging environment that leverages the growing interest in AI and social gaming.