Machine Learning Engineer
SweedFull Time
Mid-level (3 to 4 years)
Key technologies and capabilities for this role
Common questions about this position
Yes, this is a remote position in a remote-friendly environment, with the Model Efficiency team concentrated in EST and PST time zones.
Required skills include 5+ years of experience writing high-performance, production-quality code, strong programming skills in C++ or Python (Rust/Go welcome), experience with large language models and the LLM inference ecosystem (e.g., vLLM, SGLang), ability to diagnose performance bottlenecks, and a strong bias for action.
This information is not specified in the job description.
Cohere obsesses over what they build, works hard and moves fast for customers, values a team of top experts in their fields, and believes diverse perspectives are essential for great products.
A strong candidate has 5+ years of high-performance coding experience in C++ or Python, works with LLMs and inference tools, diagnoses bottlenecks, ships fast, and preferably has GPU/CUDA or transformer optimization experience.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.