Machine Learning Engineer
HangFull Time
Senior (5 to 8 years)
London, England, United Kingdom
Key technologies and capabilities for this role
Common questions about this position
Yes, the position is fully remote with no restrictions on location, though the company has offices in London, Paris, Toronto, San Francisco, and New York.
This information is not specified in the job description.
Required skills include extremely strong software engineering, proficiency in Python and ML frameworks like JAX, PyTorch, and XLA/MLIR, experience with distributed training infrastructures such as Kubernetes and Slurm, large-scale distributed training strategies, and hands-on experience training large models with a focus on post-training optimization.
Cohere emphasizes working hard and moving fast, obsessing over builds, high individual responsibility for model capabilities, a diverse range of perspectives, and a collaborative environment blending engineering and research with top talent.
Strong candidates have hands-on experience with post-training large models at scale, proficiency in Python/ML frameworks and distributed systems, plus bonus for publications at top-tier venues like NeurIPS or ICML.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.