Senior Machine Learning Engineer - Machine Learning Infrastructure
FlipFull Time
Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
Yes, this is a remote position.
Required skills include strong engineering experience in large-scale distributed training or HPC systems, deep familiarity with JAX internals and distributed training libraries, experience with multi-node cluster orchestration like Slurm or Kubernetes, comfort debugging performance issues across CUDA/NCCL and data pipelines, and experience with containerized environments like Docker.
This information is not specified in the job description.
Cohere obsesses over what they build, works hard and moves fast to serve customers, values a team of the best researchers, engineers, and designers passionate about their craft, and believes diverse perspectives are essential for great products.
A strong candidate has a track record of building tools that increase developer velocity for ML teams, excellent judgment on trade-offs like performance vs complexity, and strong collaboration skills to work with infra, research, and deployment teams.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.