Senior Staff Machine Learning Engineer
Flex- Full Time
- Senior (5 to 8 years)
Candidates should possess extremely strong software engineering skills and proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR. Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray) is required, along with hands-on experience on training large models at scale.
As a Member of Technical Staff, you will design and write high-performant and scalable software for training models, consistently post-train the models to reach SOTA level performance, coordinate with other specialist teams, craft and implement techniques to improve model performance, research, implement, and experiment with ideas on supercompute and data infrastructure, and learn from and work with the best researchers in the field.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.