Member of Technical Staff, Training Performance Engineer at Cohere

London, England, United Kingdom

Compensation: Not Specified

Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)

Job Type: Full Time

Visa: Unknown

Industries: AI & Machine Learning, Data & Analytics, Enterprise Software

Skills

Key technologies and capabilities for this role

Python, JAX, PyTorch, XLA, MLIR, CUDA, Triton, Distributed Training, Transformers, Software Engineering, Kernel Design

Questions & Answers

Common questions about this position

What is the work arrangement for this role?

The position is hybrid, with offices in London, Toronto, San Francisco, and New York, and it is remote-friendly.

What are the required skills for this position?

Candidates need extremely strong software engineering skills; proficiency in Python and ML frameworks and compilers such as JAX, PyTorch, and XLA/MLIR; experience writing CUDA and Triton kernels for GPUs; experience with large-scale distributed training strategies; and familiarity with autoregressive sequence models such as Transformers.

What is the salary for this role?

This information is not specified in the job description.

What is the company culture like at Cohere?

Cohere emphasizes hard work, moving fast, and customer focus. Its team is a diverse group of professionals passionate about training frontier models to scale intelligence to serve humanity.

What makes a strong candidate for this role?

A strong candidate combines excellent software engineering skills with GPU kernel experience in CUDA and Triton, proficiency in Python ML frameworks, distributed training expertise, and familiarity with Transformers. Publications at top ML conferences such as NeurIPS or ICML are a bonus.

Cohere

Provides NLP tools and LLMs via API

About Cohere

Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Its services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.

Headquarters: Toronto, Canada

Year Founded: 2019

Total Funding: $914.4M

Company Stage: Series D

Industries: AI & Machine Learning

Employees: 501-1,000

Risks

Competitors like Google and Microsoft may overshadow Cohere with seamless enterprise system integration.

Reliance on Nvidia chips poses risks if supply chain issues arise or strategic focus shifts.

The high cost of its AI data center could strain financial resources if government funding is delayed.

Differentiation

Cohere's North platform outperforms Microsoft Copilot and Google Vertex AI in enterprise functions.

Rerank 3.5 model processes queries in over 100 languages, enhancing multilingual search capabilities.

Command R7B model excels in RAG, math, and coding, outperforming competitors like Google's Gemma.

Upsides

Cohere's AI data center project positions it as a key player in Canadian AI.

North platform offers secure AI deployment for regulated industries, enhancing privacy-focused enterprise solutions.

Cohere's multilingual support breaks language barriers, expanding its global market reach.
