Cohere

Member of Technical Staff, Data Analysis and Evaluation

London, England, United Kingdom

Not SpecifiedCompensation
Junior (1 to 2 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, AI & Machine Learning, Data ScienceIndustries

Position Overview

  • Location Type: Remote
  • Job Type: Full-Time
  • Salary: Not specified

Cohere's mission is to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises. We are building AI systems for content generation, semantic search, RAG, and agents, aiming for widespread AI adoption. We are a team of passionate researchers, engineers, designers, and more, each being one of the best in the world at their craft. We value diversity and believe it's a requirement for building great products.

Why this role?

As a Member of Technical Staff in Data Analysis and Evaluation, you will be crucial in ensuring the quality, reliability, and performance of our large language models (LLMs). Your responsibilities will include designing and conducting data collection, assessing dataset quality, and analyzing model robustness and generalisability. You will collaborate with researchers, engineers, and data annotators to drive data-driven decisions and enhance our AI systems. This role requires expertise in statistics, experimental design (including human annotators), and machine learning to ensure high-quality data and reliable model performance.

Responsibilities

  • Design and oversee data collection tasks, including supporting human annotators and ensuring data quality.
  • Develop and apply statistical methods to evaluate the quality and reliability of datasets.
  • Analyze and assess the generalisability and robustness of ML systems across diverse use cases.
  • Collaborate with teams to improve dataset quality and model performance.
  • Train and fine-tune large language models (LLMs) on distributed training infrastructures.
  • Conduct experiments to evaluate model performance and identify areas for improvement.

Requirements

  • Extremely strong software engineering skills.
  • Strong expertise in designing and conducting data collection tasks, including working with human annotators.
  • Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance.
  • Experience analyzing datasets with respect to their quality, biases, and suitability for training ML models.
  • Hands-on experience training large language models (LLMs) on distributed training infrastructures.
  • Familiarity with evaluating and improving the generalisability and robustness of ML systems.
  • Proficiency in programming languages such as Python and ML frameworks (e.g., PyTorch, TensorFlow, JAX).
  • Excellent communication skills to collaborate effectively with cross-functional teams and present findings.
  • One or more papers at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).

Note: We encourage applications even if not all criteria are met. Diverse backgrounds and perspectives are highly valued.

Company Information

  • Company: Cohere
  • Mission: To scale intelligence to serve humanity.
  • Focus: Training and deploying frontier models for AI systems.
  • Values: Obsession with product, hard work, fast pace, customer focus, passion for craft, diversity of perspectives.
  • Office Locations: London, Toronto, San Francisco, and New York.
  • Work Environment: Remote-friendly.

Application Instructions

Please apply if you are passionate about working hard on a glorious mission with like-minded teammates.

Skills

Data Analysis
Evaluation
Statistics
Experimental Design
Machine Learning
Data Collection
Model Robustness
Model Generalisability
Data Quality Assessment

Cohere

Provides NLP tools and LLMs via API

About Cohere

Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.

Toronto, CanadaHeadquarters
2019Year Founded
$914.4MTotal Funding
SERIES_DCompany Stage
AI & Machine LearningIndustries
501-1,000Employees

Risks

Competitors like Google and Microsoft may overshadow Cohere with seamless enterprise system integration.
Reliance on Nvidia chips poses risks if supply chain issues arise or strategic focus shifts.
High cost of AI data center could strain financial resources if government funding is delayed.

Differentiation

Cohere's North platform outperforms Microsoft Copilot and Google Vertex AI in enterprise functions.
Rerank 3.5 model processes queries in over 100 languages, enhancing multilingual search capabilities.
Command R7B model excels in RAG, math, and coding, outperforming competitors like Google's Gemma.

Upsides

Cohere's AI data center project positions it as a key player in Canadian AI.
North platform offers secure AI deployment for regulated industries, enhancing privacy-focused enterprise solutions.
Cohere's multilingual support breaks language barriers, expanding its global market reach.

Land your dream remote job 3x faster with AI