[Remote] Software Engineer, Internal Infrastructure (North America) at Cohere

Toronto, Ontario, Canada

Compensation: Not Specified
Experience Level: Junior (1 to 2 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Artificial Intelligence, Technology

Requirements

  • Deep experience running Kubernetes clusters at scale and/or scaling and troubleshooting Cloud Native infrastructure, including Infrastructure as Code
  • Strong programming skills in Go or Python
  • Prefer contributing to Open Source solutions rather than building solutions from the ground up
  • Self-directed and adaptable, excelling at identifying and solving key problems
  • Draw motivation from building systems that help others be more productive
  • See mentorship, knowledge transfer, and review as essential prerequisites for a healthy team
  • Have excellent communication skills and thrive in fast-paced environments
  • Willingness to participate in a 24x7 on-call rotation

Responsibilities

  • Build and operate Kubernetes compute superclusters across multiple clouds
  • Partner with cloud providers to optimize infrastructure costs, performance, and reliability for AI workloads
  • Work closely with research teams to understand their infrastructure needs and identify ways to improve stability, performance, and efficiency of novel model training techniques
  • Design and build resilient, scalable systems for training AI models, focusing on intuitive user interfaces that empower researchers to troubleshoot and resolve problems on their own
  • Encourage software best practices across the company and participate in team processes such as knowledge sharing, reviews, and on-call

Skills

Key technologies and capabilities for this role

Kubernetes, GPU, Multi-cloud, Superclusters, Observability, Scalability, Stability

Questions & Answers

Common questions about this position

Is this position remote?

Yes, the position is remote.

What skills are required for this role?

Candidates need deep experience running Kubernetes clusters at scale and/or scaling and troubleshooting Cloud Native infrastructure including Infrastructure as Code, strong programming skills in Go or Python, and a preference for contributing to Open Source solutions.

What is the compensation or salary for this role?

This information is not specified in the job description.

What is the company culture like at Cohere?

Cohere obsesses over what it builds, works hard and moves fast to serve customers, values a team of the best in the world who are passionate about their craft, and believes diverse perspectives are required for great products.

What makes a strong candidate for this position?

Strong candidates are self-directed and adaptable, excel at identifying and solving key problems, have experience with Kubernetes at scale, programming in Go or Python, and prefer Open Source contributions.

Cohere

Provides NLP tools and LLMs via API

About Cohere

Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.

Headquarters: Toronto, Canada
Year Founded: 2019
Total Funding: $914.4M
Company Stage: Series D
Industries: AI & Machine Learning
Employees: 501-1,000

Risks

Competitors like Google and Microsoft may overshadow Cohere with seamless enterprise system integration.
Reliance on Nvidia chips poses risks if supply chain issues arise or strategic focus shifts.
The high cost of its AI data center could strain financial resources if government funding is delayed.

Differentiation

Cohere's North platform outperforms Microsoft Copilot and Google Vertex AI in enterprise functions.
Rerank 3.5 model processes queries in over 100 languages, enhancing multilingual search capabilities.
Command R7B model excels in RAG, math, and coding, outperforming competitors like Google's Gemma.

Upsides

Cohere's AI data center project positions it as a key player in Canadian AI.
North platform offers secure AI deployment for regulated industries, enhancing privacy-focused enterprise solutions.
Cohere's multilingual support breaks language barriers, expanding its global market reach.
