Infrastructure Software Engineer
Baseten- Full Time
- Junior (1 to 2 years)
Candidates should have over 5 years of engineering experience managing production infrastructure at scale. Applicants must possess experience in designing highly available distributed systems using Kubernetes and GPU workloads, along with familiarity in Kubernetes development and production support. Proficiency in cloud services such as GCP, Azure, AWS, and hybrid environments is essential. Candidates should also have experience in complex Linux-based computing environments, resource management, and troubleshooting. Strong collaboration skills and the ability to adapt to solve evolving technical challenges are required, along with a solid understanding of distributed systems and programming experience in languages like Golang or C++.
The Member of Technical Staff will develop, deploy, and operate the AI platform that delivers Cohere's large language models via API endpoints. The role involves working closely with various teams to deploy optimized NLP models in production environments characterized by low latency, high throughput, and high availability. Additionally, the position includes interfacing with customers to create customized deployments to meet their specific needs.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.