AI Engineer & Researcher, Inference
SpeechifyFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
Yes, the position is remote with a remote-friendly environment. The Model Efficiency team is concentrated in the EST and PST time zones, which are preferred locations.
Required skills include significant experience developing high-performance audio or machine learning inference systems, proficiency with C++ and Python, hands-on experience with deep learning models for audio, speech, or language applications, and a bias for action with a results-oriented mindset.
This information is not specified in the job description.
Cohere obsesses over what they build, works hard and moves fast to serve customers, and consists of top experts passionate about their craft who value diverse perspectives.
Strong candidates have significant experience in high-performance audio or ML inference systems, proficiency in C++ and Python, experience with deep learning for audio/speech, and a results-oriented mindset. Big pluses include GPU programming, real-time streaming architectures, ML framework internals, inference frameworks, and sequence modeling for audio.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.