Research Engineer, Pre-training
AnthropicFull Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
Candidates should possess strong software engineering skills, with proficiency in Python and experience building data pipelines, along with familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools. Experience working with large-scale datasets, including web data, code data, and multilingual corpora is required, as well as knowledge of data quality assessment techniques and experimentation with data mixtures.
As a Pre-Training Data Engineer, you will design and build scalable data pipelines to ingest, clean, filter, and optimize diverse datasets, conduct data ablations to assess data quality and experiment with data mixtures to enhance model performance, develop robust data modeling techniques to ensure datasets are structured and formatted for optimal training efficiency, research and implement innovative data curation methods, and collaborate with cross-functional teams to meet the demands of cutting-edge language models.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.