Research Scholar
CohereFull Time
Internship
Candidates should possess an advanced degree (MS or PhD) in Computer Science, Machine Learning, or a related field. Strong software engineering skills with a proven track record of building complex systems are essential, along with expertise in Python and deep learning frameworks like PyTorch. Familiarity with large-scale machine learning, particularly language models, and the ability to balance research with engineering constraints are required. Excellent problem-solving, communication, and collaboration skills are also necessary, as is a care for the societal impacts of AI work. Preferred experience includes working on high-performance, large-scale ML systems, familiarity with GPUs, Kubernetes, OS internals, transformer architectures, reinforcement learning, and large-scale ETL processes.
The Research Engineer will conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimizer development. They will lead small research projects independently and collaborate on larger initiatives, designing, running, and analyzing scientific experiments to advance understanding of large language models. Responsibilities also include optimizing and scaling training infrastructure for efficiency and reliability, developing and improving dev tooling for team productivity, and contributing to the entire technology stack from low-level optimizations to high-level model design.
Develops reliable and interpretable AI systems
Anthropic focuses on creating reliable and interpretable AI systems. Its main product, Claude, serves as an AI assistant that can manage tasks for clients across various industries. Claude utilizes advanced techniques in natural language processing, reinforcement learning, and code generation to perform its functions effectively. What sets Anthropic apart from its competitors is its emphasis on making AI systems that are not only powerful but also understandable and controllable by users. The company's goal is to enhance operational efficiency and improve decision-making for its clients through the deployment and licensing of its AI technologies.