Infrastructure Software Engineer
Baseten- Full Time
- Junior (1 to 2 years)
Candidates should possess extremely strong software engineering skills and proficiency in Python along with related ML frameworks such as JAX, Pytorch, and XLA/MLIR. Experience with distributed training infrastructures like Kubernetes and Slurm, as well as large-scale distributed training strategies, is essential. Hands-on experience in training large models at scale and contributing to the tooling or setup of the training infrastructure is also required. A bonus would be having published papers at top-tier venues such as NeurIPS, ICML, and others.
As a Member of Technical Staff, you will design and write high-performance and scalable software for training. You will improve the training setup from both an infrastructure and codebase performance perspective. Additionally, you will craft and implement tools to accelerate training cycles and enhance the overall efficacy of the training infrastructure. You will also research, implement, and experiment with ideas on the supercompute and data infrastructure while learning from and collaborating with leading researchers in the field.
Provides NLP tools and LLMs via API
Cohere provides advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a user-friendly API. Their services cater to a wide range of clients, including businesses that want to improve their content generation, summarization, and search functions. Cohere's business model focuses on offering scalable and affordable generative AI tools, generating revenue by granting API access to pre-trained models that can handle tasks like text classification, sentiment analysis, and semantic search in multiple languages. The platform is customizable, enabling businesses to create smarter and faster solutions. With multilingual support, Cohere effectively addresses language barriers, making it suitable for international use.