Machine Learning Researcher
ElevenLabs- Full Time
- Mid-level (3 to 4 years), Senior (5 to 8 years)
Candidates must have deep familiarity with machine learning research processes and best practices, particularly in speech/audio or related fields. A proven track record of successfully delivering complex technical programs with multiple concurrent workstreams is essential. Experience in building and managing research infrastructure, including compute resources, data pipelines, and evaluation frameworks is required. A strong technical background that enables effective communication with research scientists and an understanding of ML system requirements is necessary. The ability to create and implement processes that accelerate research while maintaining scientific rigor is important, along with a history of fostering collaboration across teams and establishing effective knowledge sharing practices. Familiarity with agile methodologies adapted for research environments and strong analytical skills for evaluating project health, resource allocation, and strategic alignment are also required. Excellent communication abilities for articulating technical concepts to diverse stakeholders are crucial.
The Research Program Manager will lead the strategic execution of the Voice AI Foundations research program, architecting and managing a portfolio of research initiatives across neural audio codecs, generative models, and multimodal speech systems. They will develop strategic roadmaps that align research objectives with company goals, orchestrate collaboration between research teams, and build and maintain research infrastructure that enables rapid experimentation. Creating processes that balance creative freedom with the need for quick validation and establishing clear success metrics will be key responsibilities. The manager will also manage relationships with external partners and ensure that research outputs are properly documented, replicated, and positioned for eventual productization.
Speech recognition APIs for audio transcription
Deepgram specializes in artificial intelligence for speech recognition, offering a set of APIs that developers can use to transcribe and understand audio content. Their technology allows clients, ranging from startups to large organizations like NASA, to process millions of audio minutes daily. Deepgram's APIs are designed to be fast, accurate, scalable, and cost-effective, making them suitable for businesses needing to handle large volumes of audio data. The company operates on a pay-per-use model, where clients are charged based on the amount of audio they transcribe, allowing Deepgram to grow its revenue alongside client usage. With a focus on the high-growth market of speech recognition, Deepgram is positioned for future success.