Research Scientist - Voice AI Foundations
Deepgram - Full Time
- Mid-level (3 to 4 years), Senior (5 to 8 years)
Candidates must hold a Ph.D., or have equivalent experience, in Computer Science, Electrical Engineering, or a related field, with a focus on generative modeling, particularly in computer vision. Experience in image/video generation, audio sequence-to-sequence modeling, model distillation, and optimization is preferred, though not mandatory. A proven track record of publications at top-tier conferences or in reputable journals is required, along with proficiency in working with video, audio, text, and images for real-time or offline tasks. Strong programming skills in Python and expertise with PyTorch or TensorFlow are essential.
The Research Scientist will conduct research to develop innovative models and algorithms for image and video generation. They will publish research in open-access formats and contribute to the AI community. The role also involves collaborating with product and design teams to bring prototypes into production, and optimizing models for efficiency in real-world applications.
Social gaming platform with AI interactions
Cantina.com offers a platform that merges social gaming with artificial intelligence, allowing users to create and interact with AI-driven bots in a virtual space called "The Cantina." Users can customize their experiences by adding bots with unique personalities, engaging in conversations, and creating AI art, making each interaction personalized. The company stands out by focusing on user-driven content and a dynamic experience, appealing to tech-savvy individuals and social gamers. Cantina operates on a subscription model, aiming to provide an engaging environment that leverages the growing interest in AI and social gaming.