Research Scientist - Voice AI Foundations
DeepgramFull Time
Mid-level (3 to 4 years), Senior (5 to 8 years)
Redwood City, California, United States
Key technologies and capabilities for this role
Common questions about this position
The salary range is $180K - $230K.
The position is hybrid. Engineering teams work from the Redwood City office 3 days per week, with all other Bay Area employees joining on Fridays.
A Ph.D. or Master’s in CS, EE, or related field is required. The role demands expertise in developing and optimizing vision-language models, pre-training and fine-tuning on image-text data, model compression techniques like distillation and quantization, and cross-team collaboration.
The company values in-person collaboration, creativity, and team alignment, with engineering teams working from the Redwood City office 3 days per week and Bay Area employees joining on Fridays.
Strong candidates will have a Ph.D. or Master’s in CS, EE, or related fields, hands-on experience with full-cycle model development including pre-training, fine-tuning, and optimization of vision-language models, and the ability to collaborate cross-functionally while innovating in multimodal AI.
AI software for proactive physical security
Ambient.ai enhances physical security systems with software that uses artificial intelligence and computer vision. The technology helps security teams shift from reactive to proactive operations by detecting unusual changes in human behavior and locations, without relying on facial recognition. This approach respects privacy while providing AI-verified alerts that reduce false alarms and improve efficiency. The company aims to serve organizations needing security solutions while continuously adapting to the evolving risk landscape.