AI Researcher (Voice)
TavusFull Time
Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
The salary range is $215K - $340K.
All roles require in-person work at the NYC HQ located in Union Square.
The role requires expertise in designing novel architectures for large-scale video and multimodal diffusion models, developing training techniques, scaling models to billions of parameters, and experience in multimodal fusion, temporal modeling, video control, and generative modeling.
The company has a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC, where early members have an outsized impact on products and company culture.
Strong candidates are exceptional Research Engineers with experience advancing large-scale multimodal video diffusion models, conducting novel research in generative architectures, and driving rapid experimentation with product impact.
Video captioning and translation services
Captions.ai enhances video content by providing captioning and translation services tailored for content creators, social media influencers, marketing agencies, and businesses. Their main offerings include automatic subtitle generation, translation into 28 languages, and video compression to improve performance. These tools simplify the video production process, allowing users to produce professional-quality videos with ease. Unlike many competitors, Captions.ai uses a freemium model, offering basic services for free while charging for advanced features, which helps attract a large user base and convert free users into paying customers. The company's goal is to make high-quality video content accessible to a wider audience, and recent funding will support their growth and product development.