Captions

Machine Learning Engineer (5+ years of experience)

New York, New York, United States

$170,000 – $230,000Compensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Video AI, Artificial Intelligence, BiotechnologyIndustries

Requirements

Candidates should possess proven experience deploying deep learning models on GPU-based infrastructure, including NVIDIA GPUs, CUDA, and TensorRT. Strong knowledge of containerization (Docker, Kubernetes) and microservice architectures for ML model serving is required, along with proficiency in Python and at least one deep learning framework such as PyTorch or TensorFlow. Familiarity with compression techniques like quantization, pruning, and distillation, and experience profiling and optimizing model inference are also necessary.

Responsibilities

The Machine Learning Engineer will develop high-performance GPU-based inference pipelines for large multimodal diffusion models, build and maintain serving infrastructure for low-latency predictions at scale, collaborate with DevOps teams on containerization and autoscaling, leverage techniques like quantization and pruning to optimize model performance, design and maintain automated CI/CD pipelines for model deployment, explore cutting-edge GPU acceleration frameworks, and implement robust monitoring and alerting systems.

Skills

Machine Learning
Multimodal Video Diffusion Models
Generative Models
Model Optimization
Model Deployment
Low-latency Inference
High-throughput Inference
Audio-video Generation
Diffusion Architectures
Temporal Modeling

Captions

Video captioning and translation services

About Captions

Captions.ai enhances video content by providing captioning and translation services tailored for content creators, social media influencers, marketing agencies, and businesses. Their main offerings include automatic subtitle generation, translation into 28 languages, and video compression to improve performance. These tools simplify the video production process, allowing users to produce professional-quality videos with ease. Unlike many competitors, Captions.ai uses a freemium model, offering basic services for free while charging for advanced features, which helps attract a large user base and convert free users into paying customers. The company's goal is to make high-quality video content accessible to a wider audience, and recent funding will support their growth and product development.

New York City, New YorkHeadquarters
2021Year Founded
$82.7MTotal Funding
SERIES_CCompany Stage
Consumer Software, EntertainmentIndustries
51-200Employees

Benefits

Health Insurance
Dental Insurance
Vision Insurance
401(k) Retirement Plan
401(k) Company Match
Commuter Benefits
Wellness Program
Unlimited Paid Time Off
Flexible Work Hours

Risks

Increased competition from startups like Beeble AI could challenge Captions' market position.
Integration challenges from AlpacaML acquisition may delay product enhancements.
Rapid expansion may stretch resources, potentially affecting service quality.

Differentiation

Captions offers AI-powered video editing with automatic subtitle generation and language dubbing.
The platform supports video compression for optimized performance and accessibility.
Captions uses a freemium model to attract a wide user base and convert to paid plans.

Upsides

Captions secured $60 million in Series C funding, indicating strong investor confidence.
The acquisition of AlpacaML enhances Captions' creative tools with AI rendering capabilities.
Expansion to web and desktop platforms increases accessibility and user engagement.

Land your dream remote job 3x faster with AI