Machine Learning Engineer (5+ years of experience)

New York, New York, United States


Compensation: $170,000 – $230,000

Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)

Job Type: Full Time

Visa: Unknown

Industries: Video AI, Artificial Intelligence, Biotechnology

Machine Learning Engineer - Captions

Salary: $170K - $230K
Employment Type: Full-Time
Location: New York City, NY (Union Square HQ, in-person required)

Position Overview

Captions is the leading video AI company, building the future of video creation. We are seeking a talented Machine Learning Engineer to partner closely with our Researchers and bring large-scale multimodal video diffusion models into production. This role involves optimizing and deploying cutting-edge generative models (tens to hundreds of billions of parameters) to achieve low-latency, high-throughput inference at scale. You will work on advanced AI technologies including audio-video generation, diffusion architectures, and temporal modeling, impacting millions of creators globally.

About Captions

Captions is a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. We are on a mission to serve the next billion video creators. As an early team member, you will have an outsized impact on our product and company culture.

We are backed by top-tier investors and entrepreneurs, including: Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.

Recent Recognition:

The Information: 50 Most Promising Startups
Fast Company: Next Big Things in Tech
The New York Times: When A.I. Bridged a Language Gap, They Fell in Love
Business Insider: 34 most promising AI startups
Time: The Best Inventions of 2024

Responsibilities

Inference & Deployment

Develop high-performance GPU-based inference pipelines for large multimodal diffusion models.
Build, optimize, and maintain serving infrastructure for low-latency predictions at large scale.
Collaborate with DevOps teams on containerizing models, managing autoscaling, and ensuring uptime SLAs.

Model Optimization & Fine-Tuning

Utilize techniques like quantization, pruning, and distillation to reduce latency and memory footprint while maintaining quality.
Implement continuous fine-tuning workflows to adapt models based on real-world data and feedback.

Production MLOps

Design and maintain automated CI/CD pipelines for model deployment, versioning, and rollback.
Implement robust monitoring (latency, throughput, concept drift) and alerting for critical production systems.

Performance & Scaling

Explore and implement cutting-edge GPU acceleration frameworks (e.g., TensorRT, Triton, TorchServe) to improve throughput and reduce costs.

Requirements

Technical Expertise

Proven experience deploying deep learning models on GPU-based infrastructure (NVIDIA GPUs, CUDA, TensorRT, etc.).
Strong knowledge of containerization (Docker, Kubernetes) and microservice architectures for ML model serving.
Proficiency in Python and at least one deep learning framework (PyTorch, TensorFlow).

Model Optimization

Familiarity with compression techniques (quantization, pruning, distillation) for large-scale models.
Experience profiling and optimizing model inference (batching, concurrency, hardware utilization).

Infrastructure

Hands-on experience with ML pipeline orchestration (Airflow, Kubeflow, Argo) and automated CI/CD for ML.
Strong grasp of logging, monitoring, and alerting tools (Prometheus, Grafana, etc.) in distributed systems.

Domain Experience

Exposure to diffusion models, multimodal video generation, or large-scale generative architectures.
Experience with distributed training frameworks (FSDP, DeepSpeed, Megatron-LM).

Application Instructions

Please note that all roles require in-person presence at our NYC HQ. We do not work with third-party recruiting agencies; agencies, please do not contact us.

Skills

Machine Learning

Multimodal Video Diffusion Models

Generative Models

Model Optimization

Model Deployment

Low-latency Inference

High-throughput Inference

Audio-video Generation

Diffusion Architectures

Temporal Modeling

Captions

Video captioning and translation services

About Captions

Captions.ai enhances video content by providing captioning and translation services tailored for content creators, social media influencers, marketing agencies, and businesses. Their main offerings include automatic subtitle generation, translation into 28 languages, and video compression to improve performance. These tools simplify the video production process, allowing users to produce professional-quality videos with ease. Unlike many competitors, Captions.ai uses a freemium model, offering basic services for free while charging for advanced features, which helps attract a large user base and convert free users into paying customers. The company's goal is to make high-quality video content accessible to a wider audience, and recent funding will support their growth and product development.

Headquarters: New York City, New York

Year Founded: 2021

Total Funding: $82.7M

Company Stage: Series C

Industries: Consumer Software, Entertainment

Employees: 51-200

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Retirement Plan

401(k) Company Match

Commuter Benefits

Wellness Program

Unlimited Paid Time Off

Flexible Work Hours

Risks

Increased competition from startups like Beeble AI could challenge Captions' market position.

Integration challenges from the AlpacaML acquisition may delay product enhancements.

Rapid expansion may stretch resources, potentially affecting service quality.

Differentiation

Captions offers AI-powered video editing with automatic subtitle generation and language dubbing.

The platform supports video compression for optimized performance and accessibility.

Captions uses a freemium model to attract a wide user base and convert free users to paid plans.

Upsides

Captions secured $60 million in Series C funding, indicating strong investor confidence.

The acquisition of AlpacaML enhances Captions' creative tools with AI rendering capabilities.

Expansion to web and desktop platforms increases accessibility and user engagement.

