Software Engineer, Data at Twelve Labs

Seoul, South Korea

Twelve Labs Logo
Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
AI, Machine Learning, TechnologyIndustries

Requirements

  • 5+ years of experience in a fast-changing startup or similar environment with high adaptability, resilience, and proactive problem-solving for frontend engineering tasks
  • Solid hands-on experience as a backend and/or data engineer, with interest in ML/AI systems and experience in data collection and processing for large-scale generative or contrastive learning models
  • Proficiency in at least one programming language such as Python or Golang
  • Preferred Qualifications
  • Experience leading a team as a Technical Lead and driving projects
  • Experience building model-based language or vision-language datasets such as RLAIF
  • Development experience using FFmpeg or other high-performance image/video processing libraries
  • Master's or PhD degree in machine learning or related fields

Responsibilities

  • Build a data engine for collecting, preprocessing, cleaning, filtering, and labeling large-scale multimodal (video, image, audio) datasets for LLM/VLM training
  • Develop data management and visualization systems capable of handling petabyte (PB)-scale video, image, and audio data
  • Design and build libraries and services that deliver real impact, beyond superficial features
  • Collaborate closely with various teams to define project priorities and goals, and lead technical initiatives from planning through development and operations

Skills

Distributed Systems
Data Pipelines
Multimodal Data Processing
Data Labeling
Video Processing
Image Processing
Audio Processing
LLM Training Data
VLM Training Data
PB-Scale Data Management
Data Visualization

Twelve Labs

AI system for video content understanding

About Twelve Labs

Twelve Labs focuses on artificial intelligence and video understanding by developing a system that analyzes videos to extract key features like actions, objects, and speech. This information is transformed into vector representations, enabling fast semantic search within large video datasets. The company differentiates itself by providing a platform that is faster and more effective than many existing models, allowing developers and product managers to easily integrate its technology through an API. Twelve Labs aims to make all videos searchable, enhancing the way businesses utilize video content.

San Francisco, CaliforniaHeadquarters
2021Year Founded
$104.2MTotal Funding
EARLY_VCCompany Stage
Enterprise Software, AI & Machine Learning, EducationIndustries
51-200Employees

Risks

Increased competition from emerging AI startups in video understanding.
Rapid AI advancements require continuous innovation, straining resources.
Potential over-reliance on key investors like SK Telecom and Databricks.

Differentiation

Twelve Labs offers a comprehensive AI system for multimodal video understanding.
Their technology transforms video content into vector representations for fast semantic search.
The platform's API allows easy integration into clients' systems with minimal effort.

Upsides

Growing demand for AI-driven video analytics in education, healthcare, and security sectors.
Recent $30 million funding enhances technology development and strategic partnerships.
Collaboration with cloud providers boosts scalability and efficiency of video processing.

Land your dream remote job 3x faster with AI