Software Engineer, Data at Twelve Labs

Seoul, South Korea

Twelve Labs Logo
Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
AI, Machine Learning, TechnologyIndustries

Requirements

  • 5+ years of experience in a fast-changing startup or similar environment with high adaptability, resilience, and proactive problem-solving for frontend engineering tasks
  • Solid hands-on experience as a backend and/or data engineer, with interest in ML/AI systems and experience in data collection and processing for large-scale generative or contrastive learning models
  • Proficiency in at least one programming language such as Python or Golang
  • Preferred Qualifications
  • Experience leading a team as a Technical Lead and driving projects
  • Experience building model-based language or vision-language datasets such as RLAIF
  • Development experience using FFmpeg or other high-performance image/video processing libraries
  • Master's or PhD degree in machine learning or related fields

Responsibilities

  • Build a data engine for collecting, preprocessing, cleaning, filtering, and labeling large-scale multimodal (video, image, audio) datasets for LLM/VLM training
  • Develop data management and visualization systems capable of handling petabyte (PB)-scale video, image, and audio data
  • Design and build libraries and services that deliver real impact, beyond superficial features
  • Collaborate closely with various teams to define project priorities and goals, and lead technical initiatives from planning through development and operations

Skills

Key technologies and capabilities for this role

Distributed SystemsData PipelinesMultimodal Data ProcessingData LabelingVideo ProcessingImage ProcessingAudio ProcessingLLM Training DataVLM Training DataPB-Scale Data ManagementData Visualization

Questions & Answers

Common questions about this position

What is the work arrangement for this position?

The position is hybrid, combining autonomy and collaboration in a hybrid work setup.

What are the required skills for the Software Engineer, Data role?

Candidates need 5+ years in a fast-changing startup environment, solid backend or data engineering experience with interest in ML/AI systems, experience with data collection and processing for large-scale models, and proficiency in Python, Golang or similar programming languages. Strong communication skills to explain technical concepts to non-experts and collaborate across teams are also essential.

What does Twelve Labs' company culture emphasize?

The company values honesty and reflection towards self and team, perseverance and humility without fearing failure or feedback, continuous learning to elevate team capabilities, and enjoying the process of solving challenging problems together.

What benefits are offered to employees?

Benefits include MacBook and 700,000 KRW worth of remote work equipment support with replacement every 3 years, monthly 600,000 KRW corporate card for meals and transportation, office snack bar, 2-week year-end winter break, annual health checkup, and English education program support.

What makes a strong candidate for this role?

A strong candidate has startup experience with high adaptability, backend/data engineering expertise in large-scale multimodal data processing, proficiency in Python or Golang, and excellent cross-team collaboration skills. Preferred qualifications include technical leadership, experience with RLAIF datasets, FFmpeg usage, or advanced degrees in ML-related fields.

Twelve Labs

AI system for video content understanding

About Twelve Labs

Twelve Labs focuses on artificial intelligence and video understanding by developing a system that analyzes videos to extract key features like actions, objects, and speech. This information is transformed into vector representations, enabling fast semantic search within large video datasets. The company differentiates itself by providing a platform that is faster and more effective than many existing models, allowing developers and product managers to easily integrate its technology through an API. Twelve Labs aims to make all videos searchable, enhancing the way businesses utilize video content.

San Francisco, CaliforniaHeadquarters
2021Year Founded
$104.2MTotal Funding
EARLY_VCCompany Stage
Enterprise Software, AI & Machine Learning, EducationIndustries
51-200Employees

Risks

Increased competition from emerging AI startups in video understanding.
Rapid AI advancements require continuous innovation, straining resources.
Potential over-reliance on key investors like SK Telecom and Databricks.

Differentiation

Twelve Labs offers a comprehensive AI system for multimodal video understanding.
Their technology transforms video content into vector representations for fast semantic search.
The platform's API allows easy integration into clients' systems with minimal effort.

Upsides

Growing demand for AI-driven video analytics in education, healthcare, and security sectors.
Recent $30 million funding enhances technology development and strategic partnerships.
Collaboration with cloud providers boosts scalability and efficiency of video processing.

Land your dream remote job 3x faster with AI