[Remote] Infrastructure Engineer at Roboflow

New York, New York, United States

Compensation: $180,000 – $200,000
Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Artificial Intelligence, Computer Vision, AI & Machine Learning, Software Development

Requirements

Candidates should have over 5 years of hands-on infrastructure or DevOps engineering experience, preferably in fast-paced startup environments. Strong experience with AWS or GCP, Kubernetes in production, Docker, and Helm is required. Proficiency with Terraform, scripting (e.g., Bash), and Python for automation is necessary. Comfort in reading and contributing to application code (Node.js, Python) and familiarity with security best practices and compliance standards (SOC 2, HIPAA, etc.) in cloud-native environments are essential. The ideal candidate thrives in high-ownership environments where priorities shift quickly, balancing speed with long-term reliability, and has experience working cross-functionally with developers, product teams, and customers. Experience in early- to mid-stage startups, especially those with AI/ML infrastructure or SaaS platforms, is preferred.

Responsibilities

The Infrastructure Engineer will design, secure, and maintain cloud infrastructure powering production SaaS and ML workloads across AWS and/or GCP. They will build and operate scalable, containerized applications using Kubernetes, Helm, and Docker. Responsibilities include developing and managing infrastructure-as-code solutions using Terraform, Bash, and Python. The role involves working directly with customers and internal teams to meet security, compliance, and reliability requirements (SOC 2, HIPAA, GDPR). Additionally, they will improve observability, reliability, and on-call processes, including SLOs/SLAs and incident response. Key duties also include automating CI/CD workflows with tools like GitHub Actions and Spacelift and contributing code (Python, Node.js) to product features and platform infrastructure. Identifying and acting on cost-optimization opportunities across the tech stack is also expected.

Skills

Cloud Architecture
Databases
File Storage
Search Clusters
Microservices
Machine Learning Pipelines
Security
Reliability
Rapid Delivery

Roboflow

Platform for creating and deploying AI models

About Roboflow

Roboflow offers a platform for engineers to create, train, and deploy machine learning models using their own images and videos. The platform features an auto-annotate API for efficient data labeling, along with tools for preprocessing and augmenting image data. Roboflow distinguishes itself from competitors by providing project management tools that enhance team collaboration on AI projects. The company's goal is to simplify the AI development process for a diverse range of clients, from individual engineers to large organizations.

Headquarters: Des Moines, Iowa
Year Founded: 2020
Total Funding: $60.5M
Company Stage: Series B
Industries: AI & Machine Learning
Employees: 51-200

Benefits

Unlimited vacation
Stock options
Generous medical, dental, & vision coverage
Flexible schedule
Parental leave
Travel stipend
Productivity stipend
Professional development
401k

Risks

Increased competition from Intel's Gaudi 3 AI accelerator challenges Roboflow's market position.
Potential conflicts with Lumenalta partnership may affect focus and resource allocation.
Reliance on major investors poses financial stability risks if priorities shift.

Differentiation

Roboflow offers a comprehensive AI development platform for creating, training, and deploying models.
The platform supports various image and video formats for training AI models.
Roboflow's auto-annotate API saves time by automatically labeling large data batches.

Upsides

Roboflow secured $40M to enhance computer vision accessibility and open-source tools.
Partnership with Lumenalta enhances AI data management and model training capabilities.
Few-shot AI prompting accelerates image labeling, attracting users with large datasets.
