Machine Learning / AI Operations Engineer at Arcade

San Francisco, California, United States

Compensation: $120,000 – $180,000
Experience Level: Junior (1 to 2 years)
Job Type: Full Time
Visa: Unknown
Industries: Artificial Intelligence, Software

Requirements

  • Strong understanding of the state of the art in machine learning, especially LLMs and tool-calling (e.g. MCP)
  • Comfortable with tuning libraries (HuggingFace Trainer, DeepSpeed, FSDP, QLoRA, etc.)
  • Familiarity with model lifecycle management tools (MLflow, Weights & Biases, DVC, etc.)
  • Experience with model optimization, quantization and deployment formats (ONNX, OpenVINO, TensorRT, etc.)
  • Experience with modern monitoring tools (Prometheus, Grafana, Datadog, ELK, Arize AI, etc.)
  • Production experience with at least one major agent framework (LangChain, LlamaIndex, OpenAI Agents SDK, Mastra, etc.)
  • 5+ years of software engineering experience, including:
      • 3+ years of experience working on a production-level ML training or inference system
      • 2+ years of experience building and deploying ML models

Responsibilities

  • Build: Create bleeding-edge models fine-tuned for Arcade's agentic products
  • Deploy: Test and deploy our models and related application software, both on-prem and in our cloud
  • Monitor: Prevent model drift. Make the models and APIs better. Collect the data you need to do it
  • Build our stack: Use your experience to choose the right tools for the job, balancing speed, maintainability and cost
  • Shape the roadmap for the team
  • Share your work with our customers and community, building our (and your) brand

Skills

Key technologies and capabilities for this role

Machine Learning, AI, Agentic Tools, Distributed Systems, LangChain, LlamaIndex, Authentication, Integrations

Questions & Answers

Common questions about this position

Is this position remote or onsite?

This is an onsite position.

What are the main responsibilities of this role?

You will build bleeding-edge models fine-tuned for Arcade's agentic products, test and deploy models and software both on-prem and in the cloud, monitor to prevent model drift, improve models and APIs, collect necessary data, and build the stack using the right tools.

What skills and experience are required for this Machine Learning/AI Operations Engineer role?

The role requires experience building and fine-tuning models for agentic AI products, deploying models in the cloud and on-premises, monitoring for model drift, improving models and APIs, collecting data, and selecting tools for ML infrastructure.

What is the company culture like at Arcade?

Arcade has assembled a dream team of experts in authentication, integrations, distributed systems, and AI from top companies like Okta, Redis, Microsoft, and Google. Team members have built and founded successful developer platforms, which fosters a high-caliber, innovative environment.

What makes a strong candidate for this role?

A strong candidate has hands-on experience building, deploying, and monitoring custom ML models for agentic AI, especially in cloud and on-premise environments, with expertise in preventing model drift and selecting optimal tools.

What is the salary or compensation for this position?

The listing gives a compensation range of $120,000 – $180,000.

Arcade

Digital platform for interactive product demos

About Arcade

Arcade provides a platform for businesses to create interactive product demos that enhance storytelling. Users can easily record their screens using a Chrome extension and combine various media to craft engaging narratives. The platform's unique features include interactive elements and the ability to update demos as live assets, which saves time for businesses. Operating on a freemium model, Arcade also offers analytics to help users understand audience engagement.

Headquarters: San Francisco, California
Year Founded: 2021
Total Funding: $20.9M
Company Stage: Series A
Industries: Data & Analytics, Consumer Software
Employees: 51-200

Benefits

Health Insurance
Dental Insurance
Vision Insurance
Unlimited Paid Time Off
401(k) Company Match
$500 a month remote work stipend
Biannual company retreats
Competitive salary and meaningful equity

Risks

Increased competition from startups may dilute Arcade's market share.
Rapid evolution of AI tools may outpace Arcade's current offerings.
Dependence on Chrome extensions poses risks if browser policies change.

Differentiation

Arcade offers a unique platform for creating interactive product demos.
The platform supports interactive features like pan, zoom, and callouts for enhanced engagement.
Arcade's demos are live assets, allowing continuous edits without starting from scratch.

Upsides

Arcade raised a $14M Series A to advance AI content generation and growth.
Increased demand for interactive content boosts Arcade's market potential.
Growing trend of AI in content creation aligns with Arcade's focus on AI-driven demos.
