Untether AI

AI Infrastructure Solutions Architect

California, United States

Not SpecifiedCompensation
Expert & Leadership (9+ years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Information Technology & ServicesIndustries

Requirements

Candidates must possess 10+ years of experience in the compute/server/datacenter industry, with at least 3 years of experience working with accelerated AI solutions utilizing GPUs or custom AI accelerators. Successful experience in specifying, building, and deploying rack-scale hardware and software infrastructure is required, and experience working at AIaaS cloud service providers, on-prem datacenter, or high-performance computing labs is highly desirable. Strong interpersonal skills, including the ability to work efficiently in a group environment and influence cross-functional teams without direct managerial authority, are also necessary. Excellent problem-solving skills and the ability to resolve complex issues with ambiguity are crucial.

Responsibilities

The AI Infrastructure Solutions Architect will architect Untether AI’s rack-scale solutions, encompassing accelerator cards, servers, networking, and rack topologies, while selecting, integrating, and testing best-of-breed hardware and full-stack software. They will lead and collaborate with cross-functional teams to drive a shared vision and deliver a best-in-class solution, working with business development teams to recommend and deliver rack-level solutions to partners and customers. Furthermore, the Architect will dogfood the systems they create, testing, bulletproofing, and recommending them to partners and customers, and will work closely with hardware, software, and product teams.

Skills

AI Infrastructure
Rack-Scale System Design
Inference Acceleration
GPU/AI Accelerator Experience
Hardware and Software Integration
High-Performance Computing
Networking
Full-Stack Inference Serving
Cross-Functional Collaboration
Problem-Solving

Untether AI

Enhances AI inference with at-memory computing

About Untether AI

Untether AI enhances the speed and efficiency of AI inference workloads using at-memory computing. This method places the compute element next to memory cells, which boosts compute density and accelerates AI inference for various neural networks, such as those used in vision, natural language processing, and recommendation systems. The company targets businesses that rely on AI technologies and need high-performance computing for inference tasks. Their products, including the runAI200® devices and tsunAImi® accelerator cards, are designed to deliver exceptional performance, with the tsunAImi® card offering over 2 PetaOps. This allows businesses to optimize their AI workloads while maintaining a compact PCI-Express form factor. Untether AI's goal is to provide efficient and cost-effective solutions for companies looking to enhance their AI applications.

Key Metrics

Toronto, CanadaHeadquarters
2018Year Founded
$144.6MTotal Funding
SERIES_BCompany Stage
Hardware, AI & Machine LearningIndustries
51-200Employees

Benefits

Paid Vacation
Health Insurance
Unlimited Paid Time Off
Stock Options

Risks

Emerging competition in energy-efficient AI hardware could threaten market position.
Rapid AI model evolution may require frequent hardware updates and innovations.
Supply chain vulnerabilities in semiconductor components could impact production timelines.

Differentiation

Untether AI's at-memory computing maximizes AI inference efficiency and speed.
The tsunAImi® accelerator card delivers over 2 PetaOps per card, optimizing AI workloads.
Untether AI's imAIgine SDK allows rapid deployment of neural networks with flexible kernels.

Upsides

Collaboration with Arm enhances solutions for ADAS and AV applications in the automotive sector.
$20 million funding supports ongoing development of machine learning inferencing hardware.
Partnership with J-Squared opens new opportunities in defense and commercial sectors.

Land your dream remote job 3x faster with AI