AI Engineer & Researcher - CUDA/GPU Kernel at xAI

Palo Alto, California, United States

Compensation: Not Specified
Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Artificial Intelligence

Requirements

  • Experience building high-performance GEMM CUDA kernels using Tensor Cores or CUDA cores, either from scratch or with CuTe/CUTLASS
  • Experience implementing features for attention kernels by extending existing kernels or writing them from scratch
  • Comfortable writing both forward and backward kernels and ensuring their correctness while accounting for floating-point error
  • Experience optimizing for both memory-bound and compute-bound operations
  • Ability to reason about register pressure, shared-memory usage, and GPU utilization with tools such as Nsight, and to remove bottlenecks
  • Familiarity with the latest and most effective techniques for optimizing inference and training workloads
  • Experience using pybind to integrate custom-written kernels into a framework, especially JAX/XLA
  • Strong communication skills to concisely and accurately share knowledge with teammates
  • Proficiency with CUDA, CUTLASS, C/C++ and Python binding tools

Responsibilities

  • Developing and improving low-level CUDA kernel optimizations for a state-of-the-art inference and training software stack
  • Profiling, debugging, and optimizing single- and multi-GPU operations using tools such as Nsight
  • Understanding GPU memory hierarchy and computation capabilities
  • Implementing the latest methods from the deep learning literature in low-level CUDA kernels
  • Innovating new ideas that bring us closer to the limits of a GPU
  • Contributing hands-on to the company’s mission with a strong work ethic and prioritization skills

Skills

CUDA
CUTLASS
C/C++
Python
Nsight
GEMM
Tensor Cores
CUDA Cores
CuTe
GPU kernels
attention kernels

xAI

AI tools for research and information retrieval

About xAI

xAI develops AI tools aimed at enhancing research and information retrieval. Its main product, Grok, is designed to answer a wide variety of questions, including unconventional ones that other AI systems might not handle. Grok provides real-time knowledge, making it a useful resource for researchers, academics, and professionals who need quick access to relevant information. Grok also stands out from competitors for its ability to suggest questions and provide nuanced answers across a diverse range of inquiries. xAI's goal is to empower users by streamlining their research processes and fostering innovation through reliable information access.

Headquarters: Burlingame, California
Year Founded: 2023
Total Funding: $11,803.1M
Company Stage: Series C
Industries: Data & Analytics, AI & Machine Learning
Employees: 1,001-5,000

Benefits

Health Insurance
Remote Work Options

Risks

Increased competition from Anthropic could challenge xAI's market position.
Legal battles involving Elon Musk may divert resources from xAI's operations.
Reliance on Nvidia GPUs poses risks if supply chain issues arise.

Differentiation

Grok answers unconventional questions, unlike many AI systems.
xAI's Grok provides real-time knowledge, enhancing research efficiency.
Grok's ability to generate striking images sets it apart in visual data processing.

Upsides

xAI secured $6 billion funding, boosting AI infrastructure and R&D.
Grok's iOS app launch expands user accessibility and engagement.
AI-driven research tools are increasingly integrated with cloud platforms, aiding xAI's growth.
