Robotics Intern - Large Behavior Models, Learning From Videos (LFV) at Toyota Research Institute

Los Altos, California, United States

Toyota Research Institute Logo
Not SpecifiedCompensation
InternshipExperience Level
InternshipJob Type
UnknownVisa
Robotics, Automotive, Artificial IntelligenceIndustries

Requirements

  • Currently pursuing a Ph.D. in Machine Learning, Robotics, or related fields
  • Publication or desire to publish at high-impact conferences/journals (e.g., CoRL, ICLR, NeurIPS, CVPR, ICCV, ECCV, ICML, UAI, AISTATS, AAAI, TMLR, RSS, ICRA, IROS, RA-L, etc.) on relevant topics
  • Passionate about large scale challenges in ML and CV grounded in physical systems, especially in the space of robotics
  • Proficiency with one or more coding languages and systems, preferably Python, Unix, and a Deep Learning framework (e.g., PyTorch)
  • Ability to collaborate with other researchers and engineers of the LFV team, and, more broadly, the Robotics division to invent and develop interesting research ideas
  • A reliable teammate who loves to think big, go deeper, and deliver with integrity
  • Provide a link to Google Scholar and include a full list of publications when submitting CV

Responsibilities

  • Conduct daring research in Computer Vision that solves open problems of high theoretical and practical value, and evaluate solutions on real-world benchmarks and systems, with a focus on robotics
  • Push the boundaries of knowledge and the state-of-the-art in Visual Systems for Robotics
  • Partner with a multidisciplinary team, including other research scientists and engineers across the LFV team, the Robotics division, TRI, Toyota, and university partners
  • Stay up to date on the state-of-the-art in Machine Learning ideas and software
  • Present results in verbal and written communications at international conferences, internally, and via open-source contributions to the community

Skills

Key technologies and capabilities for this role

Computer VisionRoboticsVideo GenerationWorld Modeling4D ReconstructionMulti-Modal Foundation ModelsMulti-View GeometryData AugmentationLarge Vision ModelsMachine LearningSimulators

Questions & Answers

Common questions about this position

Is this internship paid, and how long is it?

This is a paid 12-week internship opportunity for summer 2026.

What is the work arrangement for this internship?

The internship will be a hybrid in-office role.

What are the key qualifications for this role?

Candidates must be currently pursuing a Ph.D. in Machine Learning, Robotics, or related fields, have publications or a desire to publish at high-impact conferences like CoRL, ICLR, NeurIPS, CVPR, and demonstrate proficiency with one or more coding languages and systems, along with passion for large-scale ML and CV challenges in robotics.

What is the team and focus of this internship?

The Learning From Videos (LFV) team in the Robotics division focuses on areas like Video Generation, World Modeling, 4D Reconstruction, and Multi-Modal Foundation Models, with an emphasis on embodied robotics applications to tackle challenges in real-world unstructured environments.

What makes a strong candidate for this internship?

Strong candidates are Ph.D. students with publications at top conferences in ML, CV, or robotics, passion for large-scale challenges in physical systems, and proficiency in coding languages.

Toyota Research Institute

Research in mobility, safety, and automation

About Toyota Research Institute

Toyota Research Institute focuses on improving mobility through research and development in the automotive and technology sectors. The company works on enhancing safety, automated driving, robotics, materials science, and machine learning. Their products include advanced safety features and automated driving systems that aim to make driving safer and more efficient. Unlike many competitors, TRI emphasizes a research-driven approach, collaborating with various partners and licensing their innovations to enhance Toyota's offerings and maintain a competitive edge. The goal of TRI is to advance mobility solutions that improve quality of life and support the transition to zero-emissions transportation.

Los Altos, CaliforniaHeadquarters
2016Year Founded
$100MTotal Funding
SEEDCompany Stage
Robotics & Automation, Automotive & Transportation, AI & Machine LearningIndustries
201-500Employees

Benefits

Highly competitive benefits package
Robust programs to support the wellbeing, happiness, and health of our people and their families.
401(k) plan including matching and annual profit sharing along with total vacation and holidays totaling 38 days per year.
Subsidized commuter benefits and generous employee and vehicle allowances
2 paid days per year to participate in volunteer activities.
Maternity Leave program with 10 paid weeks plus baby bonding leave and Milk Stork for traveling moms
Baby Bonding Leave–an additional 16 paid weeks–to all new parents, including those who choose to adopt.
Back-up child and adult / elder care programs to help everyone thrive
WellBeats Virtual Fitness Trainer.

Risks

Public skepticism may undermine AI-assisted driving safety initiatives.
Boston Dynamics' high costs could impact TRI's humanoid robot project returns.
Complexity of real-world environments may delay TRI's robot skill deployment.

Differentiation

TRI focuses on AI to enhance human life and mobility.
Partnerships with Boston Dynamics and Stanford showcase TRI's collaborative innovation.
TRI's Driving Sensei concept integrates AI to improve driver skills and engagement.

Upsides

Generative AI allows TRI's robots to learn complex tasks without new coding.
TRI's AI-driven material synthesis could revolutionize EV battery development.
Autonomous tandem drifting collaboration with Stanford pushes vehicle dynamics boundaries.

Land your dream remote job 3x faster with AI