Jobless Developer
PlusAI logo
PlusAI

Posted 13 days ago

Open

Deep Learning Research Intern

Santa ClaraRemoteInternship

AI Summary

Internship focusing on Vision-Language models for autonomous driving, involving distillation and deployment of models on onboard compute for strategic decision making.

About this role

PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with operations in the United States and Europe, Plus was named by Fast Company as one of the World’s Most Innovative Companies. Partners including TRATON GROUP’s Scania, MAN, and International brands, Hyundai Motor Company, Iveco Group, Bosch, and DSV are working with Plus to accelerate the deployment of next-generation autonomous trucks. If you’re ready to make a huge impact and drive the future of autonomy, Plus is looking for talented individuals to join its fast-growing teams.

At Plus, we are pioneering the next generation of autonomous driving (AV 2.0) by replacing hyper-complex legacy stacks with end-to-end Physical AI models. Our SuperDrive™ virtual driver software powers fully autonomous Level 4 semi-trucks, navigating multi-lane highways, complex construction zones, and unpredictable night-driving scenarios.

This summer, you will join our core AI team to work on the frontier of autonomous decision-making: Vision-Language models. You will use vision language models to generate meta actions for strategic decision making. Your project will also focus on designing and implementing advanced knowledge distillation and compression techniques to transition large models for on-board compute usage.

Responsibilities

  • Work on Vision Language Action models in support of Plus's reasoning layer for on-board strategic decision making. Develop distillation technques for deployment on on-board compute.
  • Required Skills:

  • Pursuing MS or PhD in CS, EE, mathematics, statistics or related field
  • Thorough understanding of state-of-the-art vision language/vision language action models, diffusion, flow matching and transformers.
  • 1-2 years experience with implementing and training models in at least one deep learning framework (PyTorch, Tensorflow, Jax)
  • Skills

    Deep Learning FrameworksDiffusionJAXKnowledge DistillationMeta ActionsModel TrainingOn-board ComputePyTorchTensorFlowTransformersVision-Language-Action ModelsVision-language Models

    Explore related jobs

    Browse these categories