PhD Research Intern, Large Language Models - 2025

at Nvidia

📍 Santa Clara, United States

$62,400-187,200 per year

INTERN
✅ On-site

SCRAPED

Used Tools & Technologies

Not specified

Required Skills & Competences ?

Python @ 3 Machine Learning @ 3 Communication @ 3 Parallel Programming @ 3 PyTorch @ 3

Details

NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society — from gaming to robotics, self-driving cars to life-saving healthcare, climate change to virtual worlds where we can all connect and create. We are passionate about research that pushes boundaries but also has impact in the real world. You will be part of an amazing collaborative research team that consistently publishes at the top venues in machine learning and systems.

Responsibilities

  • Investigate novel approaches to infuse theory-of-mind reasoning into the post- or pre-training phases of large language models.
  • Collaborate with other team members, teams, and/or external researchers.
  • Transfer your research to product groups to enable new products or types of products.
  • Opportunity to publish original research.

Requirements

  • Currently pursuing a PhD Degree in Computer Science/Engineering or Electrical Engineering.
  • Research experience in at least one of the following areas:
    • Large Language Models – training, alignment, and evaluation.
    • Foundation Models.
    • Multimodal Models/Agents.
    • Vision-Language Models.
    • Deep Learning, Model Compression, and Acceleration Techniques.
    • Pruning.
    • Quantization.
    • NAS (Neural Architecture Search).
    • Efficient Backbone Architecture.
    • Distillation.
    • Strong research track record and publication record at top-tier conferences.
    • Excellent communication skills.
    • Excellent programming skills in Python; C++ and parallel programming (e.g., CUDA) is a plus.
    • Hands-on experience with large-scale model training is a plus.
    • Knowledge of common machine learning frameworks, such as PyTorch.

Benefits

  • The hourly rate for our interns is 30 USD - 90 USD, determined based on the position and your location, year in school, degree, and experience.
  • Eligible for intern benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.