PhD Research Intern, Large Language Models - 2025

at NVIDIA
$62,400-187,200 per year
$30-90 per hour
INTERN
✅ On-site

Used Tools & Technologies

Not specified

Required Skills & Competences

Python @ 3, Machine Learning @ 3, Communication @ 3, Parallel Programming @ 3, PyTorch @ 3, CUDA @ 3

Details

NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society, from gaming to robotics, self-driving cars to life-saving healthcare, and climate change to virtual worlds where we can all connect and create. We are passionate about research that pushes boundaries but also has an impact in the real world. You will be part of an amazing collaborative research team that consistently publishes at the top venues in machine learning and systems.

Our internships offer an excellent opportunity to expand your career and get hands-on with one of our industry-leading Large Language Models Research teams. We're seeking strategic, ambitious, hard-working, and creative individuals who are passionate about helping us tackle challenges no one else can solve.

Responsibilities

  • Investigate novel approaches to infuse theory-of-mind reasoning into the post- or pre-training phases of large language models.
  • Collaborate with other team members, teams, and/or external researchers.
  • Transfer your research to product groups to enable new products or types of products.
  • Publish original research.

Requirements

  • Currently pursuing a PhD degree in Computer Science/Engineering or Electrical Engineering.
  • Research experience in at least one of the following areas:
    • Large Language Models – training, alignment, and evaluation
    • Foundation Models
    • Multimodal Models/Agents
    • Vision-Language Models
    • Deep Learning, Model Compression, and Acceleration Techniques
    • Pruning
    • Quantization
    • Neural Architecture Search
    • Efficient Backbone Architecture
    • Distillation
  • Strong research track record and publication record at top-tier conferences.
  • Excellent communication skills.
  • Excellent programming skills in a rapid prototyping environment such as Python; C++ and parallel programming (e.g., CUDA) are a plus.
  • Hands-on experience with large-scale model training is a plus.
  • Knowledge of common machine learning frameworks, such as PyTorch.

NVIDIA is widely considered one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world. Are you a creative and collaborative researcher with a real passion for computer graphics? If so, we want to hear from you!

The hourly rate for our interns is 30 USD - 90 USD. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience. You will also be eligible for Intern benefits.