Summer Intern- Multimodal LLM Algorithm
Overview
You'll work at the frontier of multimodal large language models, applying cutting-edge capabilities to content understanding and generation tasks to continuously raise the bar on model performance.
This is a full-time summer 2026 internship (5+ months). Strong performers will be considered for a return offer.
Responsibilities
● Track and explore cutting-edge multimodal LLM research; apply findings to multimodal content understanding tasks
● Contribute to model architecture design, pre-training, fine-tuning, and downstream application development
● Support team capability building through ongoing experimentation and model optimization
Qualifications
● Master's or PhD student in CS, AI, or a related field
● Research or project experience in multimodal learning, RL, NLP, or CV
● Solid math and programming foundations; proficiency in PyTorch
● Strong interest in large models and emerging AI; self-motivated with genuine research potential
● Team player with good communication skills
Bonus Points
● Publications at top NLP, CV, or ML conferences or journals