Research & Development Intern-LLM, Summer 2026
- Help design and develop custom evaluation metrics for LLM and agentic systems
- Verify those new metrics on existing datasets
- Assist in design and setup of experimental data collection efforts for new datasets
- Validate that the custom metrics align with human perception, using existing datasets or designing playtests to collect human judgments
- Document and develop software for running the custom eval metric(s) in larger systems
- Understand and engage with the broad technical and creative goals of the project
- Help define milestones that solve important technical issues and convey the potential impact of the project
- Engage in team brainstorms and troubleshooting
- Experience with LLMs
- Experience with agentic systems
- Experience with LLM evaluation metrics
- Python coding skills
- Previous experience with experiment design and data analysis
- Experience with PyTorch
- Currently enrolled in a master’s program at an accredited college/university, pursuing a degree in Computer Science or Generative AI
- NLP, Generative AI, evaluation metrics
- Be enrolled in an accredited college/university, pursuing a degree and taking at least one class at the time of application posting, OR be currently participating in a Disney College Program or Disney Internship
- Be at least 18 years of age
- Possess unrestricted work authorization
- Have not completed one year of continual employment on a Disney internship or Disney College Program
- Able to have a consistent, reliable work schedule throughout the internship
- Fully available from Monday through Friday for the duration of the internship, 40 hours each week
- Able to provide own housing for the duration of the internship program in the Glendale, CA area
- Able to provide/have reliable transportation to/from work