Research Engineer - The Diffusion LLM Team
Job Description
About the Institute of Foundation Models
We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.
As part of our team, you’ll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Your strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.
The Role
As a member of the Diffusion LLM Team at MBZUAI, you will play a central role in designing, building, and releasing industrial-scale Diffusion Large Language Models. Our team has two core missions. First, we develop and release diffusion-based LLMs that push the speed–quality frontier at scale by matching autoregressive model quality while enabling faster generation. Second, we improve inference-time scaling relative to standard LLMs, so that additional test-time compute translates into higher-quality samples.
You will work closely with researchers and engineers across architecture, training, and infrastructure to turn research ideas into high-impact model releases for next-generation LLMs.
Key Responsibilities
Design, train, and scale large language models for research and real-world deployment.
Lead or contribute to the release of industrial-scale diffusion language models.
Develop and evaluate training strategies and objectives for efficient model scaling.
Publish research findings and contribute to open-source model and code releases.
Academic Qualifications
MSc or PhD in Machine Learning or Computer Science, or equivalent industry experience.
Professional Experience
Hands-on experience training large models using modern deep learning frameworks at scale.
Strong background in transformer architectures and large-scale optimization techniques.
Demonstrated expertise in LLM pre-training or post-training, with a strong focus on model scaling.
Research track record evidenced by publications, open-source contributions, or released models.
Knowledge of diffusion models or discrete diffusion methods is a plus, but not required.
Ability to work independently while contributing effectively to a collaborative research team.
Salary Range
The posted salary range represents the company’s good faith estimate of the compensation for this position upon hire. The actual compensation offered may vary within this range depending on individual qualifications, including but not limited to relevant skills, experience, education, certifications, geographic location, and specific business needs.