Distributed Machine Learning Engineer

Institute Of Foundation Models
Sunnyvale, CA

About the Institute of Foundation Models

We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.

As part of our team, you’ll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.

The Role

The Distributed ML Engineer will play a role at the forefront of optimizing performance for the machine learning software stacks, especially at training and inference, and support the team to develop new and cutting-edge systems. The ideal candidate will have a strong background in parallel computing, and hands-on experience in system level coding, debug methodologies, and large-scale machine learning experience.

Key Responsibilities

  • Understand, analyze, profile, optimize, and provide guidance to the team on deep learning workloads on state-of-the-art hardware and software platforms to improve their efficiency with different levels of optimization
  • Design and implement performance benchmarks and testing methodologies to evaluate application performance
  • Build tools to automate workload analysis, workload optimization, and other critical workflows
  • Triage system issues and identify bottleneck and inefficiencies by analyzing the sources of issues and the impact on hardware, network and propose solutions to enhance GPU utilization
  • Support the team to develop appropriate kernels and systems for new model architectures and algorithms
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
  • Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Represent MBZUAI at industry conferences and events, showcasing the institution’s cutting-edge HPC and deep learning capabilities and establishing MBZUAI as a global leader in AI research and innovation.
  • Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.

Academic Qualifications

  • Ph.D. in CS, EE or CSEE with 1+ years working experience, OR
  • Masters in CS, EE or CSEE or equivalent experience with 2+ year working experience

$150,000 - $450,000 a year

Visa Sponsorship

This position is eligible for visa sponsorship.

Benefits Include

*Comprehensive medical, dental, and vision benefits

*Bonus

*401K Plan

*Generous paid time off, sick leave and holidays

*Paid Parental Leave

*Employee Assistance Program

*Life insurance and disability

Posted 2025-09-22

Recommended Jobs

Senior Scientist I, Cell/Molecular Biology - Cell Line Development

AbbVie Inc.
South San Francisco, CA

Company Description AbbVie's mission is to discover and deliver innovative medicines and solutions that solve serious health issues today and address the medical challenges of tomorrow. We striv…

View Details
Posted 2025-09-02

AI Software Engineer, Vehicle Engineering

Spacex
Hawthorne, CA

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolog…

View Details
Posted 2025-09-22

Private Caregivers Needed in Los Angeles

Private Senior Care, LLC
Los Angeles, CA

Overview: We are a private geriatric healthcare team. We work with private families looking for 1-1 in-home Caregiving and CNA services. Our goal is to match great caregivers with one of our client…

View Details
Posted 2025-08-16

Join the Healing Heart of California's Coastal Paradise!

NurseRecruiter
Salinas, CA

Registered Nurse - Perioperative Nurse - Operating Room - Travel - (OR RN) Hey friend! Ready to travel and save lives in beautiful Salinas, California? Picture yourself as a perioperative nurse in a …

View Details
Posted 2025-07-30

Arby's Team Member

KBP Inspired
Vista, CA

Looking to kickstart your career? At our KBP Inspired franchise location, we’re on the lookout for individuals like you to become part of our Arby’s team. If you’re motivated, a team player, and exci…

View Details
Posted 2025-09-22

Certified Master DHCS Trainer: Facility Site Reviews/Medical Record Reviews

SIS
Pleasanton, CA

Job Description Job Description Certified Master DHCS Trainer: Facility Site Reviews/Medical Record Reviews Pleasanton, CA Salary - $175,000 $256,000 / yr Requirements of this position? …

View Details
Posted 2025-07-30

Senior Accounting Administrator

Four Season Travel
El Monte, CA

Senior Accounting Administrator Location El Monte, CA : Introduction: Four-Season Travel is a rapidly growing tourism operator that provides charter bus services, inbound and outbound tour operation…

View Details
Posted 2025-09-22

Senior Product Manager - Connected Products

Therma-tru
San Francisco, CA

Company Description Fortune Brands Innovations, Inc. is an industry-leading innovation company focused on creating smarter, safer and more beautiful homes and improving lives. Our driving purpos…

View Details
Posted 2025-09-14

Connected Supply Chain, Planning - Kinaxis, Director Save for Later Remove job

PwC
San Francisco, CA

Job Title Connected Supply Chain, Planning - Kinaxis, Director Job Category Operations Consulting Level Director Specialty/Competency Operations Industry/Sector Not Applicable Job Type Reg…

View Details
Posted 2025-09-10

Production Operator

Paragon Laboratories
Torrance, CA

Available Positions:  First Shift: 6:00 am - 2:30 pm (2 open positions). Second Shift: 2:30pm - 11:00 pm (1 open position). Summary of Position: Primary responsibilities are overseeing and…

View Details
Posted 2025-08-07