Distributed Machine Learning Engineer

Institute Of Foundation Models
Sunnyvale, CA

About the Institute of Foundation Models

We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.

As part of our team, you’ll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.

The Role

The Distributed ML Engineer will play a role at the forefront of optimizing performance for the machine learning software stacks, especially at training and inference, and support the team to develop new and cutting-edge systems. The ideal candidate will have a strong background in parallel computing, and hands-on experience in system level coding, debug methodologies, and large-scale machine learning experience.

Key Responsibilities

  • Understand, analyze, profile, optimize, and provide guidance to the team on deep learning workloads on state-of-the-art hardware and software platforms to improve their efficiency with different levels of optimization
  • Design and implement performance benchmarks and testing methodologies to evaluate application performance
  • Build tools to automate workload analysis, workload optimization, and other critical workflows
  • Triage system issues and identify bottleneck and inefficiencies by analyzing the sources of issues and the impact on hardware, network and propose solutions to enhance GPU utilization
  • Support the team to develop appropriate kernels and systems for new model architectures and algorithms
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
  • Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Represent MBZUAI at industry conferences and events, showcasing the institution’s cutting-edge HPC and deep learning capabilities and establishing MBZUAI as a global leader in AI research and innovation.
  • Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.

Academic Qualifications

  • Ph.D. in CS, EE or CSEE with 1+ years working experience, OR
  • Masters in CS, EE or CSEE or equivalent experience with 2+ year working experience

$150,000 - $450,000 a year

Visa Sponsorship

This position is eligible for visa sponsorship.

Benefits Include

*Comprehensive medical, dental, and vision benefits

*Bonus

*401K Plan

*Generous paid time off, sick leave and holidays

*Paid Parental Leave

*Employee Assistance Program

*Life insurance and disability

Posted 2025-12-10

Recommended Jobs

Senior Data Scientist, Revenue Management Systems

Viking Cruises Us
Los Angeles, CA

Senior Data Scientist, Revenue Management Systems Job Summary : Viking is scaling its Revenue Management System (RMS) from a successful MVP to full inventory coverage and sustained yield upl…

View Details
Posted 2026-01-13

Food Runner

Nola Palo Alto
Palo Alto, CA

Description Food runners are an important part of our staff and serve as assistants to the waitstaff. Their main duties are to deliver, or "run" food to tables once a food order has been prepared …

View Details
Posted 2026-01-03

Customer Engagement Specialist

Olympic Marketing Consultants Inc.
Fremont, CA

Customer Engagement Specialist Location Fremont, CA : Olympic Marketing Consultants, Inc. is a fast-growing local marketing and sales firm based in the Bay Area. We specialize in lead generation and …

View Details
Posted 2026-01-09

Senior Software Engineer in Test - Data Platform

Okta
San Francisco, CA

Get to know Okta Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 P…

View Details
Posted 2025-11-28

Construction Repair Technician

Advanced Integrated Pest Management
Concord, CA

Elevate Your Career with Advanced IPM: Join Our Growing Team! Are you seeking an opportunity to advance your career in a family-owned company that truly values its employees? Do you want to make a me…

View Details
Posted 2025-11-07

Project Manager

K2 Staffing
Los Angeles, CA

Our client is consistently recognized as a best workplace, and for commitment to safety, sustainability, and community partnerships. They hire the very best in the construction industry and strives t…

View Details
Posted 2025-10-03

Store Manager, Ralph's Coffee, Palo Alto

Ralph Lauren
Palo Alto, CA

Position Overview The store manager is responsible for leading all team members in the efficient and profitable operation of Ralph’s Coffee. They are responsible for managing the da…

View Details
Posted 2025-12-12

Accounts Receivable Controller

Marlee
Paradise, CA

About the company Fast-track your career with the Marlee Talent Pool. We're not just matching you with your ideal roles but unlocking your long-term career potential. Marlee goes above and beyond by…

View Details
Posted 2025-11-28

Executive IT Support

Neon
Mountain View, CA

GAQ426R304 About the role: As IT Executive Support, you will be the primary technical partner for Databricks senior leadership. You will own the executive support experience, delivering seamless, …

View Details
Posted 2025-12-03

Sr Software Engineer (Malware Research - Antivirus Systems)

Palo Alto Networks
Santa Clara, CA

Company Description Our Mission At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vi…

View Details
Posted 2026-01-07