Data Scientist
About the Institute of Foundation Models
We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.
As part of our team, you’ll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.
The Role
As a Data Scientist at the Institute of Foundation Models, your primary responsibility is to curate high-quality data at web scale to fuel the development of next-generation machine learning models. You will explore and consolidate data sources and collaborate with cross-functional teams to conduct in-depth data research, contributing to MBZUAI’s mission of driving impactful AI discoveries and positioning the institution as a leader in the global AI research community. Your expertise will be key in enhancing the performance of large-scale machine learning models, while supporting the development of transformative AI tools that can influence industries worldwide.
Key Responsibilities
- Conduct research on the best data recipes to support various large-scale machine learning models.
- Collaborate with the research teams to identify additional sources of data.
- Develop algorithms and systems to efficiently apply these recipes to large-scale data.
- Manage and maintain the data catalog, and help the research and engineering teams understand available data sources.
- Develop and implement systems to support the lifecycle of machine learning models, especially data preprocessing.
- Participate in, or lead, design reviews with peers and stakeholders to decide among available technologies.
- Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Contribute to research papers and represent MBZUAI at industry conferences and events, showcasing the institution’s cutting-edge HPC and deep learning capabilities and establishing MBZUAI as a global leader in AI research and innovation.
- Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.
Academic Qualifications
- Minimum of a Bachelor’s degree or equivalent practical experience.
- Preferred: Master’s degree or PhD in Computer Science or a related technical field.
$100,000 - $650,000 a year
Visa Sponsorship
This position is eligible for visa sponsorship.
Benefits Include
- Comprehensive medical, dental, and vision benefits
- Bonus
- 401K Plan
- Generous paid time off, sick leave, and holidays
- Paid Parental Leave
- Employee Assistance Program
- Life insurance and disability