Data Scientist
About the Institute of Foundation Models
We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.
As part of our team, you’ll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.
The Role
As a Data Scientist at the Institute of Foundation Models, your primary responsibility is to curate high quality data at the web-scale to fuel the development of next generation machine learning models. You will work on exploring, consolidating data sources and collaborate with cross-functional teams to conduct in-depth data research, contributing to MBZUAI’s mission of driving impactful AI discoveries and positioning the institution as a leader in the global AI research community. Your expertise will be key in enhancing the performance of large-scale machine learning models, while supporting the development of transformative AI tools that can influence industries worldwide.
Key Responsibilities
- Conduct research on the best data recipes to support various large scale machine learning models.
- Collaborate with the research teams to identify additional sources of data.
- Develop algorithms and systems to efficiently apply the recipe on large scale data.
- Manage and maintain the data catalog, help the research and engineering teams to understand available data sources.
- Develop and implement systems to support the lifecycle of machine learning models, especially on data preprocessing.
- Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
- Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Contribute to research papers and represent MBZUAI at industry conferences and events, showcasing the institution’s cutting-edge HPC and deep learning capabilities and establishing MBZUAI as a global leader in AI research and innovation.
- Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.
Academic Qualifications
- Minimum of Bachelor’s degree or equivalent practical experience.
- Preferred Master's degree or PhD in Computer Science or related technical field.
$100,000 - $650,000 a year
Visa Sponsorship
This position is eligible for visa sponsorship.
Benefits Include
*Comprehensive medical, dental, and vision benefits
*Bonus
*401K Plan
*Generous paid time off, sick leave and holidays
*Paid Parental Leave
*Employee Assistance Program
*Life insurance and disability
Recommended Jobs
Full Time ObGyn Job Mission Viejo, CA
Seeking a Nocturnist OB Hospitalist/Laborist Physician who is Board Certified/Board Eligible in Obstetrics and Gynecology to join an established group of 11 full-time physicians at Mission Hospital i…
Nurse Navigator - Cath Lab
At Houston Methodist, the Nurse Navigator position is responsible for serving as an expert population-specific clinician, patient/client advocate and assisting population-specific patients/families t…
Sr. Staff Functional Safety Engineer
About Rivian Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to att…
Cashier
Job description Ramos Oil Co., a family owned business since 1951, is seeking a Clerk/Cashier at its Rio Vista location. Seeking friendly, energetic and responsible people. Essential Duties and R…
Enterprise Customer Success Manager
Why Ivo? Contract negotiation is the most time-consuming, costly, and difficult component of the contract lifecycle—and it hasn’t gotten much easier since the days of fax machines. Large languag…
Software Engineer (Fullstack)
Archer is an aerospace company based in San Jose, California building an all-electric vertical takeoff and landing aircraft with a mission to advance the benefits of sustainable air mobility. We are …
Senior Data Scientist, Analytics (Growth)
About Airwallex Airwallex is the only unified payments and financial platform for global businesses. Powered by our unique combination of proprietary infrastructure and software, we empower over…
Leasing Consultant (Homecoming at The Resort)
Description Leasing Consultant – Homecoming at The Resort (Rancho Cucamonga, CA) About Us Lewis Group of Companies is one of the nation’s largest privately held real estate development firm…
AI Engineer & Researcher - GPU Kernel
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on eng…
Software Engineer, Backend
Join us in building the future of finance. Our mission is to democratize finance for all. An estimated $124 trillion of assets will be inherited by younger generations in the next two decades. T…