Staff Software Engineer/Data Scientist, Large Model Evaluation
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.
The Large Model Evaluation team is at the nexus of Waymo’s AI ambition . With advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs), Waymo is building state-of-the-art AI systems that handle the full complexity of real-world driving. At its core, our progress is defined by our ability to measure it. While robust evaluation is the bottleneck for deploying any large model, the challenge at Waymo is uniquely complex and safety-critical. We are looking for quantitatively-minded engineers to research and propose new ways to assess the ML models deployed in the Waymo Driver.
You will:
- Develop novel metrics and sampling techniques to measure the driving trajectories generated by ML models.
- Employ creative simulation strategies to measure the driving performance of generative AI models. Identify potential edge cases, and provide reliable performance insights that inform model development and deployment.
- Build data pipelines for signal discovery, data labeling, feature extraction and metric computation based on large-scale simulations.
- Conduct data analysis to diagnose regressions in ML models.
- Collaborate with world-class engineering and research teams that develop large-scale ML models.
You have:
- 7+ years of relevant industry experience in a heavily quantitative software engineering area
- Experience navigating complex technical and product landscapes, defining technical strategy, and creating roadmaps.
- Software Engineering Fundamentals:
- Proficiency in programming in Python or C++
- Experience with software design principles, coding best practices, testing methodologies, and version control software.
- Experience building software pipelines for data processing, system evaluation, or metric computation, in the context of large-scale systems.
- Machine learning & Quantitative Experience
- Knowledge of AI fundamentals, such as transformer architectures, distillation techniques, etc.
- Experience evaluating the quality of ML models
- Demonstrated experience taking quantitative findings through to productionized tools.
We prefer:
- Experience with simulation systems, robotics, or autonomous vehicles.
- Familiarity with one of the modern deep learning frameworks (e.g. JAX, Tensorflow)
- Experience leading a team of Engineers
The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.
Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.
Salary Range
$238,000—$302,000 USD
Recommended Jobs
Sr. Product Manager, Recs Cross-Surface Personalization
Our Mission Launched in 2012, Tinder® revolutionized how people meet, growing from 1 match to one billion matches in just two years. This rapid growth demonstrates its ability to fulfill a fundament…
Credentialing Specialist
Credentialing Specialist JOB-10045419 Anticipated Start Date December 08, 2025 Location New York, NY Type of Employment Contract Hire Employer Info Our client …
Restaurant Manager
The Table in Willow Glen is looking for a restaurant manager. This is a high volume and fast paced environment requiring a skilled professional looking to add to the team. Duties include the followin…
Auto Mechanic/Technician
South Bay Auto Auction is looking for good people to help grow our business. As a one-stop independent auto auction, our team members are by far the most important part of our company. We are lookin…
MANAGEMENT SERVICES TECHNICIAN
Location: Environmental Services Department Serves as the MST for the Custodian Supervisor II. Performs a variety of duties and is expected to consistently exercise a high degree o…
IT Project Manager
As a member of the Red Bull North America Team, the IT Project Manager is responsible for coordinating, communicating, and delivering multiple enterprise technology projects within the eCommerce work…
Project Manager
Description/Comment: Job Description: As a Project Manager for the Global Hosting Program, you will be responsible for driving the successful migration of data center workloads into next-genera…
Associate Fraud Strategy Data Scientist San Jose, CA
Associate Fraud Strategy Data Scientist San Jose, CA Fraud Strategy Data Scientist, Risk Data Scientist w/Fraud, Risk Analytics, Data Analysis, Data Science, Fraud Mitigation, Industry: eCommerce, o…
Sr. Software Engineer, Delivery Platform
At Serve Robotics, we’re reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, make deliveri…
Sr. Software Engineer
Company Description About CyberArk : CyberArk (NASDAQ: CYBR ), is the global leader in Identity Security . Centered on privileged access management, CyberArk provides the most comprehens…