Senior Data Engineer

Toyota Research Institute
Los Altos, CA

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team in Automated Driving, Energy & Materials, Human-Centered AI, Human Interactive Driving, Large Behavioral Models, and Robotics.

The Automated Driving Advanced Development division at TRI will focus on enabling innovation and transformation at Toyota by building a bridge between TRI research and Toyota products, services, and needs. We achieve this through partnership, collaboration, and shared commitment. This new division is leading a new cross-organizational project between TRI and Woven by Toyota to conduct research and develop a fully end-to-end learned driving stack. This cross-org collaborative project is harmonious with TRI’s robotics divisions' efforts in Diffusion Policy and Large Behavior Models.

We are looking for a Senior Data Engineer to design and build the foundational data infrastructure and tools that power our autonomy research and development workflows. This includes large-scale ingestion pipelines, structured feature stores, labeling infrastructure, scene search and data discovery tools, and performance diagnostics for machine learning and simulation workflows.

Responsibilities

  • Design and implement scalable, production-grade pipelines for data ingestion, transformation, storage, and retrieval from vehicle fleets and simulation environments.
  • Build internal tools and services for data labeling, curation, indexing, and cataloging across large and diverse datasets.
  • Collaborate with ML researchers, autonomy engineers, and data scientists to design schemas and APIs that power model training, evaluation, and debugging.
  • Develop and maintain feature stores, metadata systems, and versioning infrastructure for structured and unstructured data.
  • Support the generation and integration of synthetic datasets with real-world logs to enable hybrid training and simulation workflows.
  • Optimize pipelines for cost, latency, and traceability, ensuring reproducibility and consistency across environments.
  • Partner with simulation and cloud platform teams to automate workflows for closed-loop testing, scenario mining, and performance analytics.

Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
  • 8+ years of experience building data-intensive software systems, ideally in robotics, autonomous driving, or large-scale ML environments.
  • Proficient in Python, SQL, and familiar with C++.
  • Experience designing ETL pipelines using modern frameworks (e.g., Apache Spark, Flyte, Union).
  • Strong knowledge of cloud-native architectures, including AWS services (e.g., S3, or equivalents (Google Cloud platform)
  • Familiarity with sensor data types (camera, lidar, radar, GPS/IMU) and common data serialization formats (e.g., protobuf. ROS2bag, MCAP).
  • Deep understanding of data quality, observability, and lineage in high-volume systems.
  • Track record of building reliable and performant infrastructure that supports both ad-hoc exploration and repeatable production workflows.

Bonus Qualifications

  • Experience in AD/ADAS, robotics, or autonomous systems — especially handling perception or planning datasets.
  • Familiarity with ML pipeline orchestration frameworks (e.g. Kubeflow, SageMaker, etc).
  • Experience working with temporal or spatial data, including geospatial indexing and time-series alignment.
  • Exposure to synthetic data generation, simulation logging, or scenario replay pipelines.
  • Strong software engineering fundamentals, CI/CD, testing, code review, and service deployment best practices.
  • Experience collaborating with cross-functional, distributed teams across research and production orgs.

Please include links to any relevant open-source contributions or technical project write-ups with your application.

The pay range for this position at commencement of employment is expected to be between $180,000 and $270,000/year for California-based roles; however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. Note that TRI offers a generous benefits package (including 401(k) eligibility and various paid time off benefits, such as vacation, sick time, and parental leave) and an annual cash bonus structure. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.

TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.

Posted 2025-09-22

Recommended Jobs

Workday AP Reporting Analyst

GTN Technical Staffing
California

Workday Reporting Analyst HIGHLIGHTS: Location: Fully Remote Position Type:  Contract/ Contract-to-Hire Work Authorization: U.S. Citizens and Green Card Holders Only Hourly R…

View Details
Posted 2025-09-25

Staff Full-Stack Engineer

Ambience Healthcare
San Francisco, CA

About Us: Ambience is developing the most capable AI systems for healthcare and medicine. As healthcare costs soar to 17.3% of US GDP and a projected shortage of 100,000 physicians within the next d…

View Details
Posted 2025-09-14

Sr SAP Product Manager QTC - Revenue

Palo Alto Networks
Santa Clara, CA

Company Description Our Mission At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vi…

View Details
Posted 2025-09-25

Explore Oakland: Pediatric Nursing in a Vibrant City!

NurseRecruiter
Oakland, CA

Registered Nurse - Pediatric - Travel - (Peds RN - Pedi RN) Embark on a rewarding journey as a Pediatric Nurse in Oakland, where you’ll provide compassionate care in a vibrant city. Join a dedicated …

View Details
Posted 2025-08-20

Esthetician (Trainer) LA

FACEGYM
West Hollywood, CA

Our Company Brand: FACEGYM is the first of its kind and a unique facial fitness experience, and is in the confidence-boosting business. Think of us as a complete gym workout for your face. We comb…

View Details
Posted 2025-10-31

Nurse Manager / Operating Room (San Pedro, California)

Our Lead Good Course LLC, DBA: HBAConnect
California

Client Details : Direct reports approximately 60. OR suite = 12. Bariatric Center of Excellence, a lot of ortho total joint procedures. No spine, heart, or neuro cases. Average cases = 50 daily…

View Details
Posted 2025-10-24

Dispatch Operating Center Specialist

Alstom
Los Angeles, CA

At Alstom, we understand transport networks and what moves people. From high-speed trains, metros, monorails, and trams, to turnkey systems, services, infrastructure, signalling and digital mobility,…

View Details
Posted 2025-08-11

Software Engineer, Dev Productivity

Openai
San Francisco, CA

About the Team The Applied AI team safely brings OpenAI's technology to the world. We released ChatGPT, Plugins, DALL·E, and the APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also operate i…

View Details
Posted 2025-10-31

Purchasing Manager

Pacific Fusion
San Leandro, CA

About Pacific Fusion Pacific Fusion was founded in 2023 with the mission to power the world with abundant, affordable, clean energy. We are rapidly designing and building a pulsed magnetic fusion…

View Details
Posted 2025-10-13

Project Manager – Solar Studio (Land Development)

Eden Capital Careers
San Diego, CA

Location:  San Diego, CA 92122 Salary Range:  $118,000 – $142,000 USD per year   About Our Client Our client is an employee-owned civil engineering firm known for its collaborative cul…

View Details
Posted 2025-10-25