Staff Software Engineer, Ads ML Inference Infrastructure

Pinterest
Palo Alto, CA

Staff Software Engineer, Ads ML Inference Infrastructure

 

The Ads ML Inference Infra team owns the online inference and feature serving systems that power real-time model scoring and delivery for all Ads models at Pinterest. The team is looking for a staff engineer with strong hands-on experience in large-scale ML inference systems, as well as capabilities in solving ambiguous technical problems and driving strategic, cross-functional efforts.

 

What you’ll do:


  • Lead and drive efforts to build next-generation model inference and feature serving systems that power up to 100x larger models and directly uplevel Pinterest’s monetization business.

  • Design and optimize low-latency, high-throughput inference pipelines to meet strict SLOs while improving performance, efficiency, and cost .

  • Partner with Ads ML and product teams to productionize new model architectures (including LLMs and multi-stage ranking models) and scale them reliably to global traffic.

  • Evolve the online feature platform (feature computation, caching, and retrieval) to improve coverage, freshness, and consistency for Ads models.

  • Evaluate and integrate new technologies (e.g., GPU acceleration, model compression, Triton, vLLM, Dynamo ) to advance our inference stack.

  • Build strong partnerships with other infra and ML teams to improve end-to-end reliability, observability, and developer velocity for Ads ML.

  • Mentor and coach other engineers, guiding them through technical decisions, system design, and career development.

 

What we’re looking for:


  • BS (or higher) degree in Computer Science or a related field.

  • ~8+ years of relevant industry experience designing and operating large-scale, production ML or distributed infra systems .

  • Deep knowledge of at least one programming language ( Java, C++, Python ).

  • Deep experience with distributed systems or recommendation / ads serving infrastructure (e.g., request routing, online storage, caching, feature serving, APIs).

  • Hands-on experience with at least one deep learning framework ( PyTorch or TensorFlow ) and bringing models from offline experimentation to production.

  • [Preferred] Experience with model / hardware accelerator libraries (e.g., CUDA, quantization, distillation, low-precision inference).

  • [Preferred] Experience with inference optimization and serving frameworks such as Triton, vLLM, or Dynamo .

  • Proven track record of leading complex projects , setting technical direction, and collaborating across functions and orgs ; experience mentoring and coaching other engineers.

 

In-Office Requirement Statement:


  • We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.

  • This role will need to be in the office for in-person collaboration 1-2 times per week and therefore needs to be in a commutable distance from one of the following offices Palo Alto, CA; San Francisco, CA; Seattle, WA.

 

Relocation Statement:


  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

#LI-HYBRID

#LI-AG8

Posted 2026-02-13

Recommended Jobs

Tech Lead / Architect, Global Traffic Infrastructure

ByteDance
San Jose, CA

Location: San Jose Team: Technology Employment Type: Regular Job Code: A142931 Responsibilities About the Team The Global Traffic Infrastructure (GTI) team leverages …

View Details
Posted 2026-03-19

Software Engineer, Agentic AI Product - Moveworks

Servicenow
Mountain View, CA

Company Description Who we are Moveworks is the Agentic AI Assistant platform that empowers the entire workforce.  Our platform enables employees to converse with all of their business syst…

View Details
Posted 2026-02-19

Litigation Attorney (Temecula)

Jobot
Temecula, CA

Well decorated insurance defense firm with need in ATL office. This Jobot Job is hosted by: Kris Leishman Are you a fit? Easy Apply now by clicking the Apply button and sending us your resume. …

View Details
Posted 2026-03-27

Senior Software Engineer, Elixir

Wonderschool
San Francisco, CA

Position Summary: Wonderschool is harnessing the power of technology to provide comprehensive support to childcare providers operating out of their homes as well as in the government and non-profit…

View Details
Posted 2026-02-13

Account Executive - Sake

Mutual Trading Co., Inc.
El Monte, CA

Chinese Bilingual Account Executive - Sake Who we are: Established in 1926, Mutual Trading Co., Inc. was originally a small co-op organization for centralized purchasing of basic import foods…

View Details
Posted 2026-01-26

Barista

Infuse Hospitality
San Francisco, CA

Infuse Hospitality, part of the Phoenix3 Collective portfolio, is seeking a proactive, friendly, and detail-oriented Barista to serve as the welcoming face of our café and deliver an exceptional gu…

View Details
Posted 2026-03-25

Senior Java Developer

Two95 International Inc.
Sacramento, CA

Job Title: Senior Java Developer Location: Sacramento, CA (On-site) Duration: 6 Months (Possible Extension) Rate: $Open /hr. Job Summary We are seeking an experienced Senior Java Develo…

View Details
Posted 2026-03-22

Sales Account Representative I - Days (Harbor 2 - Corp Office)

Robinson Pharma
Santa Ana, CA

The Sales Account Representative I supports the Sales team by attending trade shows and participating in the development and implementation of sales and marketing strategies to create new contract …

View Details
Posted 2026-01-13

Data Scientist - Financial Modeling

SanDisk
Milpitas, CA

Company Description Sandisk understands how people and businesses consume data and we relentlessly innovate to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a r…

View Details
Posted 2026-03-13

Housekeeper

GreatAuPair LLC
Albany, CA

General cleaning, cooking and running errands as needed

View Details
Posted 2025-11-09