AI Capacity Planning & Procurement Manager

Fireworks Ai
Redwood City, CA

About Us:


Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from Pytorch and Google Vertex AI.

The Role:


We are an AI company building reliable, high-performance model serving infrastructure. Our customers run mission-critical workloads and expect consistency, cost transparency, and predictable scaling. Capacity — GPU, network, and cloud economics — is existential for us. You will be one of the earliest hires focused on making sure we always have the right compute, at the right price, ahead of demand.

Key Responsibilities:



  • Architect the Multi-Year Capacity Strategy: Develop and own the strategic, multi-year capacity plan by synthesizing inputs from the company strategy, business forecast, Industry trend and Engineering’s product roadmap.

  • Infrastructure Cost Management: Directly manage the biggest cost on our P&L. Translate market insights, technology advancement, and forward-looking plans into rigorous financial models to enable fast business growth while minimizing Total Cost of Ownership (TCO) per unit of performance (e.g., TFLOP). Partner with infra, finance, and GTM to codify capacity strategy into budgets and KPIs

  • Build market intelligence with advanced compute technology roadmap, lead time, and pricing for relevant SKUs (H200/B200/B300/GB200/GB300/MI355, networking, storage)

  • Own end-to-end procurement of cloud capacity (GPUs, storage, networking, etc) across multiple vendors (cloud, bare-metal, colocation, integrators, brokers) including RFQs/RFPs, commercial Negotiation.

  • Define Capacity Management discipline with processes and tools: inventory, allocation, unit economics, cost attribution and optimization

  • Stand up processes for disciplined renewals , rev-share allocations, and hedging against supply shocks

Minimum Qualifications:



  • 5–10+ years of experience in capacity management, sourcing, data center supply, cloud procurement, infrastructure operations, or a related field (e.g., hyperscaler, colocation provider, OEM, or LLM infrastructure startup)

  • Demonstrated ability to develop and communicate multi-year capacity plans that align business, product, and financial objectives

  • Strong knowledge of GPU and server SKUs, networking topologies, power and space constraints, and a high-level understanding of the global AI compute supply and demand landscape

  • Proven experience negotiating contracts valued at seven figures or higher, with a solid grasp of key terms such as commit profiles, flexibility clauses, drawdowns, credits, and SLA remedies

  • Proficiency in unit economics and scenario modeling, including total cost of ownership (TCO) and TFLOP-month analysis

  • Ability to operate effectively in a fast-paced, low-process environment and contribute to company-defining initiatives

Preferred Qualifications:



  • Established network across major capacity suppliers, including NVIDIA partners, cloud providers, brokers, integrators, and colocation vendors

  • Experience connecting capacity planning with real-world model serving workload patterns and performance requirements

  • Background in AI infrastructure environments, such as hyperscaler clouds or AI infrastructure startups

Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.

Base Pay Range (Plus Equity)

$150,000 - $250,000 USD

Why Fireworks AI?



  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.

  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.

  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.

  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.

Posted 2025-10-31

Recommended Jobs

Machine learning scientist risk

Cash App
California

Since we opened our doors in 2009, the world of commerce has evolved immensely, and so has Square. After enabling anyone to take payments and never miss a sale, we saw sellers stymied by disparate, …

View Details
Posted 2025-10-27

Senior Frontend Engineer, AI Agents

Amplitude
San Francisco, CA

Amplitude is the leading digital analytics platform that helps companies unlock the power of their products. Over 4,300 customers, including Atlassian, NBCUniversal, Under Armour, Square, and Jersey …

View Details
Posted 2025-10-01

Staff AI Implementation Engineer

Servicenow
Santa Clara, CA

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow st…

View Details
Posted 2025-09-22

IT Technician

American Advanced Management
Kentfield, CA

DESCRIPTION OF POSITION This job description is a record of the essential functions of the listed job. The job description provides the employee, CEO, Human Resources, applicants, and other agenci…

View Details
Posted 2025-09-22

Account Manager

Nobilitas Group LLC
Bakersfield, CA

Description: Our Client has future and immediate hiring needs for experienced Account Managers/District Representatives in Bakersfield, CA to support our Pacific Sales team in Chemical Technologie…

View Details
Posted 2025-09-10

Parts Driver

Mercedes-Benz of San Francisco
San Francisco, CA

Job Summary: We are looking for a Parts Driver to join our growing team! The right candidate will have strong communication skills and a positive attitude. The day-to-day duties of this role include o…

View Details
Posted 2025-10-15

Software Engineer, Trust

Notion
San Francisco, CA

About Us: We're on a mission to make it possible for every person, team, and company to be able to tailor their software to solve any problem and take on any challenge. Computers may be our most p…

View Details
Posted 2025-09-22

Senior BMS HIL Test Engineer

Ford
Long Beach, CA

In this position... • Operate and support dSPACE and Typhoon based HIL test system (on-site). • Create HIL test strategies, test cases, and scripts based on client, internal, and regulatory requi…

View Details
Posted 2025-09-13

Director, Customer Care Product Operations

Earnin
Mountain View, CA

About EarnIn As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living…

View Details
Posted 2025-09-22

Staff Product Manager, New Products

Calendly
San Francisco, CA

About the team & opportunity What’s so great about working on Calendly’s Product team? We design seamless product experiences that delight our customers. Calendly takes the work out of schedul…

View Details
Posted 2025-09-28