AI Capacity Planning & Procurement Manager
About Us:
Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from PyTorch and Google Vertex AI.
The Role:
We are an AI company building reliable, high-performance model serving infrastructure. Our customers run mission-critical workloads and expect consistency, cost transparency, and predictable scaling. Capacity — GPU, network, and cloud economics — is existential for us. You will be one of the earliest hires focused on making sure we always have the right compute, at the right price, ahead of demand.
Key Responsibilities:
- Architect the Multi-Year Capacity Strategy: Develop and own the strategic, multi-year capacity plan by synthesizing inputs from the company strategy, business forecast, industry trends, and Engineering’s product roadmap.
- Infrastructure Cost Management: Directly manage the biggest cost on our P&L. Translate market insights, technology advances, and forward-looking plans into rigorous financial models to enable fast business growth while minimizing Total Cost of Ownership (TCO) per unit of performance (e.g., TFLOP). Partner with infra, finance, and GTM to codify capacity strategy into budgets and KPIs.
- Build market intelligence on advanced compute technology roadmaps, lead times, and pricing for relevant SKUs (H200/B200/B300/GB200/GB300/MI355, networking, storage)
- Own end-to-end procurement of cloud capacity (GPUs, storage, networking, etc.) across multiple vendors (cloud, bare-metal, colocation, integrators, brokers), including RFQs/RFPs and commercial negotiation.
- Define the capacity management discipline with processes and tools: inventory, allocation, unit economics, cost attribution, and optimization
- Stand up processes for disciplined renewals, rev-share allocations, and hedging against supply shocks
Minimum Qualifications:
- 5–10+ years of experience in capacity management, sourcing, data center supply, cloud procurement, infrastructure operations, or a related field (e.g., hyperscaler, colocation provider, OEM, or LLM infrastructure startup)
- Demonstrated ability to develop and communicate multi-year capacity plans that align business, product, and financial objectives
- Strong knowledge of GPU and server SKUs, networking topologies, power and space constraints, and a high-level understanding of the global AI compute supply and demand landscape
- Proven experience negotiating contracts valued at seven figures or higher, with a solid grasp of key terms such as commit profiles, flexibility clauses, drawdowns, credits, and SLA remedies
- Proficiency in unit economics and scenario modeling, including total cost of ownership (TCO) and TFLOP-month analysis
- Ability to operate effectively in a fast-paced, low-process environment and contribute to company-defining initiatives
Preferred Qualifications:
- Established network across major capacity suppliers, including NVIDIA partners, cloud providers, brokers, integrators, and colocation vendors
- Experience connecting capacity planning with real-world model serving workload patterns and performance requirements
- Background in AI infrastructure environments, such as hyperscaler clouds or AI infrastructure startups
Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.
Base Pay Range (Plus Equity)
$150,000 - $250,000 USD
Why Fireworks AI?
- Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
- Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
- Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
- Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.
Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.