Software Engineer, Multimedia
About Us:
Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from Pytorch and Google Vertex AI.
The Role:
We're looking for a strong Backend Infrastructure Engineer to help accelerate our multimedia AI capabilities. You'll build and optimize the infrastructure powering state-of-the-art multimodal AI including vision-language models (VLMs), and speech AI models. You'll focus on achieving industry-leading latency and throughput across diverse multimedia workloads. You'll develop infrastructure for features like VLM fine-tuning, real-time voice processing pipelines, and model enablement on the latest hardware. You'll be instrumental in helping us capture significant ARR growth in the multimedia AI space while ensuring we deliver the fastest, most reliable multimodal platform in the market.
Key Responsibilities:
- Collaborate with ML engineers and researchers to productionize models and support evolving multimedia capabilities
- Identify, profile and address performance bottlenecks across the stack, from media preprocessing to vision/audio encoders to the core inference engine
- Ensure high reliability, observability, and security across backend systems.
- Own the enablement and optimization of new model releases, ensuring we consistently deliver the fastest implementations in the market.
- Build and maintain performant APIs and services
- Collaborate closely with customers and sales teams to implement custom features and optimizations that drive ARR growth
- Propose new roadmap items based on customer needs.
Minimum Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- 3+ years of experience as a backend or infrastructure engineer, ideally supporting ML/AI systems or data-intensive workloads.
- Experience with PyTorch and deep learning frameworks for inference and training.
- Strong programming skills in Python and/or Go, with a track record of building reliable distributed backend systems.
- Experience with cloud platforms (e.g., AWS, GCP), infrastructure-as-code tools (e.g., Terraform), and containerization/orchestration tools (e.g., Docker, Kubernetes).
Preferred Qualifications:
- Experience supporting ML workloads in production (model fine-tuning, distributed training, inference optimization)
- Experience working directly with LLMs, vision-language models, audio models (ASR, TTS) or other multimodal AI systems in production environments
- Experience with performance optimization and profiling for high-throughput systems
- Knowledge of model quantization, speculative decoding, or other ML optimization techniques
Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.
Base Pay Range (Plus Equity)
$170,000 - $240,000 USD
Why Fireworks AI?
- Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
- Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
- Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
- Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.
Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.
Recommended Jobs
Bilingual DME Office Assistant
Bilingual Assistant needed for Durable Medical Equipment office in Van Nuys. Looking for someone who can work independently, handle office tasks, is detail-oriented, organized, and has excellent c…
Entry Level Software Developer
Job Title: Software Developer Department: Technology Reports To: Director of Technology Location: Windsor, CO (preferred), Fort Worth, TX, or Houston, TX The Software Developer is a ke…
Software Engineer II
*DELETE AS APPROPRIATE** - please leave the relevant location tag for LinkedIn #LI-Remote #LI-Onsite #LI-Hybrid Who We Are 2K is headquartered in Novato, California and is a wholly owned la…
Community Real Estate Accountant
Title: CRE Staff Accountant Reports to: CRE Finance Director Pay Range: $78,000 - $87,000 Location: San Francisco, CA. This is a hybrid position, and candidates must be able to work three da…
Sales Associate
Christian Dior Couture seeks a Sales Associate for its Beverly Hills location. The role involves delivering outstanding client service, achieving sales targets, and building client relationships. Cand…
Software Development Engineer in Test - Linux
Zoox seeks a Software Development Engineer in Test to join our Embedded Linux team. In this role, you will lead efforts to design, build, maintain, and improve coverage of a Hardware-In-the-Loop Cont…
Intern, Industrial Design
Ammunition is an international design group providing services in product design, brand strategy and identity, UX design, graphic design, and packaging. While Ammunition’s strengths are diverse across…
Senior Backend Software Engineer - Oakland (Hybrid)
We help companies stay secure while moving fast. Built by engineers for engineers, The Teleport Access Platform delivers on-demand, least privileged access to infrastructure based on cryptographic…
Flight Software Engineer (Senior)
As a member of the flight Software team, you will design, develop, and own the software driving the autonomous operation of Apex’s satellite buses. You will be responsible for developing mission crit…
Software Engineer, Generalist
Who We Are Sauron protects your family and home, bringing the innovations of autonomous robots and self-driving cars to residential security. Our team is led by veteran entrepreneurs and roboticists…