Staff Full Stack Engineer, Speech Infrastructure - USA
About Inworld
At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their users. We are building an intelligent runtime to conquer these monsters and make this vision a reality.
We are backed by investors such as Lightspeed, Section 32, Kleiner Perkins, Microsoft’s M12 venture fund, BITKRAFT, Founders Fund, and First Spark Ventures. Our technology is used by category leaders, including NVIDIA, Microsoft Xbox, Niantic, Wishroll, Little Umbrella and Streamlabs, among many others. Inworld has been recognized by CB Insights as one of the 100 most promising AI companies globally and has been named one of LinkedIn's Top 10 Startups in the USA.
About the role
Our intelligent runtime must seamlessly connect to foundational models to power real-time, interactive experiences. For this to be possible at scale, the infrastructure that serves these models, especially for demanding tasks like Text-to-Speech (TTS) and Speech-to-Text (STT), must be exceptionally fast, reliable, and cost-effective.
We are seeking a Full Stack Engineer to build this critical infrastructure. You will be responsible for designing, building, and scaling the full-stack systems that serve our voice production models. Your work will focus on the difficult engineering problems of building a robust, low-latency platform that forms the backbone of the next generation of AI-driven software.
Responsibilities
Design, develop, and scale the complete infrastructure that powers cutting-edge TTS, STT, and other real-time voice AI capabilities.
Engineer robust deployment systems for speech models on Kubernetes with PyTorch, ensuring high availability and low latency for our intelligent runtime.
Write clean, high-performance backend services and APIs in Python, Java/Kotlin, and Go to handle audio processing, model inference, and complex data pipelines.
Create and maintain internal web applications and dashboards using Node.js to enable teams to monitor, debug, and manage speech systems effectively.
Collaborate closely with ML engineers to bridge the gap between cutting-edge research models and production-ready solutions that can serve millions of users.
Qualifications
A BA/BS degree in Computer Science or a related technical field, or equivalent practical experience.
5+ years of professional experience in full-stack or backend software development.
Demonstrated experience building production-grade application APIs that span both backend and frontend stacks.
Strong proficiency in Python and demonstrated experience with one or more of the following: Java/Kotlin, Go, or Node.js.
Hands-on experience building and maintaining production systems using containerization (Docker) and orchestration (Kubernetes).
Experience with or a strong interest in the infrastructure challenges of deploying ML models, particularly with frameworks like PyTorch.
Solid foundation in data structures, algorithms, and system design.
A good fit for this role may have
A passion for learning and staying up-to-date with the latest advancements in AI infrastructure and ML systems.
Direct experience building infrastructure for speech processing (TTS/STT) or other real-time ML applications.
Ability to work collaboratively in a fast-paced environment with shifting priorities.
Familiarity with MLOps best practices and tools.
Experience with cloud platforms like GCP or AWS.
We believe in the power of in-person collaboration to solve the hardest problems and foster a strong team culture. We offer relocation assistance and look forward to you joining us in our Mountain View office.
The base salary range for this full-time position is $200,000 - $300,000, plus bonus, equity, and benefits.