Staff Software Engineer: Microservice Infrastructure & Real-Time ML Inference
About the role
We're looking for a Staff Software Engineer (Backend) to design and build the next generation of our real-time translation infrastructure. You'll architect mission-critical microservices that power low-latency audio/video processing pipelines, working with cutting-edge speech recognition, translation, and voice synthesis technologies. You'll be instrumental in scaling our platform to handle millions of concurrent
streaming sessions while maintaining sub-100ms latency requirements. This role combines deep systems programming, distributed systems architecture, and cloud infrastructure expertise.
Mission & Scope
Own Sanas’ microservice and streaming architecture, that power sub-100 ms, real-time language translation in both B2B and B2C environments. Define Technical Strategy, align multiple teams, and raise the bar on reliability, performance, and reliability across regions.
What you'll do
- Lead the design for high-throughput, low-latency microservices that enable bidirectional streaming in Sanas’ audio/video pipelines.
- Build event/telemetry/feature pipelines (Kafka/Redis/DynamoDB) that support near-real-time decisions and model features at scale.
- Productionize model serving (Triton/vLLM/TorchServe), implement autoscaling/batching/shadow-deploys, and enforce p99/p999 budgets.
- Establish SLOs/error budgets, graceful degradation (keep call quality first), idempotency, circuit breakers, retries with jitter, and chaos drills.
- Lead Sanas-wide logging/metrics/tracing (OpenTelemetry), RED/USE dashboards, and symptom-based alerting.
- Drive cross-team designs, mentor seniors, lead postmortems/design reviews, and lay the foundation for shared libraries and patterns (auth, interceptors, tracing, schema rollout).
Qualifications
- 7+ years of Software Engineering experience, with a focus on distributed architecture and technical leadership.
- Strong proficiency in Python or Go; strong async/concurrency (asyncio/futures), profiling, and GC/heap tuning.
- Strong proficiency in Containerization and Orchestration: AWS/Azure, Terraform, Kubernetes, IaaC patterns and node pools. (CPU/GPU)
- Experience in ML Inference: Triton/vLLM/TorchServe; GPU scheduling/packing, batching, A/B and shadow traffic.
- Experience with gRPC/protobuf at scale (versioning, interceptors, performance tuning, and compatibility testing)
- Nice-to-have: Experience with WebRTC/SRTP, RTP/RTCP, NAT traversal STUN/TURN,, SIP interop; FFmpeg/codec tradeoffs.
- Nice-to-have: Experience in data streaming with Kafka, Redis, DynamoDB; exactly-once/at-least-once patterns; stream-batch bridges.
Recommended Jobs
Registered Dietitian Nutritionists
GoTo Telemed is seeking experienced, credentialed Registered Dietitian Nutritionists (RDN) to join our telehealth network. This flexible, 1099 independent contractor opportunity allows you to provide…
Staff Machine Learning Engineer - World Foundation Model
Woven by Toyota is enabling Toyota’s once-in-a-century transformation into a mobility company. Inspired by a legacy of innovating for the benefit of others, our mission is to challenge the current sta…
Heavy General labor (auto car parts) - Gardena CA 90249
Looking for general labor workers to work at our auto parts warehouse in Gardena CA 90249 Job description: Handling auto parts, such as alternators, and performing picking and packing duties, whic…
Library Program Manager I/II (Amended) (20707567)
Description The City of Woodland offers a competitive total compensation package including: ~Starting income ranging from $6,891.01 to $9,471.11 per month based on skills and years of experie…
Principal Land Surveyor
Principal Land Surveyor San Luis Obispo, CA About the Role An established civil design and land services group with decades of regional experience is seeking a Lead Land Surveyor t…
Legal Counsel
The Company: Faraday Future (FF) is a California-based mobility company, leveraging the latest technologies and world's best talent to realize exciting new possibilities in mobility. We're produci…
Driver
Overview Seeking dependable individuals who are trustworthy, responsible, and hard-working to pick up charity donations locally and in the surrounding areas. Drivers start pay is $21 per hour and…
C++ Embedded Developer
Job Title: Embedded Software Engineer Location: Sacramento, CA (McClellan AFB ) Type/Duration: 3-month contract to hire Required Qualifications: · Graduated with a Bachelor Degree in Compu…
Property Assistant Manager
Job Description Property Assistant Manager PeopleReady of Gardena, CA is now hiring Property Assistant Managers in Torrance, CA! As a Property Assistant Manager, you will perform necessary repa…
Personal Assistant
I am looking for an individual to keep up with ands, do shopping and pretty much anything else I need done. The candidate must be someone who can not only make my work and personal life more comfortab…