Senior Distributed System Engineer (San Francisco)
Location
San Francisco
Employment Type
Full time
Location Type
On-site
Department
Engineering
Inference.net is seeking a Senior Distributed ML Systems Engineer to join our team. This role involves developing large-scale, fault-tolerant distributed systems that handle millions of large language model inference requests per day. If you are passionate about developing next-generation ML systems that operate at scale, we want to hear from you.
You will be responsible for designing and implementing the core systems that power our globally distributed LLM inference network. You'll work on problems at the intersection of distributed systems, machine learning, and resource optimization.
About Inference.net
We are building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute that can be used for running large-language models like DeepSeek and Llama 4. At any given moment, we have over 5,000 GPUs and hundreds of terabytes of VRAM connected to the network.
We are a small, well-funded team working on difficult, high-impact problems at the intersection of AI and distributed systems. We primarily work in-person from our office in downtown San Francisco. Our investors include A16z CSX and Multicoin. We are high-agency, adaptable, and collaborative. We value creativity alongside technical prowess and humility. We work hard, and deeply enjoy the work that we do.
Key Responsibilities
Design and implement scalable distributed systems for our inference network
Develop models for efficient resource allocation across a network of heterogeneous hardware and quickly changing topology
Optimize network latency, throughput, and availability
Build robust logging and metrics systems to monitor network health and performance
Conduct reviews of architecture and system design to ensure use of best practices
Collaborate with founders, engineers, and other stakeholders to improve our infrastructure and product offerings
What We're Looking For
Very strong problem-solving skills and ability to work in a startup environment
5+ years of experience in building high performance systems
Strong programming skills in Typescript, Python, and one of Go, Rust, or C++
Solid understanding of distributed systems concepts
Knowledge of orchestrators and schedulers like Kubernetes and Nomad
Use of AI tooling in development workflow (ChatGPT, Claude, Cursor, etc)
Experience with LLM inference engines like vLLM or TensorRT-LLM is plus
Experience with GPU programming and optimization (CUDA experience is a plus)
Experience with Postgres or NATS.io is a bonus
Compensation
We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is $180,000 - $250,000, plus equity and benefits, depending on experience.
Equal Opportunity
Inference.net is an equal opportunity employer. We welcome applicants from all backgrounds and don't discriminate based on race, color, religion, gender, sexual orientation, national origin, genetics, disability, age, or veteran status.
J-18808-Ljbffr
#J-18808-LjbffrRecommended Jobs
Instructional Aide
Starting Rate: $18 /hour Environment: Special Education Program, Grades K-12 Spectrum Center Schools and Programs , a growing, dynamic organization with a social mission to offer hope is seeki…
Retail Associate PT
Carlisle Gift Shop, Walnut Creek, OH is a great place to start or develop your career in hospitality to learn skills you’ll use for the rest of your life. If you enjoy sharing hospitality with othe…
Public Works Construction Inspector
Job Description Job Description Description: 4LEAF, Inc. ("4LEAF") is a California-based professional services firm specializing in Construction Management, Inspection, Plan Review, Planning, an…
Backend Engineer - Enterprise Agent
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on e…
Senior Product Manager
Based in San Francisco, Arine is a rapidly growing healthcare technology and clinical services company with a mission to ensure individuals receive the safest and most effective treatments for thei…
Customer Service Representative POST NUMBER: 441363
Job Description: We are seeking a detail-oriented and customer-focused Customer Service Representative to join our client’s team in Irvine, CA . In this role, you will be responsible for proces…
Associate Veterinarian - Folsom, CA - #7291
Associate Veterinarian - Folsom, CA - #7291 Come work in a collaborative and fun working environment with a great clientele! We are seeking a compassionate and dedicated Associate Veterinarian to jo…
Fragrance & Beauty Advisor
Chanel seeks a Fragrance & Beauty Advisor in San Diego to serve as a brand ambassador, providing exceptional client service and achieving sales targets. The role involves maintaining a seamless omni-c…
Account Manager
Job Description Job Description Company Description American Iron & Metal (AIM) is a family-owned company and recognized global leader in the metal recycling industry with more than 125 site…
Project Manager
Description: Title: Project Manager – Science Education & Community Must be local Thousand Oaks, CA campus and willing to come onsite a few days per month. Part time role, 20 hours/week. Dura…