AI Performance Software Engineer (San Francisco)
AI Performance Engineer CUDA & PyTorch Focus
Location : San Fransisco, CA
Compensation : $200,000-$300,000
All potential candidates should read through the following details of this job with care before making an application.
A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.
This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.
What Youll Do :
- Drive core research and implementation of performance optimizations for modern AI models
- Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
- Design and build scalable, distributed compute strategies across GPU-based systems
- Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
- Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency
What We're Looking For :
- Strong background in CUDA and low-level GPU performance tuning
- Proven experience building with PyTorch and deploying high-performance ML models
- Proficiency in Python and C++
- Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
- Exposure to AI compilers or frameworks like MLIR is a plus
- Interest in system design, scalability, and accelerating LLM workloads in real production environments
If youve spent your time making large models faster, leaner, and more efficientand want to solve hard technical problems at the core of GenAI infrastructurethis role is for you.
Reach out to learn more.
#J-18808-LjbffrRecommended Jobs
Floor Coordinator (St. Helena)
Position Objective The Floor Coordinator works within the store management team to help achieve store sales goals and maximize profitability. Through effective management in partnership with the Sto…
Director, Alcatraz Retail
Organization Description: Since 1981, the Golden Gate National Parks Conservancy (Parks Conservancy) has been the nonprofit partner of the National Park Service. Working alongside the Presidio Trust,…
Attending Physiatrist, Musculoskeletal Medicine
Job Details: Job Summary: The Westchester Medical Center Department of Physical Medicine and Rehabilitation is seeking an Attending Physiatrist-Musculoskeletal Medicine to join our growing program…
Sheriff's Correctional Deputy I/II
Location 525 W. Sycamore Street Willows, 95988 Description This position performs a variety of work in the monitoring of county, state and federal detainees and maintains the security an…
DataOps Analyst with .NET
8+ years of progressively increasing responsibility and experience within a software engineering environment which provides the necessary skills, knowledge, and abilities. Experience serving as a s…
Human Resources Specialist
Purpose of Job The HR Specialist plays a critical role in supporting the Human Resources department by focusing on compliance, HR data analytics, and process optimization. This position is ideal f…
Property Accountant
Ethan Conrad Properties, Inc. is one of the largest and the fastest growing Commercial Real Estate Companies in Sacramento, CA. With over 11.6MM square feet, over 170 properties, and over 250 buildi…
Sales Executive, Director
Calling all innovators - find your future at Fiserv. We're Fiserv, a global leader in Fintech and payments, and we move money and information in a way that moves the world. We connect financial …
Quality Assurance Officer (San Francisco)
Join to apply for the Quality Assurance Officer role at Kennedy Jenks 2 days ago Be among the first 25 applicants Join to apply for the Quality Assurance Officer role at Kennedy Jenks Fou…
Computer Vision and Machine Learning Engineer (Sunnyvale)
Computer Vision and Machine Learning Engineer Sunnyvale, California, United States | Machine Learning and AI Description We are seeking a proactive Computer Vision and Machine Learning Engin…