Staff Site Reliability Engineer (SRE)
Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology. The flagship product, an AI-driven, non-invasive cardiac test supported by the ACC/AHA Chest Pain Guidelines called the Heartflow FFR CT Analysis, provides a color-coded, 3D model of a patient’s coronary arteries indicating the impact blockages have on blood flow to the heart. Heartflow is the first AI-driven non-invasive integrated heart care solution across the CCTA pathway that helps clinicians identify stenoses in the coronary arteries (RoadMap™ Analysis), assess coronary blood flow (FFR CT Analysis), and characterize and quantify coronary atherosclerosis (Plaque Analysis). Our pipeline of products is growing and so is our team; join us in helping to revolutionize precision heartcare.
Heartflow is a publicly traded company (HTFL) that has received international recognition for exceptional strides in healthcare innovation, is supported by medical societies around the world, is cleared for use in the US, UK, Europe, Japan, and Canada, and has been used for more than 400,000 patients worldwide.
Heartflow is transforming cardiovascular care with cutting-edge, non-invasive technology. We are launching a massive Platform Modernization initiative to power the next generation of our life-saving medical products.
We're looking for an experienced Site Reliability Engineer (SRE) to join our cloud-native infrastructure team. You will work closely with our Platform engineers and development teams to ensure our critical systems are highly available, scalable, observable, and performant. If you thrive on eliminating toil, automating complex operations, and defining the standards for production excellence, we want to talk to you.
Job Responsibilities
As our Staff SRE, you'll be the primary expert responsible for our entire compute ecosystem, operating at the highest level of technical expertise and influence. You won't just solve problems; you'll prevent them at a fundamental level across organizational boundaries. Your key responsibilities will include:
- Design, implement, and lead large-scale, cross-functional projects to improve the reliability, performance, and efficiency of our core services and infrastructure (10× impact).
- Drive the reduction of toil by developing and deploying sophisticated automation tools and frameworks, championing the "everything as code" philosophy.
- Serve as a technical escalation point for critical incidents, perform deep-dive root cause analyses (RCAs), and implement robust corrective measures to prevent recurrence.
- Define and implement SLOs, SLIs, and Error Budgets for critical services. Enhance our monitoring, logging, and tracing systems to provide comprehensive visibility into system health.
- Set the technical direction and best practices for the entire SRE and engineering organization. Mentor mid-level and senior engineers on design patterns, operational rigor, and reliability principles.
We're looking for a leader and a deep technical expert with a proven track record of solving the hardest scaling and reliability challenges.
Required Qualifications
- 8+ years of progressive experience in Site Reliability Engineering, Production Engineering, or a closely related role.
- Expert-level proficiency with AWS, including networking, compute, and storage.
- Deep expertise in Kubernetes and the cloud-native ecosystem.
- Fluency in at least one major scripting/programming language for automation and tooling (e.g., Python, Go, or Java).
- Solid experience with monitoring and logging solutions (Datadog).
- Proven ability to design and implement robust, highly available distributed systems.
- Demonstrated experience with Infrastructure as Code tools like Terraform.
- Exceptional communication skills, capable of explaining complex technical issues to both technical and non-technical audiences.
Nice-to-Have
- Experience implementing Service Mesh technologies (e.g., Istio, Linkerd).
- A strong understanding of security principles and practices in a cloud environment.
- Certifications such as CKA (Certified Kubernetes Administrator) or CKAD (Certified Kubernetes Application Developer).
#sre #kubernetes #openrole
A reasonable estimate of the base salary range is $185,750 to $250,922, plus cash bonus and equity. #LI-IB1 #LI-Hybrid
Heartflow is an Equal Opportunity Employer. We are committed to a work environment that supports, inspires, and respects all individuals and do not discriminate against any employee or applicant because of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, veteran status, or any other status protected under federal, state, or local law. This policy applies to every aspect of employment at Heartflow, including recruitment, hiring, training, relocation, promotion, and termination.
Positions posted for Heartflow are not intended for or open to third-party recruiters or agencies. Any unsolicited resumes submitted for these positions will be considered free referrals.
Heartflow has become aware of a fraud in which unknown entities pose as Heartflow recruiters in an attempt to obtain personal information from individuals as part of our application or job offer process. Before providing any personal information to outside parties, please verify the following: A) all legitimate Heartflow recruiter email addresses end with “@heartflow.com” and B) the position described is found on our careers site.