Lead Site Reliability Engineer

Glean
Palo Alto, CA

About Glean:

Founded in 2019, Glean is an innovative AI-powered knowledge management platform designed to help organizations quickly find, organize, and share information across their teams. By integrating seamlessly with tools like Google Drive, Slack, and Microsoft Teams, Glean ensures employees can access the right knowledge at the right time, boosting productivity and collaboration. The company’s cutting-edge AI technology simplifies knowledge discovery, making it faster and more efficient for teams to leverage their collective intelligence.

Glean was born from Founder & CEO Arvind Jain’s deep understanding of the challenges employees face in finding and understanding information at work. Seeing firsthand how fragmented knowledge and sprawling SaaS tools made it difficult to stay productive, he set out to build a better way - an AI-powered enterprise search platform that helps people quickly and intuitively access the information they need. Since then, Glean has evolved into the leading Work AI platform, combining enterprise-grade search, an AI assistant, and powerful application- and agent-building capabilities to fundamentally redefine how employees work.

About the Role:

Glean is seeking a Site Reliability Engineering Lead to foster a culture of engineering excellence, drive technical strategy, and develop a high-performing, collaborative team. Your role is pivotal in ensuring our services meet stringent Service Level Objectives (SLOs) and in building resilient, automated production environments in the cloud. You'll lead a team and be responsible for products globally, providing technical leadership to key projects and empowering your team to do the same.

Much of our software development focuses on building infrastructure to scale our operations in a hybrid cloud environment and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale and fast growth which are unique to Glean, while using your expertise in coding, algorithms, problem-solving, and SRE practices. We keep Glean applications up and running, ensuring our customers have the best and most reliable experience possible.

You are:



  • Technical Leadership and Mentorship : Play a key role in driving technical excellence and fostering a culture of reliability across engineering teams. You will lead by example, setting best practices for incident management, performance optimization, and automation. Influence best practices, drive cross-team collaborations, and contribute to the execution of key objectives in alignment with engineering leadership and cross-functional partners. Establish strong technical credibility, shaping architectural decisions and ensuring the delivery of high-quality, reliable systems.

  • Ensure High Availability: Implement and maintain resilient cloud architectures, monitor system performance, and proactively identify and resolve potential bottlenecks or points of failure.

  • Incident Management: Participate in primary oncall rotation; cultivate technical curiosity and growth mindset, and a blameless postmortem culture within the team. Continuously optimize the on-call process for sustainability and efficiency.

  • Automation and Tooling: Develop and maintain automation scripts, tools, and processes to streamline system deployment, monitoring, and management tasks. Your contributions will be vital in efficiently scaling cloud operations.

  • Performance Optimization: Optimize cloud infrastructure and applications for performance, scalability, and cost-effectiveness.

  • Security and Compliance: Collaborate with security engineers to implement best practices and ensure compliance with security standards and policies.

  • Monitoring and Alerting: Design and configure advanced monitoring systems to gain insights into system behavior, set up alerts, and respond proactively to potential issues. Create and maintain comprehensive dashboards and playbooks for production on-call.

  • Software Development Consultation: Engage actively in the entire software development lifecycle. Participate in system design reviews and provide valuable SRE insights during launch reviews, influencing and enhancing system architecture.

About you:



  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.

  • 8+ years of experience in a senior-level role within Site Reliability Engineering or similar role, particularly in managing cloud-based services and infrastructure.

  • 5+ years of experience with software development in one or more programming languages.

  • 3+ years of experience managing people or teams, leading projects, and designing, analyzing, and troubleshooting distributed systems running in Cloud.

  • Strong knowledge of cloud platforms such as Google Cloud Platform, AWS, or Azure.

  • Practical experience with containerization technologies, including Docker and Kubernetes. Familiarity with infrastructure as code tools like Terraform is essential.

  • Solid understanding of networking, security principles, and best SRE and security practices.

  • Proficiency in using monitoring and alerting tools to detect and respond to potential issues effectively

Location:



  • This role is hybrid (4 days a week in one of our Palo Alto Office)

Compensation & Benefits:

The standard base salary range for this position is $200,000 - $260,000 annually. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

We offer a comprehensive benefits package including competitive compensation, Medical, Vision, and Dental coverage, generous time-off policy, and the opportunity to contribute to your 401k plan to support your long-term goals. When you join, you'll receive a home office improvement stipend, as well as an annual education and wellness stipends to support your growth and wellbeing. We foster a vibrant company culture through regular events, and provide healthy lunches daily to keep you fueled and focused.

We are a diverse bunch of people and we want to continue to attract and retain a diverse range of people into our organization. We're committed to an inclusive and diverse company. We do not discriminate based on gender, ethnicity, sexual orientation, religion, civil or family status, age, disability, or race.

#LI-HYBRID

Posted 2026-02-10

Recommended Jobs

Sales / Preconstruction (Mechanical Contractor)

K2 Staffing
San Diego, CA

Summary A well-established mechanical contracting firm based in San Diego, specializing in commercial HVAC, plumbing, piping, and building systems, is seeking an experienced Sales / Preconstructi…

View Details
Posted 2025-10-31

Commercialization / GTM Manager

zoox
Foster, CA

This role functions as the lead responsible for driving the commercial success and market health of Zoox's service in key launch cities. Operating with an entrepreneurial, metrics-driven mindset, thi…

View Details
Posted 2025-11-18

Housekeeper

GreatAuPair LLC
Ladera Ranch, CA

Get hired for Nazila's housekeeper Job in Ladera Ranch, CA. Our family needs a house keeper. Find housekeeper care work in Ladera Ranch.

View Details
Posted 2025-11-09

Manufacturing Engineer (Aerospace) (Torrance)

Jobot
Torrance, CA

Join a stable and growing aerospace manufacturer as a hands-on Manufacturing Engineer where you’ll drive real impact on precision production, lead continuous improvement initiatives, and work with cu…

View Details
Posted 2026-03-12

Certified Engineering Geologist (Clovis)

Jobot
Clovis, CA

Revenue Cycle Analyst (Tableau) - 100% Remote / Fortune 500 / Great Benefits This Jobot Job is hosted by: Joseph Sipocz Are you a fit? Easy Apply now by clicking the Apply button and sending us…

View Details
Posted 2026-03-27

Yard Duty -Cultivate a Safe & Respectful School Environment

Escuela Popular
San Jose, CA

Job Title: Yard Duty - Cultivate a Safe & Respectful School Environment Promote Safety, Respect, and Positive School Culture At Escuela Popular, we believe a great education is built on a found…

View Details
Posted 2026-01-31

Senior Accounts Payable Specialist #4533

Grail
Menlo Park, CA

Our mission is to detect cancer early, when it can be cured. We are working to change the trajectory of cancer mortality and bring stakeholders together to adopt innovative, safe, and effective techn…

View Details
Posted 2026-03-16

Fullstack Software Engineer (New Grad)

Patreon
San Francisco, CA

Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans a…

View Details
Posted 2026-03-10

Full Stack Engineer Manufacturing Test

Cerebras Systems
Sunnyvale, CA

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programmi…

View Details
Posted 2026-02-28

Electronics Manufacturing Programmer

Quintech Electronics & Communications
Anaheim, CA

Quintech Electronics and Communications, Inc. is seeking a Surface Mount Technology (SMT) Programmer whose main responsibilities will include creating, updating and maintaining SMT programs includin…

View Details
Posted 2026-03-22