Staff Site Reliability Engineer (Mobile)

PayPal
San Jose, CA

Manage and deliver large-scale reliability improvement projects, ensuring systems are performant, available, and resilient. Drive the identification of performance bottlenecks and lead initiatives to optimize and scale critical systems and services. Architect and implement scalable infrastructure solutions to support growing user demands while maintaining system reliability. Lead the design and enhancement of monitoring frameworks, ensuring systems are highly observable, and support the response to production incidents. Take ownership of improving system resilience by designing fault-tolerant architectures and implementing disaster recovery strategies. Lead capacity planning initiatives to ensure system resources are proactively managed, preventing downtime or performance degradation under high load.

Work closely with development, operations, and other technical teams to ensure seamless system integration and align on best practices for reliability. Act as a technical mentor within the organization, guiding teams through complex reliability challenges and promoting a culture of excellence. Help define and execute long-term reliability engineering strategies and standards to ensure the scalability and performance of core services. Develop and enforce best practices for operational excellence, including automation, incident management, and system monitoring, across engineering teams.

Standards & Governance: Define mobile-specific SLIs/SLOs (e.g., crash-free sessions, ANRs, startup times, network success rates) and establish observability and alerting best practices in Datadog. Ensure consistency in how mobile reliability is measured and tracked across iOS and Android teams.

Tooling & Automation: Lead development of reliability tools and automation, covering regression detection, performance benchmarking, and release health dashboards. Integrate crash/ANR triage systems with Datadog, Crashlytics, and CI/CD pipelines (Harness, Gradle, Bazel).

Cross-Team Leadership: Act as liaison with backend/web SRE teams to ensure unified visibility and incident response. Partner with Product, QA, and Release Engineering to meet operational readiness standards and influence architecture for reliability from design to delivery.

Cultural Enablement & Mentorship: Lead the rollout of on-call practices, incident response, and blameless postmortems. Mentor senior SREs across regions and drive adoption of reliability ownership among mobile engineering teams.

Strategic Enablement: Collaborate with infrastructure and developer productivity teams to integrate mobile builds into reliable CI/CD pipelines.

5+ years relevant experience and a Bachelor's degree, or any equivalent combination of education and experience. Expertise defining and implementing SLIs/SLOs for distributed and client-server systems. Hands-on experience with Datadog or similar platforms for monitoring, alerting, and dashboards. Proven ability to lead on-call rotations, incident response, and postmortems. Strong programming skills in Python, Go, or similar, with working knowledge of Swift or Kotlin for client instrumentation. Experience building automation and internal tools to improve reliability and efficiency. Skilled in integrating CI/CD systems (Harness, Jenkins, Fastlane) for mobile deployments.

Partner with engineering teams to ensure robust monitoring, alerting, and dashboards for critical mobile services. Create and maintain runbooks and playbooks to standardize operational practices and empower teams to self-manage reliability. Lead post-incident reviews, identify areas for improvement, and help implement proactive reliability measures. Collaborate with the Datadog observability team to enhance signal quality and alerting efficiency.

Strong communication and leadership skills with a proven ability to mentor and influence across teams. Strong knowledge of iOS and/or Android performance and reliability challenges. Experience with Bazel, Gradle, or similar build systems. Familiarity with backend reliability and distributed systems concepts. Proven success introducing on-call or observability practices within engineering teams. Experience with large-scale, customer-facing mobile or fintech systems.

Posted 2026-02-10
