Senior Software Engineer, Reliability
We design high-performance, low-latency, high-throughput services, promote best practices, and engage in architectural design to embed reliability into every layer of our products. We seek your expertise in distributed systems, resilience engineering, and large-scale production operations to identify gaps, design and build solutions, and guide product teams toward building highly available and resilient services. Your work will directly strengthen our SRE strategy, operational excellence, system performance, and reliability culture.

We are seeking innovative problem-solvers who are passionate about large-scale distributed systems and eager to grow their skills in modern SRE practices. As a small team tackling complex challenges at scale, we offer the opportunity to make significant technical contributions while driving observability culture across the organization.

5+ years of working experience designing, developing, and operating large-scale, customer-facing products or services
Experience coding in higher-level languages (e.g., Java, Scala, Go, Python) is preferred
A strong interest in solving challenging problems using innovative and data-driven approaches
An SRE-centric mindset: you build and manage systems with reliability, scalability, availability, and security as core principles
Experience designing complex systems and frameworks using proven system design principles, such as NALSD (Non-Abstract Large System Design) methodologies
Experience troubleshooting issues across distributed Linux environments, with comfort tracing problems across applications, systems, and networks
Proficiency with modern cloud technologies such as GCP, AWS, and Kubernetes
Experience with service observability practices and tools (e.g., Prometheus, OpenTelemetry, SignalFx, or similar)
Comfort learning new software, frameworks, and APIs quickly and effectively
A natural collaborator who inspires others, mentors junior engineers, and drives technical excellence
Bonus: Familiarity with PHP/JavaScript/NodeJS

You will continually develop automations, frameworks, and tools for better platform reliability, resilience, and availability
You will collaborate with other engineers on the team, as well as cross-functionally, to foster solid software engineering principles and represent our engineering values
You will participate in various POCs on new projects and frameworks being evaluated for the product and platforms
You will improve our observability, both as a developer and maintainer of systems and frameworks and as a mentor to our product development teams
You will work with modern cloud-native technologies, including container orchestration (Kubernetes, Docker), service mesh solutions (Istio, Linkerd), and cloud platforms (AWS, GCP)
You will participate in product design reviews and architectural discussions to ensure reliability is considered early in the development lifecycle of products and services
You will participate in a team on-call rotation