Senior Site Reliability Engineer

Tp-link Systems Inc.
Irvine, CA

At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers are constantly innovating, engineering solutions that transform the end user experience with simpler, smarter, and more reliable connectivity.

We're looking for a passionate and experienced  Senior Site Reliability Engineer  to join our team and play a crucial role in ensuring our cloud platform's security, Reliability, scalability, and operational excellence.

 

About Us:

Headquartered in the United States, TP-Link Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked as the world’s top provider of Wi-Fi devices. The company is committed to delivering innovative products that enhance people’s lives through faster, more reliable connectivity. With a commitment to excellence, TP-Link serves customers in over 170 countries and continues to grow its global footprint.

We believe technology changes the world for the better! At TP-Link Systems Inc, we are committed to crafting dependable, high-performance products to connect users worldwide with the wonders of technology. 

 

Embracing professionalism, innovation, excellence, and simplicity, we aim to assist our clients in achieving remarkable global performance and enable consumers to enjoy a seamless, effortless lifestyle. 

Responsibilities:

  • Serve as technical SME for implementing and operating Microservices on Kubernetes cloud-based platforms.
  • Collaborate with the Cloud Technical Development and DevOps teams to deploy services to the Multi-Cloud Platform.
  • Performing Load Tests and Chaos Tests to ensure the scalability and reliability of microservices.
  • Build Observability for Microservices and cloud platforms like AWS, OCI, Azure, and GCP.
  • Write and Execute the Disaster recovery plans in collaboration with the Development and DevOps team.
  • Analyze and resolve production risks caused by insufficient resources, such as node groups, CPU, memory, HPA scheduling, JVM pre-warming, etc.
  • Write and maintain scripts for automation using languages like Python, Go, or Bash.
  • Define and maintain the KPIs (SLA/SLO/SLI) for all cloud microservices with development teams to better understand the business.
  • Create and maintain technical documentation, including architecture diagrams, design documents, and standard operating procedures.
  • Guarantee adherence to security and compliance standards, including ISO27001, SOC2, and GDPR.
  • Lead incident response efforts to troubleshoot and resolve production issues quickly.
  • Perform post-incident analysis to identify root causes and potential workarounds/solutions.
  • Assist with product/technology selection, including implementation of POCs
  • Be fluid and open to change and evolving processes and tools
  • Help to mentor and train less senior members of the team
  • Ability to be part of On-call rotation and provide support after work hours and on weekends.
  • Other duties as assigned
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience as a Site Reliability Engineer.
  • Proficiency in programming and scripting languages like Java, Python, Bash, or PowerShell.
  • Hands-on experience in SRE, DevOps, cloud operations, and cloud security best practices.
  • Strong knowledge of security technologies, including Identity and access management, Network security, Application security, and Data protection.
  • Strong problem-solving and analytical skills, with the ability to work independently and as part of a team.
  • Experience in developing and maintaining technical documentation and implementing compliance requirements.

Additional Skills (Preferred):

  • Expert-level cloud certifications include AWS Solutions Architect, Professional, Azure Solutions Architect Expert, and GCP Professional Cloud Architect.
  • Experience with container orchestration technologies (e.g., Kubernetes).

Base Salary Range: $140,000 - $180,000

  • Competitive salary and comprehensive benefits package.
  • The chance to be part of a growing and innovative company.
  • Engaging and inclusive work culture.
  • The opportunity to be involved in challenging and impactful projects.
Posted 2025-09-22

Recommended Jobs

Staff Software Engineer

Mlabs
Mountain View, CA

Our client is a premier vehicle software supplier that is accelerating the adoption of safe, intelligent machines globally. They are trusted by 18 of the top 20 automakers and serve a wide range of i…

View Details
Posted 2025-09-22

Behavior Technician

Developmental Pathways Inc.
Palmdale, CA

Job Description Job Description Ready to make a difference and start in a professional, meaningful role with Developmental Pathways! Do you have a passion for working with  CHILDREN!? Are you…

View Details
Posted 2025-07-30

Senior Software Engineer in Test

Veeva Systems
Pleasanton, CA

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in histo…

View Details
Posted 2025-07-31

Product Manager (Capital Growth) - Experienced

Parafin
San Francisco, CA

About Us: At Parafin, we’re on a mission to grow small businesses. Small businesses are the backbone of our economy, but traditional banks often don’t have their backs. We build tech that makes…

View Details
Posted 2025-09-14

Senior Data Architect 21605-1

Mondo
Burbank, CA

Apply now: Senior Data Architect, location is Hybrid (Burbank, CA). The start date is August 26, 2025 or two weeks from offer for this 1-year contract position. Job Title: Senior Data Architect …

View Details
Posted 2025-09-02

Nurture New Beginnings in Beautiful Carmichael, CA!

NurseRecruiter
Carmichael, CA

Registered Nurse - Labor & Delivery - Travel - (LD RN) An opportunity for a travel Registered Nurse in Labor and Delivery in Carmichael, California, begins 9/3/2025. The position requires an RN-CA li…

View Details
Posted 2025-08-20

Senior Accounts Payable Specialist

Palantir Technologies
Palo Alto, CA

A World-Changing Company Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our part…

View Details
Posted 2025-09-14

Sales Operations Specialist (On-site Guadalajara, MX)

Driscoll's
Watsonville, CA

About the Opportunity Additional Locations: Mexico-Guadalajara           The Sales Support Specialist is a member of Sales Operations team to support end-to-end process improvements and operation…

View Details
Posted 2025-09-10

Stylist - PT - Bloomingdale's Stanford - US

ALLSAINTS
Palo Alto, CA

Stylist - PT - Bloomingdale's Stanford Palo Alto, California, United States THE ALLSAINTS TEAM At AllSaints we are in the business of feelings - making our customers feel cool a…

View Details
Posted 2025-07-29

American Girl Restaurant Party Planner (Part-Time)

Mattel
Los Angeles, CA

CREATIVITY IS OUR SUPERPOWER.  It’s our heritage and it’s also our future. Because we don’t just make toys. We create innovative products and experiences that inspire fans, entertain audiences an…

View Details
Posted 2025-09-10