SRE / DevOps Engineer
Title: Sr. SRE / DevOps Engineer
Location: Sunnyvale, CA
Job Description:
Job Summary
We are looking for a Sr. SRE / DevOps Engineer for our Sunnyvale, California location.
As a Site Reliability Engineer, the individual will work closely with multi-functional teams to automate operations, optimize infrastructure, implement security, and solve issues in an exciting, fast-paced environment. The individual will play a vital role in ensuring that the systems are reliable, scalable, and high-performing.
Responsibilities
Ensure system reliability and availability - Monitor system issues, create strategies to detect and address them, design automated troubleshooting systems, and write and review post-mortems.
Mitigate operational risks - Collaborate with development teams and other stakeholders to identify potential risks, perform risk assessments, implement risk mitigation strategies, and continuously monitor and review the effectiveness of those strategies.
Monitor system health.
Minimize emergency response time (MTTR).
Maintain CI/CD pipelines.
Drive continuous improvement by collaborating with various teams.
Automate processes.
Must have/required experience and skills:
8+ years of experience in DevOps and Site Reliability Engineering.
Hands-on with containerization and orchestration: Docker, Kubernetes/EKS.
Proficiency in infrastructure as code tools: Terraform, Ansible, or CloudFormation.
Experience setting up and managing services running on Kubernetes.
In-depth understanding of SRE principles including monitoring, alerting, error budgets, fault analysis, and automation.
In-depth knowledge of monitoring and observability tools: Splunk.
Knowledge of Linux operating system principles, networking fundamentals, and systems management
Demonstrable fluency in at least one of the following languages: Java or Python
Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions.
Experience building and managing CI/CD pipelines that gate production deployments, developing and implementing Git branching strategies, branch protection rules, and network policies, and scaling the load up or down on AWS.
Strong problem-solving and analytical skills
Ability to solve performance and scalability issues in the system.
Technical Skills:
DevOps and SRE
AWS, Kubernetes/EKS, Docker
Terraform, Ansible, or CloudFormation
Splunk, Apache Flink
Programming/Scripting using Java or Python
CI/CD
Databases: Vertica, Snowflake
Behavioral Skills:
Excellent communication and collaboration skills
Ability to propose and implement improvements in the system
Ability to work with cross-functional stakeholders
Adaptability and a willingness to learn new technologies and techniques.
Proactive approach to issues, with the ability to provide prompt resolutions or workarounds.