Manager, Site Reliability Engineering
Role Description
DTEX Systems is looking for a Manager, Site Reliability Engineering (SRE) to lead operations within the SRE function. This role is responsible for setting priorities aligned to departmental strategies and business goals, executing against key performance metrics, and guiding the professional growth of leaders and individual contributors. The Manager will ensure reliability, scalability, and robustness of DTEX’s technical infrastructure while thoughtfully adopting modern automation and AI-enabled capabilities to improve operational outcomes.
Role & Responsibilities
- Lead operations within the site reliability engineering department, setting priorities based on department strategies and goals.
- Execute on key performance metrics to ensure the reliability, scalability, and robustness of production systems.
- Guide the professional development and achievement of direct reports, fostering a culture of continuous learning, accountability, and operational excellence.
- Oversee the development and implementation of new technologies to enhance system performance, stability, and security.
- Manage teams to resolve system issues effectively and efficiently, ensuring minimal downtime and disruption to business operations.
- Collaborate with Engineering, Product, Security, and Data leaders to align on strategic initiatives and drive cross-functional programs.
- Partner with Product, Security, and Data teams to evaluate and operationalize AI-enabled capabilities such as anomaly detection, predictive monitoring, and intelligent alerting to improve system reliability and performance.
- Drive responsible adoption of AI and automation within the SRE function, ensuring explainability, reliability, security, and appropriate human oversight in production environments.
- Maintain the overall performance, stability, and resilience of the company’s technical infrastructure.
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- 10+ years of experience in site reliability engineering, infrastructure engineering, or a related discipline.
- Proven experience leading and managing high-performing technical teams.
- Strong knowledge of system architecture, infrastructure design, and cloud-based systems.
- Experience with on-premise and public cloud environments (Azure, AWS, VMware, Hyper-V, Cisco).
- Proficiency in Python and Bash (PowerShell a plus).
- Strong experience with Linux and Windows operating systems.
- Deep understanding of networking, firewalls, and security principles.
- Experience with configuration management and infrastructure-as-code tools (Salt, Pulumi; Terraform acceptable).
- Hands-on experience with containerization and orchestration technologies (Docker/Podman, Kubernetes).
- Exceptional problem-solving and troubleshooting skills.
- Ability to communicate clearly, prioritize effectively, and make sound decisions under pressure.
- Working knowledge of AI and machine learning concepts relevant to reliability engineering, including anomaly detection, predictive analytics, pattern recognition, and intelligent automation (hands-on model development not required).
- Experience leveraging AI-assisted tools to improve incident response, root-cause analysis, capacity planning, and operational efficiency.
- Ability to critically evaluate AI solutions, understanding tradeoffs related to data quality, bias, reliability, security, and operational risk.
- Strong judgment in applying AI responsibly within reliability- and security-sensitive environments, maintaining human-in-the-loop decision making.
- Ability to mentor teams and leaders in building AI literacy and automation-first thinking while maintaining high standards for operational excellence.
Why Join DTEX
- Impact at Scale – Drive the growth of a market-leading cybersecurity company.
- Thriving Company Culture – DTEX fosters a values-driven environment prioritizing respect, inclusion, and collaboration.
- Growth & Development – Opportunities for professional advancement and lifelong learning.
- Flexibility – Hybrid or remote work options.
- Comprehensive Benefits – Competitive compensation, equity participation, health and wellness benefits, and generous time-off policies.
DTEX Systems is the global leader in Workforce Cyber Intelligence & Security. Our mission is to safeguard the digital workforce by detecting and mitigating insider risks, preventing data loss, and enabling secure innovation.
We empower organizations to protect their most valuable assets—their people, their data, and their intellectual property—without compromising privacy or trust. Our solutions provide unmatched visibility and context into workforce behaviors, helping enterprises stop insider threats, achieve regulatory compliance, and accelerate digital transformation securely.
Our ideal customers include large, security-conscious organizations across financial services, critical infrastructure, technology, defense, and healthcare—where protecting sensitive data and ensuring compliance are mission-critical.
Joining DTEX means joining a passionate team working at the intersection of cybersecurity, intelligence, and trust. Together, we’re redefining how organizations protect their future.
DTEX Systems is proud to provide equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, gender, religion, sex, national origin, age, disability, or genetics.
Base salary range (SF Bay Area): $180k-$240k.
Exact compensation may vary based on skills, experience, and location.