DevOps Engineer (Founding Team)
DevOps Engineer (Founding Team)
Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + meaningful equity (founding tier)
Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.
About the Role
We're building an AI-native, multi-tenant enterprise platform for complex domains in industrial verticals. In this architecture, DevOps isn't just about shipping features — it's about operationalizing intelligent agents , ensuring traceability across AI systems , and supporting mission-critical ML infrastructure at scale.
We're looking for a DevOps engineer who can own infrastructure from Day 1 — automating everything from CI/CD and observability to cloud governance and security. You’ll work with a highly technical team building real-time AI pipelines and multi-agent systems. If you want to be the person who makes the platform run — fast, secure, reliable, and explainable — this is your role.
Responsibilities
Build and maintain scalable cloud infrastructure across AWS/GCP/Azure with a focus on secure, tenant-isolated deployments
Own and evolve CI/CD systems (e.g. GitHub Actions, ArgoCD) with progressive rollout, testing, and rollback flows
Establish observability tooling across services, agents, and pipelines (OpenTelemetry, Prometheus, Grafana, Sentry)
Implement policy-as-code (OPA, Rego) for deployment safety, RBAC, audit logging, and approval workflows
Define and enforce SLAs, uptime targets (99.99%+), incident response, and remediation workflows
Secure infrastructure: IAM, VPC, encryption, key management, image scanning, secrets rotation
Automate deployments, infrastructure provisioning (Terraform, Helm), and environment replication
What We’re Looking For
Core Experience:
4–10+ years in DevOps, platform engineering, or SRE in production-grade systems
Strong experience with Docker, Kubernetes (EKS/GKE), Terraform or Pulumi
Hands-on experience deploying and monitoring distributed cloud-native systems
Familiar with GitOps practices, CI/CD design, progressive delivery, and secure SDLC
Clear understanding of how to implement monitoring, alerting, and failure simulation in dynamic environments
Engineering Mindset:
Obsessed with reliability, latency, uptime, and repeatability
Security-aware and compliance-conscious
Proactive — you don’t wait for alerts to fix things
Comfortable collaborating with backend, AI, and data teams
Bonus: Agent-Native / ML Ops Capabilities
We’re building an agentic, AI-native platform from the ground up. Experience here isn’t required, but would be a strong differentiator:
Experience running LLM orchestration frameworks (e.g. LangChain, LangGraph, Dust, ReAct agents)
Building retrieval-augmented generation (RAG) pipelines — and deploying them safely and repeatably
Familiarity with vector DBs (Weaviate, Qdrant, Pinecone) and embedding pipelines
Monitoring and governing long-running or multi-agent chains
Auditability and replay systems for agent decision-making
Serving fine-tuned or open-source LLMs with model versioning and GPU scaling (e.g. vLLM, TGI)
Interest in auto-remediation using agents (e.g. observability + alert → insight → response via LLM)
Why This Role Matters
DevOps is the nervous system of the platform — every agent, every data fabric component, every pipeline flows through what you build. This is a rare opportunity to design that system early, the right way, and future-proof it for scale, compliance, and trust.
If you're excited by intelligent systems, distributed data, and deeply technical infrastructure problems — and you want your work to have immediate real-world impact — we’d love to hear from you.
Recommended Jobs
CD&A - Forecasting Senior Manager, Global Biosimilars and Rare Disease
Join Amgens Mission of Serving Patients At Amgen, if you feel like youre part of something bigger, its because you are. Our shared missionto serve patients living with serious illnessesdrives all …
Senior Pre-Sales Engineer
In our ‘always on’ world, we believe it’s essential to have a genuine connection with the work you do. Our RUCKUS Smart Wi-Fi, ICX switching, IoT, rich AI/ML Analytics software, secure policy user/de…
Software Engineer, Security
Our co-founders started Zip in 2020 to address this seemingly intractable problem with a purpose-built platform that provides a simple, consumer-grade user experience. Within just a few short years, …
Staff Software Engineer
RELOCATION ASSISTANCE: Relocation assistance may be available CLEARANCE TYPE: Secret TRAVEL: Yes, 10% of the Time Description At Northrop Grumman, our employees have incredible opportuni…
Emergency Medicine Physician in La Mesa, CA
Are you an emergency medicine (EM) physician who aims to provide compassionate care to all patients? If so, we want you to partner with TeamHealth and join our team at Sharp Grossmont in La Mesa, Cali…
Customer Service Order Pullers
: ORDER PULLER Job Summary: The Order Puller receives and processes incoming and outgoing orders for materials, and/or merchandise to satisfy customer requests. Supervisory Responsibilities…
Admin Professional
If you're interested in this position, please apply on our careers page here: Careers Hourly Wage: $20.00 - $22.00 Candidate must live in Humboldt County and report to Eureka office Exper…
Software Engineer
IXL Learning, developer of personalized learning products used by millions of people globally, is seeking Software Engineers who have a passion for technology and education to help us add new features…
Senior Business and Financial Operations Specialist
Title: Sr. Business and Financial Operations Specialist Location: El Segundo, CA (On-site, no telework) Clearance Requirement: Active Secret Clearance Position Overview Acquisition Anal…
Certified Medical Assistant
Pacific Skin Institute is in search of a motivated candidate with a team-centered attitude! Candidates must have a passion for medicine and healthcare, helping people obtain the services they need an…