Platform/Infrastructure Engineer (San Francisco)
About LangChain
At LangChain, our mission is to make intelligent agents ubiquitous. We help developers build mission-critical AI applications across the entire agent development lifecycle. Our open source frameworks LangChain and LangGraph see over 70+ million downloads per month. Developers rely on LangChain for composable integrations and LangGraph for controllable agent orchestration. Our commercial agent platform, consisting of LangSmith and LangGraph Platform, enables teams to build, test, run, and manage agents at scale across their organization.
Founded in 2023, LangChain powers top engineering teams at companies like Replit, Lovable, Clay, Klarna, LinkedIn, and more.
About the role
We are seeking Platform and Infrastructure Engineers with deep expertise in Kubernetes, cloud platforms, and modern deployment technologies to build and maintain the infrastructure that powers AI applications in our cloud and customer environments. You'll architect and operate the critical systems that power our customers' AI observability and deployments, working directly with cutting-edge technologies at the intersection of AI and distributed systems
Design and Scale Infrastructure: Build and maintain scalable, high-throughput infrastructure solutions using Kubernetes, Helm, Docker, and multi-cloud environments (AWS, Azure, GCP) to support flagship SaaS products like LangSmith and LangGraph Platform.
Drive Reliability and Performance: Ensure platform reliability, security, and performance through robust monitoring, alerting, automated recovery systems, and proactive maintenance, including performance tuning and database optimization.
Contribute to Platform Strategy: Influence infrastructure strategy, tooling, and operational practices as the organization scales from startup to enterprise.
Enable Secure, Efficient Operations: Implement security best practices, compliance requirements, and infrastructure cost optimization strategies while architecting for high availability, disaster recovery, and resource efficiency.
Develop Automation and CI/CD Pipelines: Build and optimize CI/CD pipelines, infrastructure as code, and deployment automation strategies to streamline application delivery.
Support Customer Deployments: Create and maintain deployment solutions and monitoring tools for customer-hosted environments, and collaborate with engineering teams on application rollout and support.
Participate in Incident Response: Take part in the on-call rotation with a focus on learning, automation, and continuous improvement of incident response processes.
Document and Evolve Best Practices: Maintain comprehensive infrastructure documentation and stay up to date with emerging technologies and best practices in cloud-native systems.
How to be successful in this role
Experience: 3+ years building and operating production systems at scale
Programming proficiency: Strong hands-on software engineering skills (Python, Go, Rust)
Infrastructure expertise: Deep knowledge of Kubernetes, containerized infrastructure, cloud platforms (AWS, Azure, GCP)
Observability mastery: Hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry or similar)
Proficiency in infrastructure as code tools (Terraform, CloudFormation, etc.)
Database expertise: Production experience with OSS datastores (PostgreSQL, Redis, Kafka)
Experience with CI/CD pipelines and automation tools
Strong communication skills for cross-functional collaboration with other engineers and customers
Nice to Have
Proficiency with analytical databases (e.g. ClickHouse)
Background in high-growth startups
Previous experience in AI/ML infrastructure
Compensation & Benefits
Competitive salary and equity stake for role and stage of company. Commensurate with experience.
Annual salary range: $145,000-$195,000 USD for Senior Engineers
Recommended Jobs
Pediatrics physician San Jose CA
Specialty: Pediatric Physician Location: San Jose, CA Shifts: ASAP - Ongoing M/W/F 8a-5p & T/TH 10-7 Job Details: * Outpatient Clinic * 18-24 patients per day Primary Car…
Linux Software Engineer
Description We’re looking for a Linux Device Driver Software Engineer to develop, integrate, and optimize the Hardware Abstraction Layer (HAL) software for our satellites. You will drive the creatio…
Full-Stack/Backend Engineer
ROLE: FULL-STACK ENGINEER Summary: In search of a dynamic Full-Stack Engineer to take the reins of our product and backend engineering. The ideal candidate is a versatile technologist capable o…
Experienced Caregiver
Embark on a fulfilling journey with Golden Years In Home Senior Care! We're on the lookout for exceptional caregivers, HCA certified, who possess not just the skills but a genuine Heart to serve. If y…
Pantry Cook I - The Beverly Hilton
The iconicBeverly Hilton is looking for aCook I to join the Culinary Team\! Since 1955, the hotel has hosted memorable moments etched in history from dazzling red\-carpet events to celebrity galas and…
Lifestyle Specialist
Ready to Join a Winning Team? At Madonna Gardens Assisted Living and Memory Care , we value individuality and strong team connectivity. Our team members are compassionate, dedicated, and committed …
Senior Product Manager - Hybrid Tables (Menlo Park)
Great companies are being built by amazing teams. Come be a part of it. Where Data Does More. Join the Snowflake team. There is only one Data Cloud. Snowflakes founders started from scratch and d…
Staff Frontend Engineer
This is a senior level role. We're looking for someone who has experience shipping products or platforms end-to-end in a startup environment. What you'll do Architect and build core frontend f…
Alumni Relations Manager
We are a leading company in the beauty industry, committed to creating innovative, high-quality products for hair and skin care. We've been a pioneer in the professional beauty space for decades, prov…
Occupational Therapist - SNF - Concord, CA
Relient Health seeks a caring and compassionate Occupational Therapist (OT)to work in a great Skilled Nursing Facility (SNF)setting. This full-time , permanent position is a chance to make a rea…