Senior Site Reliability Engineer Cloud Platform
Zilliz is a fast-growing startup developing the industry’s leading vector database company for enterprise-grade AI. Founded by the engineers behind Milvus, the world’s most popular open-source vector database , the company builds next-generation database technologies to help organizations quickly create AI applications. On a mission to democratize AI, Zilliz is committed to simplifying data management for AI applications and making vector databases accessible to every organization.
What you will do:
- Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms.
- Ensure the reliability, availability, and performance of Zilliz’s distributed database systems.
- Develop and implement strategies for monitoring, incident management, and disaster recovery.
- Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention.
- Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness.
- Collaborate with software engineers to enhance system reliability, scalability, and performance.
- Maintain and improve the CI/CD pipeline to ensure smooth and rapid deployment of changes.
- Actively contribute to the Milvus Vector Database open-source community, focusing on improving reliability and operational efficiency.
What we are looking for:
- 4+ years of experience in site reliability engineering or similar roles with a focus on cloud-native systems.
- Proficiency in scripting languages such as Python, Go, or Java.
- Strong knowledge of container orchestration technologies like Kubernetes and Docker.
- Expertise with cloud platforms such as AWS, GCP, or Azure, and their respective monitoring and management tools.
- Experience with infrastructure as code tools such as Terraform or Ansible.
- Familiarity with CI/CD tools such as Jenkins, GitLab CI, or Argo.
- Proven ability to troubleshoot complex distributed systems and resolve issues promptly.
- Bachelor’s degree or above in computer science, software engineering, or other relevant disciplines.
- Ability to thrive in a fast-paced, startup environment and handle multiple projects simultaneously.
- Experience with Open Source Milvus Vector Database is nice to have
$175,000 - $225,000 a year
Zilliz is an Equal Opportunity Employer and welcome people from all backgrounds, experiences, abilities, and perspectives. All qualified applicants will receive consideration for employment regardless of race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.
Recommended Jobs
Data Analyst
Job Description: Short Description: Client seeks to hire a Data Analyst resource to provide data analysis and management support. Must have at least 10 years overall in Data Analyst experie…
Parent Educator - Part Time
Position Compensation: $ 17.50- $25.00 per hour GENERAL INFORMATION: PACE Education provides high quality early childhood education with case management support services from birth to five years of …
Founding AI Agent Engineer
Our client is a fast-growing, well-capitalized startup at the intersection of consumer hardware and AI. They are on a mission to revolutionize golf by making simulation and learning accessible to eve…
Mid-Level Backend Engineer
Job brief Mid-Level Backend Engineer Location: 100% Remote A Bit About Us: Altela is on a mission to revolutionize water treatment solutions with the latest software and technology. …
Senior Data Engineer
About Us Move money. Make money. Finix is a full-stack acquirer processor, empowering businesses of all sizes with flexible, modern payment solutions. Processing billions of dollars annually, Fini…
Software Engineer
The Swift Group is a privately held, mission-driven and employee-focused services and solutions company headquartered in Reston, VA. Our capabilities include Software Development, Engineering & IT, …
Member of Technical Staff, Machine Learning Engineer
What You'll Work On At Reinforce Labs, we partner directly with customers to build AI systems that enhance the safety and reliability of their complex, high-impact applications. In this role, you'll…
Principal Business System Analyst
Key Responsibilities: Implement, configure, and customize Oracle Fusion Financials applications to meet CSG requirements. Provide ongoing support and troubleshooting for Oracle Fusion Financial…
Manager, Social Work and Behavioral Health
Manager, Social Work and Behavioral Health San Jose, CA 110-140K This organization helps seniors stay in their homes and communities by providing comprehensive medical care and community-base…
Bookkeeper
Benefits: ~401(k) ~ Health insurance ~ Paid time off Benefits/Perks Competitive Compensation Paid Time Off Career Growth Opportunities Job Summary We are looking fo…