Senior Software Engineer, Observability

Together Ai
San Francisco, CA

About The Role


Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

The AI Infrastructure team at Together AI is at the forefront of building and scaling the foundational systems that power our generative AI platform. The storage and observability team is crucial for designing, implementing, and maintaining robust distributed storage solutions, ensuring seamless data access and management. They are also responsible for developing comprehensive observability platforms, providing critical insights into system performance and GPU utilization, and proactively identifying and resolving issues.

Requirements



  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices

  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources

  • Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members

  • Demonstrated experience with building and operating high-performance and/or globally distributed microservice architectures across one or more cloud providers (AWS, Azure, GCP)

Responsibilities



  • Identify, design, and develop foundational backend services that power Together’s cloud platform

  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure

  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs

  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems

  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance

  • Participate in an on-call rotation to address critical incidents when necessary

About Together AI


Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation


We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $260,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity


Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at

Posted 2025-11-13

Recommended Jobs

Custodian

Planet Fitness
Napa, CA

Job Summary The Custodian will be responsible for the overall cleanliness of all areas of the facility to ensure a positive member experience. Essential Duties and Responsibilities Thoroughl…

View Details
Posted 2025-08-18

Quality Assurance Engineering

Peraton
San Diego, CA

Program Overview About The Role Peraton is seeking a Quality Assurance Engineer to support our Tactical Receive Segment (TRS) Program contract at the Naval Information Warfare Center Pacific …

View Details
Posted 2025-11-08

Engineer, Biotech Downstream Process Equipment Maintenance - (JP14514)

3 Key Consulting
Thousand Oaks, CA

Job Title: Engineer, Biotech Downstream Process Equipment Maintenance - (JP14514) Location: Thousand Oaks, CA. 91320 Employment Type: Contract Business Unit: F&E Drug Substance Supply. …

View Details
Posted 2025-09-10

Senior Applied ML Engineer

Macroscope
San Francisco, CA

About Macroscope Macroscope aims to be the source of truth of what's happening for any company that builds software. Our mission is to give leaders clarity and engineers time. We help leaders und…

View Details
Posted 2025-09-28

Senior Accountant

Stampli
Mountain View, CA

Join the growing team at Stampli as a Senior Accountant and take charge of managing all accounting and payroll functions in the United States. We are seeking a talented individual who possesses excep…

View Details
Posted 2025-09-28

310T Truck & Coach Mechanic

Ontario, CA

Job description: 310T Truck & Coach Mechanic $46/hr full-time permanent position A leading construction company based in Toronto with a legacy spanning nearly a century is seeking a skille…

View Details
Posted 2025-11-09

Founding Backend Engineer

Plaud Ai
San Francisco, CA

ABOUT Plaud.ai Plaud is building the world's most trusted AI work companion for professionals to elevate productivity and performance through note-taking solutions, loved by over 1,000,000 users wor…

View Details
Posted 2025-09-22

Embedded Software Engineer

Beta-bionics-inc
Irvine, CA

About Beta Bionics Beta Bionics, Inc. is a medical technology company dedicated to bringing innovative type 1 diabetes management solutions to the many, not the few. We are committed to bringing b…

View Details
Posted 2025-09-22

Semi Truck Driver

Veritas Quantitative Services
Glendale, CA

Job Description Job Description We are seeking a Semi Truck Driver to join our team! You will be responsible for safely operating a truck with a capacity of at least 26,000 pounds Gross Vehicle W…

View Details
Posted 2025-07-29

Systems Engineering Intern

zoox
Foster, CA

Summary: Zoox’s internship program provides hands-on experiences with state of the art technology, mentorship from some of the industry's brightest minds, and the opportunity to play a part in our…

View Details
Posted 2025-11-03