Senior Software Engineer, Observability

Together Ai
San Francisco, CA

About The Role


Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

The AI Infrastructure team at Together AI is at the forefront of building and scaling the foundational systems that power our generative AI platform. The storage and observability team is crucial for designing, implementing, and maintaining robust distributed storage solutions, ensuring seamless data access and management. They are also responsible for developing comprehensive observability platforms, providing critical insights into system performance and GPU utilization, and proactively identifying and resolving issues.

Requirements



  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices

  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources

  • Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members

  • Demonstrated experience with building and operating high-performance and/or globally distributed microservice architectures across one or more cloud providers (AWS, Azure, GCP)

Responsibilities



  • Identify, design, and develop foundational backend services that power Together’s cloud platform

  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure

  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs

  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems

  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance

  • Participate in an on-call rotation to address critical incidents when necessary

About Together AI


Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation


We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $260,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity


Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at

Posted 2025-11-28

Recommended Jobs

AI-first Senior Software Engineer

Fincons.us
Los Angeles, CA

About the Role We are looking for two Senior Software Engineers who are truly AI-first—engineers with strong software architecture fundamentals who know how to effectively use modern AI tools (LLM…

View Details
Posted 2026-01-07

Senior Data Engineer

Complex
Los Angeles, CA

Company And Culture Created in 2002 by Marc Eckō, Complex is a leading global youth entertainment network showcasing the evolution of major pop culture categories, including streetwear and style, …

View Details
Posted 2025-11-25

Litigation Attorney/Law & Motion

Macdonald & Cody, LLP
Irvine, CA

Macdonald & Cody, LLP is an established insurance defense litigation firm specializing in catastrophic personal injury and construction defect cases. We are seeking highly motivated law and motion/li…

View Details
Posted 2025-11-14

Director -- Camp Winacka

Girl Scouts San Diego
San Diego, CA

Are you looking for an amazing opportunity where you can enjoy the great outdoors while helping young women build courage, confidence and character? Are you an outdoor lover who enjoys sharing that e…

View Details
Posted 2025-09-10

Bookkeeper

Roman Catholic Bishop Of San Diego
San Diego, CA

Description Parish: St. Thérèse of Carmel Catholic Church Location: 4355 Del Mar Trails Rd, San Diego, CA 92130 Reports to: Pastor Employment Type: Full time FLSA Status: Non-Exempt…

View Details
Posted 2025-11-19

Senior Data Engineer

Plum Inc
San Francisco, CA

PLUM is a fintech company empowering financial institutions to grow their business through a cutting-edge suite of AI-driven software, purpose-built for lenders and their partners across the financia…

View Details
Posted 2025-11-28

Associate QA Engineer

Veeva Systems
Pleasanton, CA

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in histo…

View Details
Posted 2025-07-31

Dining Room Team Members

Birdsong
San Francisco, CA

Birdsong is a two Michelin–starred restaurant in San Francisco (SoMa), where Chef Chris Bleidorn’s cooking explores heritage cuisine with heart, craft, and a deep sense of place. Now we’re looking f…

View Details
Posted 2025-12-27

Founding Applied AI Engineer

Kastle
San Francisco, CA

About Kastle Kastle is building AI operating system for consumer lending, starting with mortgage. We work with some of America's largest mortgage lenders, helping them scale their contact center and…

View Details
Posted 2026-01-13

Sr. Staff/Principal Software Engineer - SONiC - SAI

Eridu AI
Saratoga, CA

Position Overview   We are seeking a highly experienced Senior Staff Engineer to lead the architecture, development, and integration of OCP SAI (Switch Abstraction Interface) with SONiC (Software for…

View Details
Posted 2025-12-19