Senior Software Engineer, Observability
Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.
About This Role:
We’re seeking a Senior Software Engineer to play a key role on our Observability team within the Cloud Infrastructure organization. This team owns the real-time observability platforms that underpin visibility, reliability, and operational insight across our cloud and data center infrastructure.
What You’ll Be Working On:
Maintain and manage core observability tools, including platforms for metrics, events, logs and tracing.
Develop and operate data pipelines to move telemetry data from various sources to backend storage.
Manage large-scale data ingestion and storage requirements for high-volume environments.
Perform regular updates and software enhancements to ensure system stability and security.
Participate in a standard on-call rotation to address production issues and perform root cause analysis.
Work with other engineering teams to implement monitoring best practices and standardized tooling.
Contribute to the long-term technical roadmap for the company's internal infrastructure.
What You’ll Bring to the Team:
5+ years of experience in software or systems engineering.
Proficiency in Java or Go or Python for writing production-level code.
Practical experience managing Kubernetes clusters in a production environment.
Experience deploying and managing services using Helm and YAML-based configurations.
Ability to troubleshoot and resolve issues within distributed system architectures.
Experience participating in an on-call rotation for business-critical systems.
Bonus Points:
Experience with common observability tools such as Prometheus, Grafana, Loki, ClickHouse or Elasticsearch.
Familiarity with Kafka or similar message queuing systems.
Experience using Terraform for infrastructure provisioning.
Knowledge of OpenTelemetry standards.
Familiarity with GPU-based infrastructure or machine learning workloads.
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300/month
Compensation Range
Compensation will be paid in the range of up to $172,000 -$209,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
Recommended Jobs
Deli Cook
Position Title: Deli-Cook -Part time (Could lead to Full Time) Department: Market Report To: Market Manager and Market Supervisor Wage: $17.00 DOE Position Summary: The primary focus…
Software Engineer, Site Reliability
As a Software Engineer on the Infra Reliability team you will drive the evolution of our systems, ensuring they meet the highest standards of performance, reliability, and efficiency. You’ll coll…
Field Operations Supervisor - Lancaster, CA
Location: Onsite: Lancaster, CA Location Status: Work will be primarily performed at a designated field worksite location based out of a central Race Communications worksite. Occasional travel to …
Endodontist
Endodontist Opportunity – San Diego, CA About the Practice This is a rapidly growing startup practice with a hungry, growth-minded, and service-oriented team at every level. The practice has expe…
Plumbing Technician
Rooter Hero is Hiring a Plumbing Technician! Location: San Diego, CA Employment Type: Full-Time Pay Rate: $17.75 – $25.00 per hour Schedule: Full-Time, Day Shift Overview Roo…
Senior Software Engineer, Luau App Foundations
As Senior Software Engineer on the Consumer Frontend team, you will leverage the Roblox tech stack and tools to build groundbreaking experiences that push the boundaries of what is possible on the …
Medical Director (Monterey)
Purrfurably Cats is searching for a skilled veterinarian to lead our feline-exclusive practice in Monterey, California. Role and experience: As Dr. Kathleen Marcus plans to reduce he…
Senior Bioinformatics Scientist (South San Francisco, CA)
We are seeking a highly motivated and innovative Senior/Staff Bioinformatics Scientist to join our R&D team focused on synthetic biology product development. In this role, you will work at the int…
Mechanical Engineer, Autonomous Vehicle
Who We Are Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI …
Area Access Executive
Company Description AbbVie's mission is to discover and deliver innovative medicines and solutions that solve serious health issues today and address the medical challenges of tomorrow. We striv…