Software Engineer, Data Infrastructure

Thinking Machines Lab
San Francisco, CA


Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals.

We are a small team of scientists, engineers, and builders who've created some of the most widely used AI products, like ChatGPT, Character.ai, Mistral, PyTorch, OpenAI Gym, Fairseq, and Segment Anything.

About The Role


We're looking for a Staff Software Engineer with deep expertise in Data Infrastructure to help build the systems that power our foundation models.

You'll join a small, high-impact team responsible for architecting and scaling the core infrastructure behind distributed training pipelines, multimodal data catalogs, and intelligent processing systems that operate over petabytes of data.

Infrastructure is critical to us: it's the bedrock that enables every breakthrough. You'll work directly with researchers to accelerate experiments, develop new datasets, improve infrastructure efficiency, and enable key insights across our data assets.

If you're excited by distributed systems, large-scale data mining, open-source tools like Spark, Kafka, Beam, Ray, and Delta Lake, and enjoy building from the ground up, we'd love to hear from you.

What You’ll Do




  • Design, build, and operate scalable, fault-tolerant infrastructure for LLM Research: distributed compute, data orchestration, and storage across modalities.



  • Develop high-throughput systems for data ingestion, processing, and transformation — including training data catalogs, deduplication, quality checks, and search.



  • Build systems for traceability, reproducibility, and robust quality control at every stage of the data lifecycle.



  • Implement and maintain monitoring and alerting to support platform reliability and performance.



  • Collaborate with research teams to unlock new features, improve data quality, and accelerate training cycles.


Required Qualifications




  • Have 5+ years of experience in data infrastructure, ideally supporting ML or research use cases.



  • Are fluent in distributed compute frameworks such as Apache Spark and Ray.



  • Have hands-on experience with Kafka, dbt, Terraform, and Airflow.



  • Have experience building a web crawler.



  • Have extensive experience studying and scaling deduplication, data mining, and search.



  • Are deeply familiar with cloud infrastructure, data lake architectures, and batch + streaming pipelines.



  • Have strong knowledge of file formats and storage systems (e.g., Parquet, Delta Lake, etc.) and how they impact performance and scalability.



  • Are proactive about documentation, testing, and empowering your teammates with good tooling.


Strong Candidates May Also Have




  • 5+ years of industry experience building large-scale distributed systems.



  • Strong proficiency in Python, SQL, and bonus for Rust.



  • Familiarity with performance tuning and memory management in high-volume data systems.



  • Track record of scaling infrastructure and debugging complex systems in production.



  • Excellent communication and collaboration skills.


Logistics




  • Location: This role is based in San Francisco, California.



  • Visa sponsorship: We sponsor visas. While we can't guarantee success for every candidate or role, if you're the right fit, we're committed to working through the visa process together.



  • Benefits: Thinking Machines offers competitive health, dental, and vision benefits, unlimited PTO, paid parental leave, and relocation support as needed.



  • Compensation: Depending on background, skills and experience, the expected annual salary range for this position is $300,000-$350,000 USD.



  • We encourage you to apply even if you do not believe you meet every single qualification.



  • As set forth in Thinking Machines' Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.


Posted 2025-10-31

Recommended Jobs

Senior Data Scientist - Retailer

Faire
San Francisco, CA

About Faire Faire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined, …

View Details
Posted 2025-09-14

Assembly Technician

Express Employment Professionals - Santa Ana
Costa Mesa, CA

Overview We are seeking a dedicated and detail-oriented Assembly Technician to join our dynamic team in Costa Mesa, CA. As an Assembly Technician, you will play a crucial role in the production proc…

View Details
Posted 2025-10-31

Licensed Behavioral Health Clinic Program Director

SAN DIEGO YOUTH SERVICES
San Diego, CA

San Diego Youth Services JOB ANNOUNCEMENT Licensed Behavioral Health Clinic Program Director Do you share our vision to create a world where all youth have equal opportunities to achieve the…

View Details
Posted 2025-10-27

Senior Caregiver

GreatAuPair LLC
San Diego, CA

I am looking for experienced, caring person who is kind, patient and a problem-solver for my 88 year old father with dementia. He can walk, get out of the bed, and eat without help. He is quite indepe…

View Details
Posted 2025-10-27

Frontend Software Engineer

Inductive Bio
Carlsbad, CA

Drug discovery is a design problem. Chemists spend hours each week combining experimental data with their own intuition to design molecules that they believe will be potent against a biological targe…

View Details
Posted 2025-09-14

Applied AI Engineer

Safetykit
San Francisco, CA

We’re inventing the future of B2B SaaS with AI agents. We’re betting on language models and we’re betting on scale. You’ll test new models the day they come out and understand their characteristics b…

View Details
Posted 2025-09-22

Process Engineer

Henkel
Rancho Dominguez, CA

What you´ll do Drive innovation and process optimization initiatives aimed at improving efficiency and reducing cost.  Build relationships and work cross functionally with R&D, Production, Quali…

View Details
Posted 2025-11-03

Bookkeeper Assistant - Account Payable & Receivable

Innovative Metrics
Beverly Hills, CA

Innovative Metrics, a successful innovative online technology company since 2005, is looking for a bookkeeper assistant. As a bookkeeper assistant, you will provide crucial support to our accou…

View Details
Posted 2025-09-14

Sr. Product Manager

Match Group
San Francisco, CA

Our Mission Launched in 2012, Tinder® revolutionized how people meet, growing from 1 match to one billion matches in just two years. This rapid growth demonstrates its ability to fulfill a fundament…

View Details
Posted 2025-10-01

Case Manager I/II - Drop-In Center

Sierra Vista Child & Family Services
Modesto, CA

Brief Description 36-40 hours/ week (Full-Time) Hours Include - Thursdays. 5-7pm and Saturday. 8-12pm Case Managers at the Family Resource Center provide education, group facilitation, and sup…

View Details
Posted 2025-10-31