Software Engineer, Data Infrastructure

Evolutionary Scale
San Francisco, CA

Who we are


EvolutionaryScale’s mission is to develop artificial intelligence to understand biology for the benefit of human health and society, through open, safe, and responsible research, and in partnership with the scientific community. Over the next ten years AI will transform biological design, making molecules and entire cells programmable. We will develop the foundation models for biology that enable this.

The EvolutionaryScale team is based in San Francisco and New York. We believe in flexibility around work schedules and locations, but expect that our team members will work half of the days or more of most weeks from one of our offices.

What you’ll do


As a Data Infrastructure Engineer, you will work closely with bioinformatics and research teams to ensure our data jobs are reliable, efficient, and scalable. You'll implement best practices for handling large-scale data processing, select and integrate the right technologies, and drive continuous improvements in performance and quality of our data sets.

The role



  • Design, develop, and maintain large-scale batch processing pipelines using tools like Spark and Ray, for acquiring biology datasets.

  • Manage data infrastructure components to ensure robust and fault-tolerant operations.

  • Optimize data ingestion, storage, and retrieval processes for acquiring large and growing biology datasets, and for efficient pre and post training data ingestion.

  • Create systems for easy and reproducible data evaluation and experiments.

  • Integrate modern ML based data curation technologies with data processing pipelines.

  • Work with researchers and other engineering teams to understand data needs, create solutions that meet modeling requirements.

Preferred qualifications


Apply even if you don’t meet all of these!


  • Staff level engineers with 5+ years experience highly preferred

  • Proven experience with large-scale data processing systems using technologies such as Hadoop, Spark, or Ray.

  • Knowledge of streaming data frameworks like Kafka Streams, Spark Streaming, or Flink.

  • Understanding of data processing principles and best practices.

  • Strong problem-solving skills, including the ability to research, debug, and resolve complex technical problems.

  • Experience with major cloud providers (AWS, GCP, or Azure), including familiarity with data warehousing tools is a plus.

  • Knowledge of biology and biology datasets is a big plus but not required.

  • Experience with large scale distributed systems or machine learning is also not required but a plus.

Posted 2025-08-22

Recommended Jobs

Entry Level Nonprofit Canvasser for PBS and NPR - $21/hr

Donor Development Strategies
San Diego, CA

Job Description Job Description Public Media Canvassers Wanted! Do you value public media and the role it plays in our society? Join our team in saving PBS & NPR ! San Diego's most trustworthy…

View Details
Posted 2025-07-30

Sr. Penetration Tester (Android)/Mobile Tester

Focuskpi Inc.
Mountain View, CA

FocusKPI is looking for a Sr. Penetration Tester (Android)/Mobile Tester to join one of our clients, a high-tech SaaS company.    The client is looking for a Sr. Penetration Tester (Android) who w…

View Details
Posted 2025-08-22

R&D Test Engineer

Mainspring Energy
Menlo Park, CA

Company Overview Mainspring Energy is revolutionizing power generation with the world’s most flexible and adaptable onsite power generator, the Mainspring Linear Generator. Commercial, industrial,…

View Details
Posted 2025-08-20

Director - Clinical Operations Strategy

Veeva Systems
San Francisco, CA

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in histo…

View Details
Posted 2025-07-31

Arcade| Associate Product Manager - Generative AI

Heretic
San Francisco, CA

About Arcade Arcade is the world’s first AI product marketplace, enabling anyone to design, purchase, and sell custom, manufacturable products with a simple text prompt. Co-founded by Mariam Nafic…

View Details
Posted 2025-08-20

Software Engineer - Backend

Anon
San Francisco, CA

About Anon Anon is building the platform that enables enterprises to deploy authenticated AI agents for their most critical data workflows. We're solving one of the biggest challenges in enterpri…

View Details
Posted 2025-08-20

Temporary Museum Office Manager

UC Irvine Health
Irvine, CA

Temporary Museum Office Manager Location Irvine, CA : Updated: Nov 8, 2025 Location: Irvine-Campus Job Type: Department: Temporary Employment Services Job Opening ID: 60904 Reports To: UCI Tempora…

View Details
Posted 2025-08-22

Associate Veterinarian - San Diego County, CA - #7223

thevetrecruiter.com
El Cajon, CA

Associate Veterinarian - San Diego County, CA - #7223 We are seeking an Associate Veterinarian to join our friendly team of pet lovers! We've been serving the pets of our community for nearly 40 yea…

View Details
Posted 2025-07-30

Associate Civil Engineer - Water Resources

Techoundsllc
Riverside, CA

Title: Associate Civil Engineer – Water Resources Location:  Riverside, CA  92506 Salary: $10000,000 - $125,000 along with a lucrative bonus program, 9/80 schedule, hybrid work schedule (one day…

View Details
Posted 2025-07-30

Software Engineer

Jobs Board
Mountain View, CA

About Applied Intuition Applied Intuition is a vehicle software supplier that accelerates the adoption of safe and intelligent machines worldwide. Founded in 2017, Applied Intuition provides a sim…

View Details
Posted 2025-08-22