Software Engineer, Data Infrastructure
Who we are
EvolutionaryScale’s mission is to develop artificial intelligence to understand biology for the benefit of human health and society, through open, safe, and responsible research, and in partnership with the scientific community. Over the next ten years AI will transform biological design, making molecules and entire cells programmable. We will develop the foundation models for biology that enable this.
The EvolutionaryScale team is based in San Francisco and New York. We believe in flexibility around work schedules and locations, but expect that our team members will work half of the days or more of most weeks from one of our offices.
What you’ll do
As a Data Infrastructure Engineer, you will work closely with bioinformatics and research teams to ensure our data jobs are reliable, efficient, and scalable. You'll implement best practices for handling large-scale data processing, select and integrate the right technologies, and drive continuous improvements in performance and quality of our data sets.
The role
- Design, develop, and maintain large-scale batch processing pipelines using tools like Spark and Ray, for acquiring biology datasets.
- Manage data infrastructure components to ensure robust and fault-tolerant operations.
- Optimize data ingestion, storage, and retrieval processes for acquiring large and growing biology datasets, and for efficient pre and post training data ingestion.
- Create systems for easy and reproducible data evaluation and experiments.
- Integrate modern ML based data curation technologies with data processing pipelines.
- Work with researchers and other engineering teams to understand data needs, create solutions that meet modeling requirements.
Preferred qualifications
Apply even if you don’t meet all of these!
- Staff level engineers with 5+ years experience highly preferred
- Proven experience with large-scale data processing systems using technologies such as Hadoop, Spark, or Ray.
- Knowledge of streaming data frameworks like Kafka Streams, Spark Streaming, or Flink.
- Understanding of data processing principles and best practices.
- Strong problem-solving skills, including the ability to research, debug, and resolve complex technical problems.
- Experience with major cloud providers (AWS, GCP, or Azure), including familiarity with data warehousing tools is a plus.
- Knowledge of biology and biology datasets is a big plus but not required.
- Experience with large scale distributed systems or machine learning is also not required but a plus.
Recommended Jobs
Entry Level Nonprofit Canvasser for PBS and NPR - $21/hr
Job Description Job Description Public Media Canvassers Wanted! Do you value public media and the role it plays in our society? Join our team in saving PBS & NPR ! San Diego's most trustworthy…
Sr. Penetration Tester (Android)/Mobile Tester
FocusKPI is looking for a Sr. Penetration Tester (Android)/Mobile Tester to join one of our clients, a high-tech SaaS company. The client is looking for a Sr. Penetration Tester (Android) who w…
R&D Test Engineer
Company Overview Mainspring Energy is revolutionizing power generation with the world’s most flexible and adaptable onsite power generator, the Mainspring Linear Generator. Commercial, industrial,…
Director - Clinical Operations Strategy
Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in histo…
Arcade| Associate Product Manager - Generative AI
About Arcade Arcade is the world’s first AI product marketplace, enabling anyone to design, purchase, and sell custom, manufacturable products with a simple text prompt. Co-founded by Mariam Nafic…
Software Engineer - Backend
About Anon Anon is building the platform that enables enterprises to deploy authenticated AI agents for their most critical data workflows. We're solving one of the biggest challenges in enterpri…
Temporary Museum Office Manager
Temporary Museum Office Manager Location Irvine, CA : Updated: Nov 8, 2025 Location: Irvine-Campus Job Type: Department: Temporary Employment Services Job Opening ID: 60904 Reports To: UCI Tempora…
Associate Veterinarian - San Diego County, CA - #7223
Associate Veterinarian - San Diego County, CA - #7223 We are seeking an Associate Veterinarian to join our friendly team of pet lovers! We've been serving the pets of our community for nearly 40 yea…
Associate Civil Engineer - Water Resources
Title: Associate Civil Engineer – Water Resources Location: Riverside, CA 92506 Salary: $10000,000 - $125,000 along with a lucrative bonus program, 9/80 schedule, hybrid work schedule (one day…
Software Engineer
About Applied Intuition Applied Intuition is a vehicle software supplier that accelerates the adoption of safe and intelligent machines worldwide. Founded in 2017, Applied Intuition provides a sim…