Staff Software Engineer, Science

Biohub
Redwood City, CA

Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization, with the support of the Chan Zuckerberg Initiative.

The Team


Biohub supports the science and technology that will make it possible to help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem like an audacious goal, in the last 100 years, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.

Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems — paving the way for new discoveries that will change medicine in the decades that follow:


  • Building an AI-based virtual cell model to predict and understand cellular behavior

  • Developing novel imaging technologies to map, measure and model complex biological systems

  • Creating new tools for sensing and directly measuring inflammation within tissues in real time.tissues to better understand inflammation, a key driver of many diseases

  • Harnessing the immune system for early detection, prevention, and treatment of disease

The Opportunity


The Data Pipelines team processes scientific datasets specifically designed to enable biological modeling and supporting AI research. It is responsible for data ETL, data validation, testing, storage, and partners with the data management team for retrieval. We handle over 89 million unique cells worth of single cell transcriptomic data, over 15 thousand cryoET tomograms that are in imaging datasets as large as 20TB and counting, and will be expanding to support larger scale and additional imaging, sequencing, and literature modalities. Our resources provide access to open source data that is structured and used by tens of thousands of scientists each month to quickly query and form hypotheses on understanding how genetic variants in cells impact disease risk, define drug toxicities, and eventually discover better therapies.

As a software engineer on the Data Engineering team, you will contribute for architecture, help implement all the above mentioned data needs for our platforms, CELLxGENE Discover, CryoET, as well as the new platform we are building that has a focus on data for AI and the virtual cell, in order to enable scientists to further interrogate our very large and growing corpus of data without any need to download the data itself or have any computational expertise. You will work on a collaborative, multidisciplinary team to develop solutions for our scientist users to accelerate their workflows and accelerate the pace of scientific discovery.

No prior biology experience is needed for this role. You will have the opportunity to pair with Computational Biologists to develop solutions for our users and be able to learn about biology from experts on our team.

Our tech stack: Python, Terraform, AWS infrastructure, Argo CD and Workflows. TileDB .

What You'll Do



  • Own, maintain and continuously improve upon the data pipeline architecture.

  • Design, build, and maintain robust, scalable data pipelines for ingesting, processing, and storing large volumes of structured and unstructured data.

  • Develop and optimize ETL processes, ensuring data quality, validation, and consistency across diverse sources.

  • Implement and manage data storage solutions, including data warehouses, data lakes, and distributed databases, ensuring secure and performant to handle massive volumes of single-cell transcriptomics data and imaging data.

  • Monitor and troubleshoot data pipelines, build proactive exception handling, and ensure high reliability and uptime of production systems.

  • Document processes, maintain data models, and support data governance, lineage, and compliance initiatives.

  • Utilize modern tools and technologies, such as Argo Workflows, Kubernetes, AWS, Docker, and CI/CD pipelines.

  • Actively contribute to team problem-solving, project planning, and process improvements with a mindset for innovation and social impact.

  • Create user-friendly APIs to enable researchers and scientists to easily access and explore the curated data.

  • Develop scalable, maintainable, and testable software systems and participate in team conversations and efforts on engineering excellence.

  • Collaborate with data scientists, computational biologists, researchers, analysts, and other engineers to understand data requirements and deliver practical solutions that drive analytics, research, and AI/ML applications.

  • Have opportunities to learn about scientific data and technologies, though no prior experience is

What You'll Bring



  • 8+ years of experience as Software Engineer with data building data pipelines.

  • Proficiency in programming languages (Python, Java) and SQL.

  • Experience with big data, AWS(EC2, S3, EKS, IAM, SQS etc), Docker, and Argo Workflows.

  • Strong data modeling, database design, and data integration skills, including ETL and pipeline orchestration tools.

  • Strong fundamentals in systems design, data structures, algorithms, and object oriented programming principles.

  • Experience with CI/CD, data governance, and observability/monitoring tools.

  • Excellent communication, teamwork, and analytical problem-solving abilities.

  • Passion for the CZI mission, innovation, and open, collaborative culture.

  • Computer Science Engineering degree.

  • Strong problem solving and analytical skills.

  • Excellent written and verbal communication skills.

  • Enthusiasm to ramp up on technologies and learn a new science domain.

  • Must be self-driven and comfortable supporting data needs of multiple systems and products.

Nice to Have


  • Experience working with Biology, Imaging or Sequencing data

  • Experience working with data formats related to biodata and solving challenges with that data.

  • Experience building AI Agents related to data movement or ETL.

Compensation


The Redwood City, CA base pay range for a new hire in this role is $214,000 - $294,800. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.

Better Together


As we grow, we’re excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team’s manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.

Benefits for the Whole You


We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.


  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.

  • Paid time off to volunteer at an organization of your choice.

  • Funding for select family-forming benefits.

  • Relocation support for employees who need assistance moving

If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.

#LI-Hybrid

Posted 2026-02-13

Recommended Jobs

Embedded DSP Software Engineer, Senior Staff

Qualcomm
San Diego, CA

Company: Qualcomm Technologies, Inc. Job Area: Engineering Group, Engineering Group Software Engineering General Summary: As a leading technology innovator, Qualcomm pushes the boun…

View Details
Posted 2026-02-04

Senior Autonomy Operations Manager

Nuro
California

Who We Are Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI …

View Details
Posted 2026-01-30

sales associate

Ralph Lauren
La Jolla, CA

Position Overview Essential Duties & Responsibilities Experience, Skills & Knowledge Ralph Lauren will consider for employment qualified …

View Details
Posted 2026-02-03

Technical Support Assistant - Onboarding and Offboarding

Golden Gate Regional Center
San Francisco, CA

Technical Support Assistant - Onboarding and Offboarding Starting Salary Range: $53,481 - $64,177 GGRC is looking to hire a Technical Support Assistant who provides frontline IT support for GGRC…

View Details
Posted 2026-01-15

Accounts Payable/ Payroll Clerk

Atlas Disposal
Rancho Cordova, CA

Accounts Payable /Payroll Clerk  Job Location: Rancho Cordova, CA Salary Range: $25 to $27 per hour Position Overview  Process and Maintain Vendor and Accounts Payable transactions and corres…

View Details
Posted 2026-02-16

Associate Product Manager Program Lead- Project Hire

ESPN
Burbank, CA

About the Role: We are seeking a Program Lead for a 12 month project hire to design, develop, and implement a new Associate Product Management Program that will serve as a strategic talent pipelin…

View Details
Posted 2026-01-24

Project Manager (Wet Utilities)

Gables Search Group
Empire, CA

Project Manager – Wet Utilities / Underground Utilities &##128205; Location: Inland Empire Area, CA Job Summary We are seeking an experienced Project Manager to lead underground wet util…

View Details
Posted 2026-01-15

Staff Embedded Software Engineer (Networking)

RAVE Aerospace LLC
Brea, CA

The Staff Embedded Software Engineer (Networking) is responsible for the architecture, design, and implementation of high-performance networking software for onboard aircraft video systems. This role…

View Details
Posted 2026-01-27

Case Manager

JVS SoCal
Palmdale, CA

Description As part of the JVS So Cal Veterans Service Team, the SSVF Case Manager will be part of a dedicated, specialized, and passionate team focused on improving the lives of veterans experien…

View Details
Posted 2026-01-30

Senior Software Engineer, Collision Avoidance Testing

Nuro
California

Who We Are Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI …

View Details
Posted 2026-01-27