Principal Data Engineer

Sanas.ai
Palo Alto, CA

Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track record of creating and steering multiple unicorn companies, our groundbreaking GDP-shifting technology sets a gold standard.

Sanas is a 200-strong team, established in 2020. In this short span, we’ve successfully secured over $100 million in funding. Our innovation have been supported by the industry’s leading investors, including Insight Partners, Google Ventures, Quadrille Capital, General Catalyst, Quiet Capital, and other influential investors. Our reputation is further solidified by collaborations with numerous Fortune 100 companies. With Sanas, you’re not just adopting a product; you’re investing in the future of communication.

Weʼre looking for an experienced and forward-thinking Principal Data Engineer to lead the design and implementation of our end-to-end data infrastructure for industry leading Voice AI products. This is a high impact role where you will shape the technical vision, own strategic architecture decisions, and mentor a growing team of Data engineers focused on delivering reliable and scalable data systems for Machine Learning at scale.

Youʼll work cross-functionally with AI research scientists, Infrastructure and product teams to ensure that data - from raw audio to training-ready features - is consistently accessible, compliant and optimized for speed and scale. Youʼll help push the boundaries of real-time Voice AI!

Key Responsibilities:

  • Architect and lead the development of large scale data pipelines and data lakes to ingest, transform and serve high quality data for AI model training, product telemetry and analytics.
  • Drive long‑term data infrastructure strategy across streaming and batch, feature store extensions, Iceberg/Delta lake choices, metadata management, and lakehouse evolution.
  • Drive platform and infrastructure decisions, optimizing compute fleets (e.g.Ray, Spark clusters), orchestration tooling Airflow, Dagster), and streaming stacks Kafka, Flink)
  • Collaborate with AI research scientists, engineering leads, product, finance, marketing, and legal to align data architecture with business and regulatory requirements.
  • Advocate best practices in data governance, lineage, observability, testing, tooling, and disaster recovery across pipelines and data stores.
  • Act as a mentor and technical leader - review design and code, share patterns, elevate team capability, and support recruitment and hiring
  • Drive build vs buy decisions for tools to implement data quality and observability solutions to achieve high data quality.

Qualifications:

  • 10+ years of experience in Data Engineering, Infrastructure, or ML Systems, with at least 2+ years in a technical leadership capacity.
  • Expertise in building distributed batch and real-time data systems
  • Expertise in Databases (like Postgres) andData Lakes (like Snowflake, Databricks and ClickHouse
  • Experience using Data Processing frameworks like Spark, Flink and Ray
  • Deep Experience with cloud platforms AWS/GCP, object storage (e.g., S3), and orchestrators like Airflow and Dagster
  • Strong knowledge of data lifecycle management, including privacy, security, compliance and reproducibility
  • Comfortable working in a fast-paced startup environment
  • Strategic mindset and proven ability to collaborate across engineering, ML and product teams to deliver infrastructure that scales with the business.

Nice to Have:

  • Familiarity with audio data and its unique challenges, like large file sizes, time- series features, metadata handling, is a strong plus
  • Experience with Voice AI models like ASR, TTS and speaker verification.
  • Familiarity with real-time data processing frameworks like Kafka, Flink, Druid and Pinot
  • Familiarity with ML workflows including: MLOps, feature engineering, model training and inference.
  • Experience with labeling tools, audio annotation platforms, or human-in-the- loop annotation pipelines.

Joining us means contributing to the world’s first real-time speech understanding platform revolutionizing Contact Centers and Enterprises alike.

Our technology empowers agents, transforms customer experiences, and drives measurable growth. But this is just the beginning. You'll be part of a team exploring the vast potential of an increasingly sonic future

Posted 2025-09-22

Recommended Jobs

AutoMotive INTERNET SALES PERSON-Experienced

Envision Motors of Milpitas
Milpitas, CA

Job Description Job Description INTERNET SALES PERSON Responsibilities: - Develop and maintain strong relationships with potential customers through various online platforms. - Conduct vir…

View Details
Posted 2025-07-30

AI Engineer - Reasoning

Xai
Palo Alto, CA

About the Position As a Reasoning engineer, you will build frameworks to improve the reasoning capability, build distributed reinforcement learning systems, techniques for inference time compute (…

View Details
Posted 2025-09-22

Landscape Crew Lead

Kniffings Landscape
El Cajon, CA

Job Description Job Description Kniffing's Landscape &Maintenance first opened our Nursery and Landscape Business in 1980. Our team of gardening experts are knowledgeable and passionate about …

View Details
Posted 2025-07-30

Marketing Coordinator

MD Care Hospice
Oxnard, CA

Job Description Job Description Benefits: Competitive salary Flexible schedule Health insurance Opportunity for advancement Training & development Benefits/Perks Competitive …

View Details
Posted 2025-07-29

Software Engineer L4/L5, Model Serving Systems, Machine Learning Platform

Netflix
Los Gatos, CA

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages…

View Details
Posted 2025-09-22

Bartender

SF Cocktail Club
San Francisco, CA

San Francisco Cocktail Club is Hiring Experienced Hospitality folks to join our roster. We are currently looking to add bartenders to our beverage event team for the busy season. Five years of hospi…

View Details
Posted 2025-09-10

Multiple Positions

Santa Clara, CA

ServiceNow Inc is accepting resumes for the following positions in Santa Clara, CA:  Senior Inbound Product Manager (4124359): Spearhead dvlpmt & execution of our product strategy, collab. w/ stakeho…

View Details
Posted 2025-09-08

Machine Learning Engineer - Autotuning

Zoox
Foster, CA

Zoox is looking for machine learning engineers to help build systems to evaluate and improve autonomous driving behaviors by learning from expert human drivers. Our team develops core technologies to…

View Details
Posted 2025-09-22

Senior Machine Learning Engineer

Metropolis
Los Angeles, CA

The Company Metropolis is an artificial intelligence company that uses computer vision technology to enable frictionless, checkout-free experiences in the real world. Today, we are reimagining par…

View Details
Posted 2025-09-22

Software Developer

Robinhood
Menlo Park, CA

Join us in building the future of finance. Our mission is to democratize finance for all. An estimated $124 trillion of assets will be inherited by younger generations in the next two decades. T…

View Details
Posted 2025-09-22