Software Engineer - Data Platform | NYC, Seattle, SF
About Perplexity
Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world’s leading AI platforms. Perplexity has raised over $1B in venture investment from some of the world’s most visionary and successful leaders, including Elad Gil, Daniel Gross, Jeff Bezos, Accel, IVP, NEA, NVIDIA, Samsung, and many more. Our objective is to build accurate, trustworthy AI that powers decision-making for people and assistive AI wherever decisions are being made. Throughout human history, change and innovation have always been driven by curious people. Today, curious people use Perplexity to answer more than 780 million queries every month–a number that’s growing rapidly for one simple reason: everyone can be curious.
About the Role
Perplexity is looking for experienced Data Infrastructure Engineers to design, build, and scale the foundational data systems that power our product, AI research, analytics, and decision-making at scale.
In this role, you will develop and own critical infrastructure for batch and streaming data processing , data orchestration , reliability , and developer experience across the data stack. You’ll work closely with engineering and data science teams to ensure data is accurate, timely, discoverable, and trustworthy , while enabling teams to move fast without sacrificing correctness or scale.
This is a high-impact, senior/staff-level role where you will shape architecture, set standards, and drive long-term technical direction for Perplexity’s data ecosystem.
What You’ll Do
Data Platform & Pipelines
Design and operate large-scale batch and streaming data pipelines supporting product features, AI training/evaluation, analytics, and experimentation.
Build and evolve event-driven and streaming systems (e.g., Kafka/Kinesis/PubSub-style architectures) for real-time ingestion, transformation, and delivery.
Own batch processing frameworks for backfills, aggregations, and offline computation.
Orchestration & Reliability
Lead the design and operation of data orchestration systems (e.g., Airflow, Dagster , or equivalent), including scheduling, dependency management, retries, SLAs, and observability.
Establish strong guarantees around data correctness, freshness, lineage, and recoverability .
Design systems that handle scale, partial failure, and evolving schemas.
Platform & Developer Enablement
Build self-serve data platforms that empower engineers, data scientists, and analysts to safely create and operate pipelines.
Improve developer experience for data work through better abstractions, tooling, documentation, and paved paths.
Set standards for data modeling, testing, validation, and deployment.
Architecture & Leadership
Drive architectural decisions across data infrastructure for storage, compute, orchestration, and APIs.
Partner closely with engineering and data science teams to align data systems with evolving requirements.
Mentor engineers, review designs, and raise the technical bar across the organization.
What We’re Looking For
Minimum Qualifications
5+ years (Senior) or 8+ years (Staff) of software engineering experience.
Strong experience building production data infrastructure systems.
Hands-on experience with batch and/or streaming data processing at scale.
Deep familiarity with data orchestration systems (Airflow, Dagster, or similar).
Proficiency in Python and at least one additional backend language (Go, TypeScript, etc.).
Strong systems thinking: you understand tradeoffs across reliability, latency, cost, and complexity.
Experience supporting ML/AI workflows, training pipelines, or evaluation systems.
Familiarity with data quality, lineage, observability, and governance tooling.
Prior ownership of internal platforms used by many teams.
Recommended Jobs
Accounts Payable Clerk
About the Role We are seeking a detail-oriented Accounts Payable Clerk to handle the accurate, timely processing of client-related expenses for a U.S.-based personal injury law firm. Thi…
Product Manager, LangSmith
About LangChain At LangChain, our mission is to make intelligent agents ubiquitous. We provide the agent engineering platform and open source frameworks developers need to ship reliable agents fast.…
Senior Software Engineer (Build Tools and Automation)
Turning Space into a Transportation Layer for Earth Who We Are : Eras of humanity can often be defined by a dominant transportation mode - horse drawn chariots, ocean going boats, or aircraf…
Data Infrastructure Engineer
About the Team You’ll join the team that’s behind OpenAI’s data infrastructure that powers critical engineering, product, alignment teams that are core to the work we do at OpenAI. The systems we su…
Service Sales Account Manager
Service Sales Account Manager Location Sacramento, CA (Natomas Crossing area) : Overview: Grow with us! HCI Systems, Inc is hiring and seeking enthusiastic Service Sales Account Managers to join …
SR Agent, CMP
We Put the World on Vacation Travel + Leisure Co. is the world’s leading vacation ownership and travel membership company, with a dynamic and growing portfolio of resort, travel club, and lifestyl…
Senior Software Engineer - Backend
About Hive Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organization…
AI Engineer & Researcher - Multimodal
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on eng…
Test Automation & QA Systems Intern
Zoox’s internship program provides hands-on experiences with state of the art technology, mentorship from some of the industry's brightest minds, and the opportunity to play a part in our success. In…
Mental Health Specialist
Position Compensation: $27.18/hr - 38.40/hr (Part-Time, 20 hours) JOB PURPOSE: Provides mental health services to support social emotional development of enrolled children and their families thr…