AI Systems Engineer
At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine in 2022. We've grown from answering 2.5 million questions per day at the start of 2024 to around 20 million daily queries in December 2024. We also offer Perplexity Enterprise Pro, which counts leading companies like Nvidia, the Cleveland Cavaliers, Bridgewater, and Zoom as customers.
To support our rapid expansion, we've raised significant funding from some of the most respected technology investors. Our investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Daniel Gross, Naval Ravikant, Tobi Lutke, and many other visionary individuals. In 2024, our employee base grew nearly 300%, and we're just getting started.
We are looking for an AI Systems Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Responsibilities
- Develop robust APIs for AI inference used by both internal and external customers
- Design, deploy, and maintain scalable, reliable infrastructure for deploying machine learning models
- Benchmark system performance, diagnose bottlenecks, and implement improvements across the inference stack
- Enhance system reliability and observability by integrating modern monitoring and alerting tools
- Respond swiftly to system outages and collaborate across teams to maintain high uptime and performance
Qualifications
- Experience in developing APIs and managing distributed systems
- Strong understanding of Kubernetes and container orchestration
- Experience with deploying reliable, distributed, real-time systems at scale
- High level familiarity with LLM architecture, and the key pieces (Multi-Head, Multi/Grouped-Query, as well as common Layers)
The cash compensation range for this role is $190,000 - $250,000.
Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.
Recommended Jobs
Senior Full Stack Software Engineer
Job Title: Senior Full-Stack Software Engineer Job Summary: Contribute to the design, development, and support of scalable, integrated business systems across web, cloud, and enterprise platfo…
Senior Networking Engineer
We are obsessed with crafting a unique MMO, if you’re up for the challenge, let's make it glorious together! Intrepid Studios’ Ashes of Creation team is looking for a highly motivated and ta…
Lead Alterations Specialist Transform Dresses to Dreams
Bring your passion for precision and bridal fashion to life! David’s Bridal is seeking a Lead Alterations Specialist to oversee fittings, alterations, and a talented team dedicated to making every gow…
Locum Tenens Psychiatry Job CA
This Job at a Glance Job Reference Id: ORD-195426-MD-CA Title: MD Dates Needed: September - Ongoing Shift Type: Day Shift Assignment Type: Clinic Call Required: No Board Ce…
RN Central Utilization Review Nurse - Per Diem
Responsibilities Join the Southwest Healthcare Team! About Us: Creating Health and Harmony, Southwest Healthcare is a comprehensive network of care with convenient hospital and ambulatory…
Wellness Nurse, LVN
Position: Wellness Nurse (Hiring for this position must be approved by VP of Health Services PRIOT TO POSTING) Shifts, Time, and Days: Fulltime/Tuesday - Saturday Pay Range: $37.00 - 39.00 SANTIAN…
Bookkeeper - San Rafael
Position Summary We are seeking a Bookkeeper with to join our team of dedicated professionals. This position will be based fully in-office out of our of San Rafael (CA) office. The Bookkeeper i…
Senior Infrastructure Engineer - Supercomputing
About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the ne…
Machine Learning Engineer, Motion Planning
Woven by Toyota is the mobility technology subsidiary of Toyota Motor Corporation. Our mission is to deliver safe, intelligent, human-centered mobility for all. Through our Arene mobility software p…
Regional Marketing Director - Fixed Term Contract
Redgate Software Redgate creates simple software to help data professionals get the most value out of any database. Our solutions solve complex database management challenges across the DevOps lif…