Software Engineer
EMPLOYER: MongoDB, Inc.
Job ID: 9509431
Salary Range: $198,000 – $257,000/year
TITLE:Software Engineer
Job Description: Design, implement, and maintain highly scalable, low-latency backend systems to serve and infer upon AI models using Python and other programming languages (such as Go, C++, or Rust), ensuring millisecond-level latency while handling tens of thousands to millions of requests per second. Engineer complex distributed systems that operate on datasets with billions of records, integrating advanced GPU autoscaling algorithms to dynamically allocate GPU resources for AI workloads and employing sophisticated load balancing strategies to optimize throughput and minimize latency. Deploy and manage applications seamlessly across multiple cloud environments (AWS, GCP, and Azure), utilizing Docker, Kubernetes, and Helm for containerization and orchestration. Implement robust CI/CD pipelines and employ observability tools like Prometheus and Grafana to continuously monitor performance, reliability, and resource utilization of large-scale, production-grade inference platforms. Must appear in office 3 days per week; WFH permissible 2 days per week.
Requirements: Master’s degree or foreign degree equivalent in Computer Science, or related field and two (2) years of experience in the job offered or related role.
Experience and/or education must include:
2 years of experience with Python specifically applied to building large-scale distributed backend systems handling tens of thousands to millions of requests per second and maintaining millisecond-level latency for AI model inference;
2 years of experience with Docker and Kubernetes, including advanced creation and configuration of Helm charts to deploy and manage large-scale, GPU-accelerated inference servers in multi-cloud environments;
2 years of experience with Linux systems and multi-cloud infrastructures (AWS, GCP, and Azure), including expertise in provisioning and scaling resources across multiple regions and platforms to ensure consistent, low-latency AI service delivery;
2 years of experience implementing and optimizing distributed scheduling algorithms, including GPU autoscaling logic to dynamically allocate compute resources, address race conditions, mitigate deadlocks, and ensure multi-server consistency in high-throughput AI inference pipelines;
2 years of experience designing and maintaining gRPC and RESTful APIs, ensuring efficient, secure, and backward-compatible service contracts that meet strict latency and availability requirements at scale;
2 years of experience with streaming and messaging platforms including Kafka and RabbitMQ, architecting ingestion pipelines to handle billions of data points, enabling rapid data access and real-time model updates;
2 years of experience employing large-scale NoSQL and SQL data stores (DynamoDB or BigQuery) to manage, query, and analyze billions of records supporting AI models, ensuring optimal performance and cost-efficiency under sustained heavy load;
2 years of experience optimizing GPU-accelerated model inference using frameworks including PyTorch, CUDA, and TensorFlow, reducing inference time and improving throughput by tuning GPU kernel operations, optimizing memory transfers, and streamlining data pipelines for large-scale production; and
2 years of experience implementing production-grade feature stores and batch job scheduling frameworks, creating reliable feature retrieval endpoints and orchestrating large-scale batch data processing tasks to support continuous improvement of machine learning model inputs.
JOB SITE: 499 Hamilton Avenue, Palo Alto, CA 94301; Must appear in office 3 days per week; WFH permissible 2 days per week.
CONTACT: Please email resume to [email protected] and reference Job ID 9509431
Recommended Jobs
Direct Mail Machine Operator
Opening its doors over 45 years ago, FSSI is a leading document outsourcing company servicing Fortune 500 companies in the financial, banking, insurance and billing industries across the U.S. FSSI…
Human Resources Analyst (20721949)
Location Claremont, 91711 Description *Salary range for these positions are under review.* $2500 SIGN-ON BONUS AVAILABLE! Join Tri-City's transformation as we modernize HR systems an…
Superintendent/Senior Superintendent (Richmond)
Largest Family Owned Contractor in the Bay Area - Continuing to grow and flourish in the healthcare, education sectors and more! This Jobot Job is hosted by: Joseph Salmeri Are you a fit? Easy A…
Account Executive
Company Description AbbVie's mission is to discover and deliver innovative medicines and solutions that solve serious health issues today and address the medical challenges of tomorrow. We striv…
E-Commerce Manager
Dunlop Manufacturing, Inc., a leading manufacturer of accessories for the music industry, is seeking an experienced and strategic E-Commerce Manager to join our team in Benicia, California. As a r…
Principal Machine Learning Engineer
We help the world run better At SAP, we keep it simple: you bring your best to us, and we'll bring out the best in you. We're builders touching over 20 industries and 80% of global commerce, and w…
Retail Learning Coach
The Retail Learning Coach at Tiffany & Co. in Costa Mesa will partner with store leaders and retail teams to ensure high sales and service standards. This role involves coaching, developing leadership…
Visiting LVN/Medical Assistant (Palm Springs)
About the Job: Visiting LVN (Palm Springs) Our Home health agency has been in the business for years and has been continuously expanding our team of caring health care professionals to provide our…
Palantir Foundry Application Developer
Palantir Foundry Application Developer Location: San Diego, CA Clearance: Must be able to obtain a Secret clearance The Marlin Alliance, Inc. is seeking a Palantir Foundry Application Developer to …
Software Engineer II
About Pivotal Health Pivotal Health is the leading technology platform that helps healthcare providers get paid fairly in an increasingly complex reimbursement landscape. Today, many providers fa…