ML Infrastructure Engineer

Phizenix
Menlo Park, CA

Menlo Park, CA | On-Site | Full-Time/Direct Hire


Client Opportunity | Through Phizenix

Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.

We’re looking for an ML Infrastructure Engineer to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance, distributed systems that bring advanced LLMs into production.

Responsibilities

  • Design and manage distributed infrastructure for ML training at scale
  • Optimize model serving systems for low-latency inference
  • Build automated pipelines for data processing, model training, and deployment
  • Implement observability tools to monitor performance in production
  • Maximize resource utilization across GPU clusters and cloud environments
  • Translate research requirements into robust, scalable system designs

Must-Haves

  • PhD in Computer Science, Engineering, or a related field (or equivalent experience)
  • Strong foundation in software engineering, systems design, and distributed systems
  • Experience with cloud platforms (AWS, GCP, or Azure)
  • Proficient in Python and at least one systems-level language (C++/Rust/Go)
  • Hands-on experience with Docker, Kubernetes, and CI/CD workflows
  • Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective
  • Understanding of GPU programming and high-performance infrastructure

Nice-to-Haves

  • Experience with large-scale ML training clusters and GPU orchestration
  • Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime)
  • Experience with distributed training strategies (e.g., data/model/pipeline parallelism)
  • Familiarity with orchestration tools like Kubeflow or Airflow
  • Background in performance tuning, system profiling, and MLOps best practices

At Phizenix, we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future, together.

California Pay Range

$180,000 - $200,000 USD

Posted 2025-09-22
