ML Infrastructure Engineer
Menlo Park, CA | On-Site | Full-Time/Direct Hire
Client Opportunity | Through Phizenix
Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.
We’re looking for an ML Infrastructure Engineer to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance, distributed systems that bring advanced LLMs into production.
Responsibilities
Design and manage distributed infrastructure for ML training at scale
Optimize model serving systems for low-latency inference
Build automated pipelines for data processing, model training, and deployment
Implement observability tools to monitor performance in production
Maximize resource utilization across GPU clusters and cloud environments
Translate research requirements into robust, scalable system designs
Must-Haves
PhD in Computer Science, Engineering, or a related field (or equivalent experience)
Strong foundation in software engineering, systems design, and distributed systems
Experience with cloud platforms (AWS, GCP, or Azure)
Proficiency in Python and at least one systems-level language (C++/Rust/Go)
Hands-on experience with Docker, Kubernetes, and CI/CD workflows
Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective
Understanding of GPU programming and high-performance infrastructure
Nice-to-Haves
Experience with large-scale ML training clusters and GPU orchestration
Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime)
Experience with distributed training strategies (e.g., data/model/pipeline parallelism)
Familiarity with orchestration tools like Kubeflow or Airflow
Background in performance tuning, system profiling, and MLOps best practices
At Phizenix, we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future together.
California Pay Range
$180,000 - $200,000 USD