AI Infrastructure Engineer

Alldus International Consulting Ltd
California

Our client, an early-stage, AI-driven startup in the defense industry, is hiring an AI Infrastructure Engineer to join their team in California. The successful candidate will design and scale the foundation of the company's model training and deployment ecosystem, so that its vision-language-action models can learn from massive real-world datasets and operate seamlessly across both edge and cloud environments.

Responsibilities

  • Design and implement pipelines to ingest, transform and store petabytes of multimodal data from their robotic and operator systems.

  • Develop tools for dataset exploration, curation, versioning and quality monitoring.

  • Build and maintain distributed training infrastructure for large-scale multimodal and foundation model training, both in the cloud and on-premises.

  • Implement orchestration workflows to launch, track and debug large-scale model runs.

  • Identify and resolve bottlenecks in compute, memory, storage and network performance.

  • Collaborate with AI, autonomy and systems teams to support real-time and mission-critical applications.

  • Maintain observability and reliability tools for training and inference pipelines.

  • Stay up to date with best practices in MLOps, distributed training frameworks and AI infrastructure at scale.

Skillset

  • Bachelor’s degree or higher in Computer Science, Electrical Engineering or a related technical field.

  • Minimum of 3 years of experience in ML infrastructure, MLOps or large-scale data systems.

  • Proven experience with distributed training frameworks (e.g. PyTorch DDP, DeepSpeed, Ray) and workflow orchestration tools (e.g. Kubernetes, Airflow, or equivalents).

  • Strong proficiency in Python and hands-on experience with cloud-native infrastructure (AWS, GCP or Azure).

  • Solid understanding of data engineering concepts, including ETL pipelines, object storage, data versioning and metadata management.

  • Familiarity with containerization technologies (Docker, Kubernetes) and monitoring systems (Prometheus, Grafana).

  • Experience optimizing GPU cluster utilization, scaling training jobs and profiling model performance.

  • Experience with edge-deployed ML systems, federated training or robotic data collection pipelines is a plus.

  • Must have legal authorization to work in the U.S.; certain responsibilities may involve access to export-controlled information.

Benefits

  • Salary: $160K – $220K DOE. Exceptional candidates may be considered for higher compensation.

  • Performance bonus.

  • Equity.

  • Medical, dental and vision insurance.


Posted 2026-02-13
