AI Infrastructure Engineer

Alldus International Consulting Ltd
California

Our client, an early-stage, AI-driven startup in the defense industry, is hiring an AI Infrastructure Engineer to join their team in California. The successful candidate will design and scale the foundation of their model training and deployment ecosystem to enable their vision-language-action models to learn from massive real-world datasets and operate seamlessly across both edge and cloud environments.

Responsibilities

  • Design and implement pipelines to ingest, transform and store petabytes of multimodal data from their robotic and operator systems.

  • Develop tools for dataset exploration, curation, versioning and quality monitoring.

  • Build and maintain distributed training infrastructure for large-scale multimodal and foundation model training, both in the cloud and on-premises.

  • Implement orchestration workflows to launch, track and debug large-scale model runs.

  • Identify and resolve bottlenecks in compute, memory, storage and network performance.

  • Collaborate with AI, autonomy and systems teams to support real-time and mission-critical applications.

  • Maintain observability and reliability tools for training and inference pipelines.

  • Stay up to date with best practices in MLOps, distributed training frameworks and AI infrastructure at scale.

Skillset

  • Bachelor’s degree or higher in Computer Science, Electrical Engineering or a related technical field.

  • Minimum of 3 years of experience in ML infrastructure, MLOps or large-scale data systems.

  • Proven experience with distributed training frameworks (e.g. PyTorch DDP, DeepSpeed, Ray) and workflow orchestration tools (e.g. Kubernetes, Airflow or equivalents).

  • Strong proficiency in Python and hands-on experience with cloud-native infrastructure (AWS, GCP or Azure).

  • Solid understanding of data engineering concepts, including ETL pipelines, object storage, data versioning and metadata management.

  • Familiarity with containerization technologies (Docker, Kubernetes) and monitoring systems (Prometheus, Grafana).

  • Experience optimizing GPU cluster utilization, scaling training jobs and profiling model performance.

  • Experience with edge-deployed ML systems, federated training or robotic data collection pipelines is a plus.

  • Must have legal authorization to work in the U.S.; certain responsibilities may involve access to export-controlled information.

Benefits

  • Salary: $160K – $220K DOE. Exceptional candidates may be considered for higher compensation.

  • Performance bonus.

  • Equity.

  • Medical, dental and vision insurance.

Posted 2026-02-13
