HPC Solution Architect - AI Infrastructure(S2S)
As a Solution Architect on the Silicon2Service team, you will be responsible for:
- Leading architecture for pursuits and active opportunities, including discovery, requirements, constraints, and target-state design
- Creatively defining reference architectures for on-premises, cloud, and hybrid GPU platforms across compute, network, storage, security, software and operations
- Driving architecture trade-offs and decisions across performance, scalability, reliability, locality, total cost of ownership, time-to-value, and risk
- Owning the technical solution strategy in proposals and RFPs, including architecture narrative, assumptions, dependencies, sizing guidance, and delivery approach
- Facilitating client workshops and technical reviews and translating engineering detail into executive-ready communications
- Architecting complex, innovative technology solutions with a focus on business outcomes, cost of quality, and long-term scalability and sustainability.
- Engaging with C-Suite client leadership during sales and delivery, including leading technical pre-sales discussions, shaping proposals, and supporting the closing of new business opportunities
- Supporting go-to-market strategies, including participation in industry events, conferences, and client briefings
- 10+ years of experience in infrastructure architecture or engineering for large-scale platforms including design, implementation, operations, and optimization.
- 4+ years designing or delivering GPU-accelerated platforms for AI, ML, or high-performance computing
- 3+ years Linux system administration in production environments
- 3+ years designing or operating distributed compute clusters for AI/HPC in hybrid cloud setups, including multi-GPU topologies, partitioning, scheduler integration, and scalability for edge-to-cloud workloads.
- 2+ years with high-performance networking or storage for AI/HPC
- 2+ years building containerized platforms using Kubernetes or Red Hat OpenShift, including GPU operators/drivers, CUDA container runtime, and cluster lifecycle automation
- 2+ years automating infrastructure as code(IaC) with tools like Terraform and Ansible
- At least 2 end-to-end deployments of reference architectures in the cloud or on-prem, including variants with security controls, network segmentation, operational runbooks, and validation testing
- Experience in pre-sales or sales engineering, including discovery, solution demonstrations, and proposal/RFP contributions
- Ability to travel 50%, on average, based on the work you do and the clients and industries/sectors you serve.
- Limited immigration sponsorship may be available.
- 2+ years implementing AI/HPC cluster scheduling (Slurm and Kubernetes), including multi-tenant queues, quotas, and GPU-aware policies
- 2+ years supporting generative AI infrastructure patterns, including multi-node distributed training
- Experience with AI agents and frameworks
- Experience with high-throughput storage for AI/HPC
- Experience executing NVIDIA co-sell motions with OEMS (Dell, HPC, Lenovo), CSPs ( AWS, Azure, Google Cloud), or independent software vendors ( Run:ai, OpenShift, Weights & Biases)
Recommended Jobs
Senior Pump Mechanic
Seeking a Senior Pump Mechanic for a direct hire opportunity with our client in Anaheim, CA. This position offers full benefits including PTO, Medical, Dental, Vision, and 401k! Pay is between…
Overnight Stock Associate
Our values start with our people, join a team that values you! Bring your talents to Ross, our leading off-price retail chain with over 2,200 stores, and a strong track record of success and growth…
Fulfillment & Logistics Operations Specialist
We’re looking for someone who loves making things run smoothly behind the scenes to join our team as a Fulfillment & Logistics Operations Specialist. In this role, you’ll be the go-to person for gett…
Electrical Construction Project Manager (Benicia)
Location: Benicia, CA Employment Type: Full-Time Salary Range: $120,000–$160,000 annually (depending on experience) Industry: Commercial & Industrial Electrical Contracting About the…
Pizza Chef
Pizza Cook / Pizza Chef – High-Volume (Anaheim) ArtXRow is hiring a Pizza Cook to join a high-energy, multi-concept kitchen in Anaheim. This role focuses on pizza production within a structured, pr…
IT Help Desk Technician - Night Shift
Job Summary: The IT Helpdesk Technician will be the first point of contact for all IT-related issues, providing prompt, courteous, and effective technical support to internal users. You'll be …
Sales Trainee
This is a training role that is made to prepare the Sales Trainee for the Account Sales Manager role. The role primarily is to support sales initiatives and provide route coverage for the ASM during …
Electrical Designer
Electrical Designer position in Mission Viejo, CA We are a large, growing MEP Design Engineering company that specializes in commercial engineering and design projects throughout California. We h…
Occupational Therapist - Med B
WE ARE PT+!! We believe patients achieve the best outcomes when they receive care in comfortable, familiar settings. Serving communities across New York—including all five boroughs, Long Island, and …
Director of Technical Marketing (Semiconductor)
At Elevate Semiconductor, we empower semiconductor and system test customers by creating world class ICs that tackle the industry’s most complex automated test equipment challenges. Our innovative te…