AI Inference Engineer

quadric.io, Inc
Burlingame, CA

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Role:

The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the model deployment for efficient inference; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI model algorithms, system architecture and AI toolchains/frameworks.

Responsibilities:

  • Quantize, prune and convert models for deployment
  • Port models to Quadric platform using Quadric toolchain
  • Optimize inference deployment for latency, speed
  • Benchmark and profile model performance and accuracy
  • Develop tools to scale and speed up the deployment
  • Make Improvement to SDK and runtimeProvide technical support and documents to customers and developer community

Requirements:

  • Bachelor’s or Master’s in Computer Science and/or Electric Engineering.
  • 5+ years of experience in AI/LLM model inference and deployment frameworks/tools
  • experience with model quantization (PTQ, QAT) and tools
  • experience with model accuracy measuresexperience with model inference performance profiling
  • experience with at least one of the following frameworks: onnxruntime, Pytorch, vLLM, huggingface-transformer, neural-compressor, llamacpp
  • Proficiency in C/C++ and Python
  • Demonstrate good capability in problem solving, debug and communication
  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Training & Development
  • Work From Home
  • Free Food & Snacks
  • Stock Option Plan
#J-18808-Ljbffr
Posted 2026-01-15

Recommended Jobs

Machine Operator / Assembly

Coast Personnel
Hayward, CA

Job ID: 5968 POSITION SUMMARY Machine operators (Team Members) perform typical operations and assembly, within the injection molding facility. Full Time Work Schedule Varies Location: Hay…

View Details
Posted 2025-10-31

AI Enablement Program Lead

Menlo Ventures
San Francisco, CA

A financial technology company in San Francisco is looking for a Program Manager, Internal AI, to lead initiatives in AI enablement. This role involves managing cross-functional projects to enhance p…

View Details
Posted 2026-01-15

Accounts Payable Associate

Billiontoone
Menlo Park, CA

Ready to redefine what's possible in molecular diagnostics? Join a team of brilliant, passionate innovators who wake up every day determined to transform healthcare. At BillionToOne, we've built …

View Details
Posted 2026-01-16

Nanny

GreatAuPair LLC
Glendale, CA

?????????? ????? ???????????????????????????? ??????????????????? ??????????????????????????? ???????????????????? ????? ???????????? ?????????????????????????????? ???35-60? ????????????????? ???1375…

View Details
Posted 2025-11-09

Therapeutic Behavioral Specialist (San Jose)

Hope Services
San Jose, CA

Therapeutic Behavioral Specialist Are you a person who enjoys helping others? Are you currently seeking fulfillment in your professional life? Hope Services is Silicon Valleys leading provi…

View Details
Posted 2026-01-06

ABA Program Supervisor

Burnett Therapeutic Services
Livermore, CA

Interested in becoming a BCBA? We offer the internship hours for your Master's Program! Job Type : Full-time or part-time Location : Oakland, Livermore, Dublin, Hayward, San Leandro, Alameda …

View Details
Posted 2025-12-18

Hardware Integration and Test Intern

zoox
Foster, CA

Zoox is transforming mobility with fully autonomous, electric vehicles designed from the ground up for a driverless future. Our mission is to make transportation safer, more sustainable, and accessib…

View Details
Posted 2025-12-18

Tree Climber

Stay Green
Castaic, CA

Location30521 The Old Road, Castaic, CA, 91384, United States Base Pay$20.00 - $25.00 / Hour Job CategoryClimber, Tree Climber, Groundsman IndustryTree Care, Landscaping Employee TypeFull…

View Details
Posted 2025-12-13

Software Engineer II, Databases

Fivetran
Oakland, CA

About the Role Fivetran is building data pipelines to power the modern data stack for thousands of companies. We’re seeking an enthusiastic Software Engineer to join our fast-growing data compa…

View Details
Posted 2026-01-16

Retail Delivery Class A Driver (CDL)

Pet Food Express
Oakley, CA

Pet Food Express is seeking an experienced Class A Delivery Driver with a strong work ethic to deliver to our stores in the Northern California Region. Reporting to the Transportation and Logistics M…

View Details
Posted 2026-01-15