AI Inference Engineer
Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.
Role:
The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the model deployment for efficient inference; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI model algorithms, system architecture and AI toolchains/frameworks.
Responsibilities:
- Quantize, prune and convert models for deployment
- Port models to Quadric platform using Quadric toolchain
- Optimize inference deployment for latency, speed
- Benchmark and profile model performance and accuracy
- Develop tools to scale and speed up the deployment
- Make Improvement to SDK and runtimeProvide technical support and documents to customers and developer community
Requirements:
- Bachelor’s or Master’s in Computer Science and/or Electric Engineering.
- 5+ years of experience in AI/LLM model inference and deployment frameworks/tools
- experience with model quantization (PTQ, QAT) and tools
- experience with model accuracy measuresexperience with model inference performance profiling
- experience with at least one of the following frameworks: onnxruntime, Pytorch, vLLM, huggingface-transformer, neural-compressor, llamacpp
- Proficiency in C/C++ and Python
- Demonstrate good capability in problem solving, debug and communication
- Health Care Plan (Medical, Dental & Vision)
- Retirement Plan (401k, IRA)
- Life Insurance (Basic, Voluntary & AD&D)
- Paid Time Off (Vacation, Sick & Public Holidays)
- Family Leave (Maternity, Paternity)
- Short Term & Long Term Disability
- Training & Development
- Work From Home
- Free Food & Snacks
- Stock Option Plan
Recommended Jobs
Machine Operator / Assembly
Job ID: 5968 POSITION SUMMARY Machine operators (Team Members) perform typical operations and assembly, within the injection molding facility. Full Time Work Schedule Varies Location: Hay…
AI Enablement Program Lead
A financial technology company in San Francisco is looking for a Program Manager, Internal AI, to lead initiatives in AI enablement. This role involves managing cross-functional projects to enhance p…
Accounts Payable Associate
Ready to redefine what's possible in molecular diagnostics? Join a team of brilliant, passionate innovators who wake up every day determined to transform healthcare. At BillionToOne, we've built …
Nanny
?????????? ????? ???????????????????????????? ??????????????????? ??????????????????????????? ???????????????????? ????? ???????????? ?????????????????????????????? ???35-60? ????????????????? ???1375…
Therapeutic Behavioral Specialist (San Jose)
Therapeutic Behavioral Specialist Are you a person who enjoys helping others? Are you currently seeking fulfillment in your professional life? Hope Services is Silicon Valleys leading provi…
ABA Program Supervisor
Interested in becoming a BCBA? We offer the internship hours for your Master's Program! Job Type : Full-time or part-time Location : Oakland, Livermore, Dublin, Hayward, San Leandro, Alameda …
Hardware Integration and Test Intern
Zoox is transforming mobility with fully autonomous, electric vehicles designed from the ground up for a driverless future. Our mission is to make transportation safer, more sustainable, and accessib…
Tree Climber
Location30521 The Old Road, Castaic, CA, 91384, United States Base Pay$20.00 - $25.00 / Hour Job CategoryClimber, Tree Climber, Groundsman IndustryTree Care, Landscaping Employee TypeFull…
Software Engineer II, Databases
About the Role Fivetran is building data pipelines to power the modern data stack for thousands of companies. We’re seeking an enthusiastic Software Engineer to join our fast-growing data compa…
Retail Delivery Class A Driver (CDL)
Pet Food Express is seeking an experienced Class A Delivery Driver with a strong work ethic to deliver to our stores in the Northern California Region. Reporting to the Transportation and Logistics M…