Senior Software Development Engineer, TensorRT-LLM
We are now looking for a TensorRT-LLM Software Development Engineer!
NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLMs, ChatGPT, and generative AI that have put DL at the "iPhone moment" for AI. Join the team building the inferencing software that is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.
What you'll be doing:
- Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance.
- Perform benchmarking, profiling, and system-level programming for GPU applications.
- Closely follow academic developments in the field of artificial intelligence and update TensorRT with relevant features.
- Provide code reviews, design docs, and tutorials to facilitate collaboration among the team.
- Conduct unit tests and performance tests for different stages of the inference pipeline.
- Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams
- Write safe, scalable, modular, and high-quality (C++/Python) code for our core backend software for LLM inference.
- Improve the usability of the TensorRT-LLM library and build systems (CMake)
What we need to see:
- Master's or higher degree in Computer Engineering, Computer Science, Applied Mathematics, or a related computing-focused field (or equivalent experience)
- 4+ years of relevant software development experience.
- Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
- Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models
- Experience working with deep learning frameworks like TensorFlow and PyTorch
- Self-starter who consistently takes initiative to drive projects forward
- Excellent written and oral communication skills in English
Ways to stand out from the crowd:
- Prior experience with an LLM framework or a DL compiler in inference, deployment, algorithms, or implementation
- Prior experience with performance modeling, profiling, debugging, and code optimization of a DL/HPC/high-performance application
- Architectural knowledge of CPU and GPU
- GPU programming experience (CUDA or OpenCL)