Senior AI Performance Engineer

Genmo

San Francisco, CA

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.

Role overview:

As a Senior AI Performance Engineer at Genmo, you will play a critical role in optimizing the performance of our large generative AI models. Your expertise will ensure that our models run efficiently on clusters, leveraging advanced techniques and tools to enhance their performance. This role is perfect for someone with a deep understanding of deep learning performance bottlenecks, kernel optimization, and distributed training strategies.

Key responsibilities:

Analyze and optimize the performance of massively parallel and distributed systems
Implement and fine-tune distributed training strategies for multi-GPU and multi-node environments
Implement high-performance CUDA, Triton, C++, and PyTorch code
Profile model performance and identify bottlenecks using tools like NVIDIA Nsight Systems, PyTorch Profiler, and TensorFlow Profiler
Develop and maintain benchmarking suites for continuous performance monitoring

Qualifications:

Master's or PhD in Computer Science, Electrical Engineering, or a related field
5+ years of experience in optimizing deep learning models, preferably in a production environment
Must have:
- Strong programming skills in Python and C++. Experience in training large models using Python & PyTorch and/or TensorFlow including their distributed training frameworks.
- Proven track record of optimizing large-scale models (10B+ parameters)
- Deep understanding of GPU architecture and CUDA programming
- Experience across the entire development pipeline, from data processing, preparation, and data loading to training and inference
- Experience optimizing and deploying inference workloads for throughput and latency across the stack (inputs, model inference, outputs, parallel processing, etc.)
- Demonstrated expertise in high-performance computing using NVIDIA Triton and CUDA
- Demonstrated ability to significantly improve model inference and training speeds through low-level optimizations
Ideal candidates will have:
- Knowledge of distributed inference systems for handling high-volume workloads
- Strong background in linear algebra, optimization, and machine learning algorithms
- Experience with generative AI models (GANs, Diffusion Models, Transformers)
- Knowledge of hardware-aware neural architecture design
- Experience with high-performance computing (HPC) environments
- Contributions to relevant open source projects or research publications

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

Posted 2025-09-22
