AI/ML Inference Engineer

Krea

San Francisco, CA

About Krea:

At Krea, we're dedicated to making AI intuitive and controllable for creatives. Our mission is to build tools that empower human creativity, not replace it. We believe AI is a new medium that allows us to express ourselves through various formats—text, images, video, sound, and even 3D. We're building better, smarter, and more controllable tools to harness this medium.

We’re backed by Bain Capital Ventures, A16Z, Abstract Ventures, Pebblebed and many others. If you're passionate about pushing the boundaries of AI and empowering human creativity, we'd love to hear from you.

We're looking for a Machine Learning Engineer to help us optimize the inference and training of our AI models. You will collaborate closely with our AI research and infrastructure teams to integrate optimizations seamlessly.

Our culture:

We work full-time and in-person at our waterfront office in San Francisco.
We believe that demonstrated interest in the creative space is key: our team includes musicians, designers, visual artists and more.

What you'll do:

Write custom CUDA kernels to speed up multi-node inference on image and video models.
Work on various caching and dynamic compilation techniques to optimize the loading and unloading of the variety of AI models we serve at Krea.
Improve the speed and efficiency of training runs across our GPU clusters.

We'd like you to have:

Proficiency in CUDA or parallel programming.
Python/C++ programming experience.
Experience in optimizing diffusion/transformer models for performance and scalability.
High agency and resourcefulness.

What we offer:

Openness to sponsoring international candidates (e.g., STEM OPT, OPT, H-1B, O-1, E-3)
Work alongside a world-class team developing the future of AI tooling
Significant impact on Krea’s market presence and growth
Competitive compensation (75th percentile of market rates) with significant equity upside

Posted 2025-09-22
