Machine Learning Engineer: ML Infra and Model Optimization

Genies

Los Angeles, CA

Genies is an avatar technology company powering the next era of interactive digital identity through AI companions. With the Avatar Framework and intuitive creation tools, Genies enables developers, talent, and creators to generate and deploy game-ready AI companions. The company’s technology stack supports full customization, AI-generated fashion and props, and seamless integration of user-generated content (UGC). Backed by investors including Bob Iger, Silver Lake, BOND, and NEA, Genies’ mission is to become the visual and interactive layer for the LLM-powered internet.

Genies is looking for a ML Infra and Model Optimization Engineer to join our R&D team. Based either in our Los Angeles or San Francisco offices (Hybrid), you will work closely with a dedicated and talented team of technical artists, engineers and artists. Together, you will explore new concepts and technologies to further Genies' mission of empowering users to develop their own avatar ecosystems. We're looking for someone who is passionate about creating high-quality visuals and has the technical foundation to help us build the next wave of digital identity.

What You’ll be Doing:

Design, build, and maintain production-grade ML infrastructure for image and 3D generative models.

Develop and own backend services and APIs that support model inference at scale (high concurrency, low latency, high reliability).

Deploy, monitor, and operate ML models on cloud and large-scale platforms (e.g., SageMaker, Kubernetes, Ray Serve, custom GPU services).

Optimize inference pipelines using model acceleration techniques such as:

quantization, pruning, mixed precision

ONNX / TensorRT / torch.compile

Partner with ML researchers to productionize diffusion models, transformer-based models, and 3D generation systems.

Implement evaluation, logging, monitoring, and alerting to ensure system stability and performance.

Improve end-to-end system efficiency across data loading, inference, post-processing, and storage.

Support rapid experimentation while maintaining production safety and scalability.

What You Should Have:

Strong experience building backend and infrastructure systems in production environments.

Proficiency in Python and experience designing APIs/services (e.g., FastAPI, Flask, gRPC).

Hands-on experience deploying and operating ML models at scale, including:

GPU-based inference services

concurrency handling and request batching

latency and throughput optimization

Experience with cloud platforms and ML deployment stacks, such as:

AWS (SageMaker, EC2, EKS), GCP, or similar

Docker, containers, CI/CD pipelines

Solid understanding of systems performance, debugging, and reliability engineering.

Experience supporting real user traffic, not just offline research workflows.

Bonus Skills (Nice-to-Have)

Experience with generative models, especially:

diffusion models

transformer-based architectures

multimodal image / 3D pipelines

Familiarity with 3D generation or computer graphics pipelines (e.g., meshes, textures, multi-view data).

Hands-on experience with model optimization and acceleration, such as:

quantization, pruning, distillation

ONNX Runtime, TensorRT, FSDP, DeepSpeed

Experience with distributed systems or scalable inference frameworks (Ray, Triton, TorchServe).

Background in machine learning fundamentals (training, evaluation, model behavior), even if not research-focused.

Here's why you'll love working at Genies:

You'll work with a team that you’ll be able to learn from and grow with, including support for your own professional development

You'll be at the helm of your own career, shaping it with your own innovative contributions to a nascent team and product

You'll enjoy the culture and perks of a startup, with the stability of being well funded

Comprehensive health insurance for you and your family (Anthem + Kaiser Options Available), Dental and Vision Insurance

Flexible paid time off, sick time, and paid company holidays, in addition to paid parental leave, bereavement leave, and jury duty leave for full-time employees

Health & wellness support through programs such as monthly wellness reimbursement

Working in a brand new, bright, open-environment and fun office space - there’s even a slide!

Choice of MacBook or windows laptop

Salary Range: $215K-$275K depending on experience

Genies is an equal opportunity employer committed to promoting an inclusive work environment free of discrimination and harassment. We value diversity, inclusion, and aim to provide a sense of belonging for everyone.

Posted 2026-01-10

Recommended Jobs

Accounts Payable Specialist

Northwoodspace

Los Angeles, CA

About Northwood: Northwood is a modern space infrastructure company focused on connecting space and Earth. The world runs on space. Space will run on Northwood. Our global ground network ensures tha…

View Details

Posted 2025-11-19

Territory Manager - UniFirst First Aid + Safety

UniFirst

San Diego, CA

Our Team is Kind of a Big Deal! UniFirst First Aid + Safety is seeking a reliable and hardworking Territory Manager to join our family. As a Territory Manager, you will be responsible for servici…

View Details

Posted 2026-01-03

Collection Representative

firstsourc

Thousand Oaks, CA

Collection Representative – Full-Time, In-Office (Thousand Oaks, CA) Pay: $18.00 - $20.00 per hour + Monthly Performance Bonus Location: 555 St. Charles Dr., Suite 100, Thousand Oaks, CA 91360 …

View Details

Posted 2025-10-07

Media Associate

SGS Consulting

California

Job Responsibilities: Assist in translating Mattel’s Media Buying Strategy into campaign builds within the Google Ads platform. This includes campaign creation, trafficking, optimizations, and pac…

View Details

Posted 2025-11-14

Senior Software Engineer, AI Entities

Evenup

San Francisco, CA

EvenUp is one of the fastest-growing generative AI startups in history, on a mission to level the playing field for personal injury victims, which range from motor vehicle accidents to child abuse ca…

View Details

Posted 2025-12-19

LLM Platform Engineer

Whatnot

San Francisco, CA

&##128640; Join the Future of Commerce with Whatnot! Whatnot is the largest live shopping platform in North America and Europe to buy, sell, and discover the things you love. We’re re-defining e-comme…

View Details

Posted 2026-01-07

Senior Backend Engineer

Instrinsic

San Francisco, CA

Intrinsic is building a trustworthy internet The internet, mediated by opaque algorithms, is fragile and manipulatable. Intrinsic is using AI to help rebuild an internet that’s safe enough to be f…

View Details

Posted 2025-11-25

[Summer 2026] Software Engineer Intern

Roblox

San Mateo, CA

As a Software Engineer Intern, we'll support you as you tackle the hardest problems in tech today – distributed systems, real-time communication, 3D co-experience, extensive data processing, social n…

View Details

Posted 2026-01-07

Aircraft MXS Research/Data Analyst, Senior WPAFB

Diaconia

Patterson, CA

Full-time Description Diaconia is looking for a talented Research Data Analyst, Senior to join our Amazing team! If you're looking to join a company that truly appreciates you and your ta…

View Details

Posted 2026-01-07

COE Business Systems Data Analyst

Intuitive

Sunnyvale, CA

Company Description At Intuitive, we are united behind our mission: we believe that minimally invasive care is life-enhancing care. Through ingenuity and intelligent technology, we expand the po…

View Details

Posted 2025-11-25