Machine Learning Engineer: ML Infra and Model Optimization
Genies is an avatar technology company powering the next era of interactive digital identity through AI companions. With the Avatar Framework and intuitive creation tools, Genies enables developers, talent, and creators to generate and deploy game-ready AI companions. The company’s technology stack supports full customization, AI-generated fashion and props, and seamless integration of user-generated content (UGC). Backed by investors including Bob Iger, Silver Lake, BOND, and NEA, Genies’ mission is to become the visual and interactive layer for the LLM-powered internet.
Genies is looking for a ML Infra and Model Optimization Engineer to join our R&D team. Based either in our Los Angeles or San Francisco offices (Hybrid), you will work closely with a dedicated and talented team of technical artists, engineers and artists. Together, you will explore new concepts and technologies to further Genies' mission of empowering users to develop their own avatar ecosystems. We're looking for someone who is passionate about creating high-quality visuals and has the technical foundation to help us build the next wave of digital identity.
What You’ll be Doing:
- Design, build, and maintain production-grade ML infrastructure for image and 3D generative models.
- Develop and own backend services and APIs that support model inference at scale (high concurrency, low latency, high reliability).
- Deploy, monitor, and operate ML models on cloud and large-scale platforms (e.g., SageMaker, Kubernetes, Ray Serve, custom GPU services).
- Optimize inference pipelines using model acceleration techniques such as:
- quantization, pruning, mixed precision
- ONNX / TensorRT / torch.compile
- Partner with ML researchers to productionize diffusion models, transformer-based models, and 3D generation systems.
- Implement evaluation, logging, monitoring, and alerting to ensure system stability and performance.
- Improve end-to-end system efficiency across data loading, inference, post-processing, and storage.
- Support rapid experimentation while maintaining production safety and scalability.
What You Should Have:
- Strong experience building backend and infrastructure systems in production environments.
- Proficiency in Python and experience designing APIs/services (e.g., FastAPI, Flask, gRPC).
- Hands-on experience deploying and operating ML models at scale, including:
- GPU-based inference services
- concurrency handling and request batching
- latency and throughput optimization
- Experience with cloud platforms and ML deployment stacks, such as:
- AWS (SageMaker, EC2, EKS), GCP, or similar
- Docker, containers, CI/CD pipelines
- Solid understanding of systems performance, debugging, and reliability engineering.
- Experience supporting real user traffic, not just offline research workflows.
Bonus Skills (Nice-to-Have)
- Experience with generative models, especially:
- diffusion models
- transformer-based architectures
- multimodal image / 3D pipelines
- Familiarity with 3D generation or computer graphics pipelines (e.g., meshes, textures, multi-view data).
- Hands-on experience with model optimization and acceleration, such as:
- quantization, pruning, distillation
- ONNX Runtime, TensorRT, FSDP, DeepSpeed
- Experience with distributed systems or scalable inference frameworks (Ray, Triton, TorchServe).
- Background in machine learning fundamentals (training, evaluation, model behavior), even if not research-focused.
Here's why you'll love working at Genies:
- You'll work with a team that you’ll be able to learn from and grow with, including support for your own professional development
- You'll be at the helm of your own career, shaping it with your own innovative contributions to a nascent team and product
- You'll enjoy the culture and perks of a startup, with the stability of being well funded
- Comprehensive health insurance for you and your family (Anthem + Kaiser Options Available), Dental and Vision Insurance
- Flexible paid time off, sick time, and paid company holidays, in addition to paid parental leave, bereavement leave, and jury duty leave for full-time employees
- Health & wellness support through programs such as monthly wellness reimbursement
- Working in a brand new, bright, open-environment and fun office space - there’s even a slide!
- Choice of MacBook or windows laptop
Salary Range: $215K-$275K depending on experience
Genies is an equal opportunity employer committed to promoting an inclusive work environment free of discrimination and harassment. We value diversity, inclusion, and aim to provide a sense of belonging for everyone.
Recommended Jobs
Accounts Payable Specialist
About Northwood: Northwood is a modern space infrastructure company focused on connecting space and Earth. The world runs on space. Space will run on Northwood. Our global ground network ensures tha…
Territory Manager - UniFirst First Aid + Safety
Our Team is Kind of a Big Deal! UniFirst First Aid + Safety is seeking a reliable and hardworking Territory Manager to join our family. As a Territory Manager, you will be responsible for servici…
Collection Representative
Collection Representative – Full-Time, In-Office (Thousand Oaks, CA) Pay: $18.00 - $20.00 per hour + Monthly Performance Bonus Location: 555 St. Charles Dr., Suite 100, Thousand Oaks, CA 91360 …
Media Associate
Job Responsibilities: Assist in translating Mattel’s Media Buying Strategy into campaign builds within the Google Ads platform. This includes campaign creation, trafficking, optimizations, and pac…
Senior Software Engineer, AI Entities
EvenUp is one of the fastest-growing generative AI startups in history, on a mission to level the playing field for personal injury victims, which range from motor vehicle accidents to child abuse ca…
LLM Platform Engineer
&##128640; Join the Future of Commerce with Whatnot! Whatnot is the largest live shopping platform in North America and Europe to buy, sell, and discover the things you love. We’re re-defining e-comme…
Senior Backend Engineer
Intrinsic is building a trustworthy internet The internet, mediated by opaque algorithms, is fragile and manipulatable. Intrinsic is using AI to help rebuild an internet that’s safe enough to be f…
[Summer 2026] Software Engineer Intern
As a Software Engineer Intern, we'll support you as you tackle the hardest problems in tech today – distributed systems, real-time communication, 3D co-experience, extensive data processing, social n…
Aircraft MXS Research/Data Analyst, Senior WPAFB
Full-time Description Diaconia is looking for a talented Research Data Analyst, Senior to join our Amazing team! If you're looking to join a company that truly appreciates you and your ta…
COE Business Systems Data Analyst
Company Description At Intuitive, we are united behind our mission: we believe that minimally invasive care is life-enhancing care. Through ingenuity and intelligent technology, we expand the po…