Senior Machine Learning Engineer II - LLM
What You Will Do
We are looking for a Machine Learning Engineer to help develop cutting-edge ML infrastructure for building and serving LLMs at Moveworks. This role will be critical in building, optimizing, and scaling end-to-end machine learning systems. The ML infra team covers a variety of responsibilities, including distributed training and inference pipelines for large language models (LLMs), model evaluation and monitoring frameworks, and LLM latency optimization. These frameworks serve as a strong foundation for our hundreds of ML and NLP models in production, serving hundreds of millions of enterprise employees. We are solving many challenges in service scalability as well as in the optimization of core algorithms.
In this role, you will work closely with our machine learning team, data infrastructure team and every core skill. Above all, your work will impact the way our customers experience AI. Put another way, this role is absolutely critical to the long-term scalability of our core AI product and ultimately the company. You will be responsible for building and productionizing ML infrastructure that runs state-of-the-art models. If you are looking for a high-impact, fast-moving role to take your work to the next level, we should have a conversation.
- Design, build and optimize scalable machine learning infrastructure to support training, evaluation, and deployment of large language models.
- Build abstractions to automate various steps in different ML workflows
- Collaborate with cross-functional teams of engineers, data analytics, machine learning experts, and product to build new features
- Leverage your experience to drive best practices in ML and data engineering
What You Bring To The Table
- 2+ years of industry experience in Machine Learning, Infrastructure or related fields
- Experience with deep learning frameworks such as PyTorch or Hugging Face, or LLM serving frameworks such as vLLM or TensorRT-LLM.
- Experience with building and scaling end-to-end machine learning systems
- Experience building scalable micro services and ETL pipelines
- Expertise in Python and experience with a performant language such as C++ or Go
- Bachelor's in Computer Science, Computer Engineering, Mathematics, or equivalent field.
- A love of research publications in the machine learning and software engineering communities
- Effective communicator with experience collaborating cross-functionally with other teams
Nice To Haves
- Experience with ML inference optimization using TensorRT.
- Experience with distributed training frameworks such as DeepSpeed.
- Experience in managing and scaling GPU inference services via Kubernetes
Base salary compensation range: $200,000 - $275,000