Senior Machine Learning Engineer II - LLM
What You Will Do
We are looking for a Machine Learning Engineer to help build cutting-edge ML infrastructure for building and serving LLMs at Moveworks. This role will be critical in building, optimizing, and scaling end-to-end machine learning systems. The ML infra team covers a variety of responsibilities, including distributed training and inference pipelines for large language models (LLMs), model evaluation and monitoring frameworks, and LLM latency optimization. These frameworks serve as a strong foundation for our hundreds of ML and NLP models in production, serving hundreds of millions of enterprise employees. We are solving many challenges in the scalability of services as well as the optimization of core algorithms.
In this role, you will work closely with our machine learning team, data infrastructure team, and every core skill. Above all, your work will impact the way our customers experience AI. Put another way, this role is critical to the long-term scalability of our core AI product and ultimately the company. You will be responsible for building and productionizing ML infrastructure that runs state-of-the-art models. If you are looking for a high-impact, fast-moving role to take your work to the next level, we should have a conversation.
- Design, build and optimize scalable machine learning infrastructure to support training, evaluation, and deployment of large language models.
- Build abstractions to automate various steps in different ML workflows
- Collaborate with cross-functional teams of engineers, data analysts, machine learning experts, and product managers to build new features
- Leverage your experience to drive best practices in ML and data engineering
What You Bring To The Table
- 2+ years of industry experience in Machine Learning, Infrastructure or related fields
- Experience with a deep learning framework such as PyTorch or Hugging Face, or an LLM serving framework such as vLLM or TensorRT-LLM
- Experience with building and scaling end-to-end machine learning systems
- Experience building scalable microservices and ETL pipelines
- Expertise in Python and experience with a performant language such as C++ or Go
- Bachelor's in Computer Science, Computer Engineering, Mathematics, or equivalent field.
- A love of research publications in the machine learning and software engineering communities
- Effective communicator with experience collaborating cross-functionally with other teams
Nice To Haves
- Experience with ML inference optimization using TensorRT
- Experience with distributed training frameworks such as DeepSpeed
- Experience managing and scaling GPU inference services via Kubernetes
Base salary compensation range: $200,000 - $275,000