Senior Machine Learning Engineer II - LLM
What You Will Do
We are looking for a Machine Learning Engineer to help build cutting-edge ML infrastructure for building and serving LLMs at Moveworks. This role will be critical in building, optimizing, and scaling end-to-end machine learning systems. The ML infra team covers a variety of responsibilities, including distributed training and inference pipelines for large language models (LLMs), model evaluation and monitoring frameworks, and LLM latency optimization. These frameworks serve as a strong foundation for our hundreds of ML and NLP models in production, serving hundreds of millions of enterprise employees. We are solving many challenges in the scalability of services as well as the optimization of core algorithms.
In this role, you will work closely with our machine learning team, data infrastructure team and every core skill. Above all, your work will impact the way our customers experience AI. Put another way, this role is critical to the long-term scalability of our core AI product and, ultimately, the company. You will be responsible for building and productionizing ML infrastructure that runs state-of-the-art models. If you are looking for a high-impact, fast-moving role to take your work to the next level, we should have a conversation.
- Design, build and optimize scalable machine learning infrastructure to support training, evaluation, and deployment of large language models.
- Build abstractions to automate various steps in different ML workflows
- Collaborate with cross-functional teams of engineers, data analytics, machine learning experts, and product to build new features
- Leverage your experience to drive best practices in ML and data engineering
What You Bring To The Table
- 2+ years of industry experience in Machine Learning, Infrastructure or related fields
- Experience with a deep learning framework such as PyTorch or Hugging Face, or an LLM serving framework such as vLLM or TensorRT-LLM.
- Experience with building and scaling end-to-end machine learning systems
- Experience building scalable microservices and ETL pipelines
- Expertise in Python and experience with a performant language such as C++ or Go
- Bachelor's in Computer Science, Computer Engineering, Mathematics, or equivalent field.
- A love of research publications in the machine learning and software engineering communities
- Effective communicator with experience collaborating cross-functionally with other teams
Nice To Haves
- Experience with ML inference optimization using TensorRT.
- Experience with distributed training frameworks such as DeepSpeed.
- Experience in managing and scaling GPU inference services via Kubernetes
Base salary compensation range: $200,000 - $275,000