Machine Learning Systems Engineer, Encodings and Tokenization (San Francisco)
About Anthropic
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About The Role
We are seeking an experienced Machine Learning Systems Engineer to join our Encodings and Tokenization team at Anthropic. This cross-functional role will be instrumental in developing and optimizing the encodings and tokenization systems used throughout our Finetuning workflows. As a bridge between our Pretraining and Finetuning teams, you'll build critical infrastructure that directly impacts how our models learn from and interpret data. Your work will be foundational to Anthropic's research progress, enabling more efficient and effective training of our AI systems while ensuring they remain reliable, interpretable, and steerable.
Responsibilities
- Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows
- Optimize encoding techniques to improve model training efficiency and performance
- Collaborate closely with research teams to understand their evolving needs around data representation
- Build infrastructure that enables researchers to experiment with novel tokenization approaches
- Implement systems for monitoring and debugging tokenization-related issues in the model training pipeline
- Create robust testing frameworks to validate tokenization systems across diverse languages and data types
- Identify and address bottlenecks in data processing pipelines related to tokenization
- Document systems thoroughly and communicate technical decisions clearly to stakeholders across teams
You may be a good fit if you:
- Have significant software engineering experience with demonstrated machine learning expertise
- Are comfortable navigating ambiguity and developing solutions in rapidly evolving research environments
- Can work independently while maintaining strong collaboration with cross-functional teams
- Are results-oriented, with a bias towards flexibility and impact
- Have experience with machine learning systems, data pipelines, or ML infrastructure
- Are proficient in Python and familiar with modern ML development practices
- Have strong analytical skills and can evaluate the impact of engineering changes on research outcomes
- Pick up slack, even if it goes outside your job description
- Enjoy pair programming (we love to pair!)
- Care about the societal impacts of your work and are committed to developing AI responsibly
Strong candidates may also have experience with:
- Working with machine learning data processing pipelines
- Building or optimizing data encodings for ML applications
- Implementing or working with BPE, WordPiece, or other tokenization algorithms
- Performance optimization of ML data processing systems
- Multi-language tokenization challenges and solutions
- Research environments where engineering directly enables scientific progress
- Distributed systems and parallel computing for ML workflows
- Large language models or other transformer-based architectures (not required)
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Research Services