Machine Learning Research Engineer
We’re partnering with a frontier AI startup that’s redefining how models learn to understand subjective quality — from writing tone and design aesthetics to emotional resonance and creative expression.
They’re collaborating with leading AI labs and building new methods that help models reason about creativity, taste, and quality. The founding team is small, highly technical, and deeply curious. Building at the intersection of research, product, and creativity .
As a Machine Learning Research Engineer , you’ll own end-to-end research cycles — designing, running, and analyzing post-training experiments that help models evaluate style and subjective quality. You’ll collaborate directly with AI labs and creative experts, and have the opportunity to publish your work.
Requirements
• 2–6+ years of professional experience in machine learning research, post-training, or ML engineering
• Strong skills in Python and PyTorch, with hands-on experience training and fine-tuning models
• Deep familiarity with LLMs, multimodal models (text–image/video), and post-training techniques such as RLHF or DPO
• Proven ability to run experiments end-to-end — from dataset design and model training to evaluation and iteration
• Experience developing or using evaluation benchmarks for generative or subjective tasks
• Comfort collaborating in a fast-moving, early-stage environment with limited infrastructure
• Clear, thoughtful communication — you can explain research outcomes to both technical and non-technical collaborators
• A collaborative, low-ego mindset and genuine excitement about building something new
• Interest in subjective or creative domains (e.g. writing, design, visual aesthetics)
• Experience working with or at data vendors (Scale, Labelbox, Snorkel, etc.)
• Prior work with alignment, preference modeling, or reinforcement learning
• A record of publications, blog posts, or open-source contributions in the ML community
Benefits
• Meaningful equity — real ownership in an early-stage, high-impact company
• Competitive compensation ($200K–$350K base)
• In-person, collaborative culture. Work alongside a small, highly talented team in Jackson Square, San Francisco
• Creative research freedom. Autonomy to design, publish, and share your work across open channels
• Partnerships with leading AI labs. Direct exposure to cutting-edge research and experiments
• Growth potential. Shape foundational systems at an early stage and scale with the company
• Support for learning and inspiration. Funding for courses, conferences, or creative exploration related to your work
• Low-ego, non-toxic environment. A team culture built on curiosity, collaboration, and mutual respect
Recommended Jobs
Senior Full Stack Software Engineer
ABOUT US: HiveWatch is a tech-forward, inclusive organization fostering the evolution of the physical security industry. We are a diverse team of forward thinkers who empower each other to find cr…
CUSTOMER SALES & SERVICE REP I - BILINGUAL PREFERRED (ENGLISH/SPANISH)
Company Overview SiteOne associates are customer obsessed, always safe, continuously improving, and having fun! Whether you are experienced in the green industry, a professional looking for a care…
Senior UX Researcher, Operations Research
As the Senior UX Researcher, you will help guide design, product, and engineering teams towards the development of a suite of tools designed to support our fleet of autonomous vehicles in action. You…
MarTech Product Manager
Job Description We are looking for a customer-obsessed MarTech Product Manager to lead the strategy, development, and optimization of our marketing technology stack. In this role, you’ll partn…
Staff Product Manager (Developer tools)
About Workato Workato transforms technology complexity into business opportunity. As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting…
Senior Data Scientist, Product Analytics - Credit Card
The role: We are seeking a Senior Data Scientist to join our Pricing team in the Lending Organization, with focus on our Credit Card business. This is an exciting role for someone looking to make …
SGH - GPATH- Lead Nurse - Full Time - Nights
Hours : Shift Start Time: 7 PM Shift End Time: 7:30 AM AWS Hours Requirement: 12/36 - 12 Hour Shift Additional Shift Information: Weekend Requirements: Every Other On-Call…
Remote Piloting Test Engineer
We're building safety-enhancing technology for aviation that will save lives. Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tran…
Financial Analyst III
Description Internet Brands is adding a finance partner to our Financial Planning & Analysis (FP&A) team. This is not a “just send the report” analyst role — you’ll own monthly performance for y…
Machine Learning Engineer, Motion Planning
Woven by Toyota is the mobility technology subsidiary of Toyota Motor Corporation. Our mission is to deliver safe, intelligent, human-centered mobility for all. Through our Arene mobility software p…