Researcher, Pretraining Safety
- Develop upstream safety evaluations to monitor how and when unsafe behaviors and goals emerge
- Create safer priors through targeted pretraining and mid-training interventions that make downstream alignment more effective and efficient
- Design safe-by-design architectures that allow for more controllability of model capabilities
- Identify safety-relevant behaviors as they first emerge in base models
- Evaluate and reduce risk without waiting for full-scale training runs
- Design architectures and training setups that make safer behavior the default
- Strengthen models by incorporating richer, earlier safety signals
- Develop new techniques to predict, measure, and evaluate unsafe behavior in early-stage models
- Design data curation strategies that improve pretraining priors and reduce downstream risk
- Explore safe-by-design architectures and training configurations that improve controllability
- Introduce novel safety-oriented loss functions, metrics, and evals into the pretraining stack
- Work closely with cross-functional safety teams to unify pre- and post-training risk reduction
You might thrive in this role if you:
- Have experience developing or scaling pretraining architectures (LLMs, diffusion models, multimodal models, etc.)
- Are comfortable working with training infrastructure, data pipelines, and evaluation frameworks (e.g., Python, PyTorch/JAX, Apache Beam)
- Enjoy hands-on research: designing, implementing, and iterating on experiments
- Enjoy collaborating with diverse technical and cross-functional partners (e.g., policy, legal, training)
- Are data-driven with strong statistical reasoning and rigor in experimental design
- Value building clean, scalable research workflows and streamlining processes for yourself and others