Research Engineer, AI Safety & Alignment
About the role and team
Joining us as a Research Engineer, you'll be at the forefront of tackling one of the most critical challenges in AI today: safety and alignment. Your work will be pivotal in understanding and mitigating the risks of advanced AI, conducting foundational research to make our models safer, and solving the core technical problems of AI alignment —ensuring our models behave in accordance with human values and intentions.
The Safety team is dedicated to pioneering and implementing techniques that make our models more robust, honest, and harmless. As a Research Engineer, you will bridge the gap between theoretical research and practical application, writing high-quality code to test hypotheses and integrating successful safety solutions directly into our products. Your research will not only protect millions of users but also contribute to the broader scientific community's understanding of how to build safe, beneficial AI.
What you'll do
Develop and implement novel evaluation methodologies and metrics to assess the safety and alignment of large language models.
Research and develop cutting-edge techniques for model alignment, value learning, and interpretability.
Conduct adversarial testing to proactively uncover potential vulnerabilities and failure modes in our models.
Analyze and mitigate biases, toxicity, and other harmful behaviors in large language models through techniques like reinforcement learning from human feedback (RLHF) and fine-tuning.
Collaborate with engineering and product teams to translate safety research into practical, scalable solutions and best practices.
Stay abreast of the latest advancements in AI safety research and contribute to the academic community through publications and presentations.
Who you are
Hold a PhD (or equivalent experience) in a relevant field such as Computer Science, Machine Learning, or a related discipline.
Write clear and clean production-facing and training code
Experience working with GPUs (training, serving, debugging)
Experience with data pipelines and data infrastructure
Strong understanding of modern machine learning techniques, particularly transformers and reinforcement learning, with a focus on their safety implications.
Are passionate about the responsible development of AI and dedicated to solving complex safety challenges.
Nice to Have
Experience with product experimentation and A/B testing
Experience training large models in a distributed setting
Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud)
Experience with explainable AI (XAI) and interpretability techniques.
Have research in AI safety, alignment, ethics , or a related area.
Knowledge of the broader societal and ethical implications of AI, including policy and governance.
Publications in relevant academic journals or conferences in the field of machine learning
About Character.AI
Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventure s.
In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.
Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!
At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.
Recommended Jobs
Mobile Product Manager
Perplexity is looking for an experienced product manager to join our small team revolutionizing the way people search and interact with the internet. You will be trusted to envision the future of sea…
Staff RF Design Validation and Test Engineer Wireless Connectivity & System Integration
Company: Qualcomm Atheros, Inc. Job Area: Engineering Group, Engineering Group Systems Test Engineering General Summary: About Us: We are a leading technology company at the fore…
Director of Admissions
This position will ensure, the Admissions team meet pre-set goals and performance standards for the continued success of the campus. The Director of Admissions will monitor employee staffing levels,…
Area Leader
The Area Leader for Kate Spade in Los Angeles is responsible for managing retail operations across multiple locations, driving sales strategies, and ensuring brand standards. This role requires a seas…
Full Time Family Practice Job TX
Family Medicine with Ob/Gyn JOB SUMMARY: Provide physician services to patients of the Clinic, inpatient, and OB patients Be available for consultation, assistance with medical emergencies a…
Android AI ML Engineer - On-Device
FocusKPI is looking for an Android AI ML Engineer - On-Device to join one of our clients, a high-tech SaaS company. The client is looking for a highly capable Android AI/ML Engineer - On-Device…
Customer Support Specialist
MealSuite, an Inc. 5000 Fastest-Growing Company , is a privately owned SaaS organization comprising 190+ team members across the globe, with hub locations in Cambridge, ON, Canada, Dallas, TX, US…
Operations Manager
THE COMPANY Our client is an innovative manufacturer specializing in high-end architectural and design-focused products. And they are seeking a hands-on Operations Manager to lead daily production a…
Test Engineer (Electrical)
About the Company At General Matter, we’re strengthening America’s capacity in nuclear energy to create a new set of possibilities for our shared future, from generating clean energy at scale to f…
Bookkeeper
Mochi Health’s mission is to be the discovery layer of healthcare. We are building a platform that makes it easier for patients to find the right providers, access the right medications, and take con…