Senior Data Scientist - Generative AI
WHY DATA SCIENCE & ANALYTICS?
The Data Science & Analytics organization's mission is to increase our speed, frequency, and acumen in making decisions at scale by instilling a data-influenced approach to building products. We cover a wide area of the data spectrum, including analytical data engineering, product analytics, experimentation, causal inference, statistical modeling, and machine learning. Aligned and partnered with product verticals, we use this extensive tool belt to discover new opportunities and unmet use cases, influence and craft the product roadmap, and prioritize, build data products, and measure impact on our community of players and developers.
WHY GENERATIVE AI?
The Foundation AI group’s mission is to enable Roblox Creators to accelerate their workflows and bring GenAI capabilities to millions of users. We envision a future where experiences on Roblox leverage generative text and speech to enable new interactions, and generative 3D and 4D capabilities to empower new creative workflows and user experience.
As a Data Scientist on the team, you will design, build, and operationalize evaluation for GenAI systems, and work with cross-functional teams to improve model performance and the AI data generation flow. Since AI evaluation is core to GenAI safety, quality, and iteration speed, we are building rigorous and scalable human and model-based evaluation systems that guide product decisions and model improvement. You’ll combine annotation analysis, design of experiments, causal inference, product analytics, and model-based evaluation methods (such as LLM-as-a-judge / VLM-as-a-judge) to measure quality, safety, and user satisfaction—and translate these findings into model and product improvements. You’ll also help develop groundbreaking methodologies and tools that advance AI evaluation at Roblox and set industry standards. Beyond AI evaluation, we proactively explore opportunities and solutions to improve the AI model and data generation flow.
Additionally, we will build agentic workflows and AI agents for data solutions that enable teams to effectively access data, extract data insights, follow best practices, and make data-informed decisions.
If you are a self-starter who is curious, rigorous, and passionate about building innovative solutions that deliver real business value—and thrive in a dynamic, collaborative environment—this role is for you.
You Will:
- Develop and improve evaluation frameworks for GenAI features (text, image, 3D, 4D, agentic workflow), including eval experiment design, eval dataset design, label reliability analysis, results analysis, and online evaluation based on user behavior and feedback.
- Establish best practices and guidelines for GenAI evaluation.
- Conduct product analytics, online experiments (A/B tests) and causal analyses to quantify GenAI feature impact and identify opportunities.
- Build automated evaluation systems, such as research and implement LLM-as-judge and VLM-as-judge methods.
- Research and apply state-of-the-art methodologies in GenAI evaluation.
- Advance reproducible evaluation tooling to lift evaluation rigor and efficiency at the company.
- Proactively explore and develop solutions to improve the AI model and data generation flow, ensuring high-quality input for training and deployment.
- Design and implement agentic workflows and AI agents to enable teams to effectively access data, extract data insights, and follow best data practices.
- Partner closely with cross-functional teams to align goals, plans, and execution.
You Have:
- An advanced Degree and/or PhD in Statistics, Economics, Operations Research, Computer Science, Applied Math, Physics, Engineering, or another quantitative field.
- 5+ years of experience in data science or a related field.
- Familiarity with GenAI models and GenAI evaluation methods.
- Passion for the GenAI field and enthusiasm for continuously improving methods and practices to drive product quality and business impact.
- Ability to effectively use AI tools to enhance productivity in research, ideation, coding, and documentation.
- Strong learning agility, experience conducting applied research or writing technical papers is a plus.
- Proficiency in SQL, Hive, or Spark for transforming and manipulating large datasets.
- Experience with scripting languages such as Python or R.
- A demonstrated track record of solving open-ended data science and modeling problems that drive business impact and improve user experience.
Recommended Jobs
911 Police Dispatcher (Lateral) (20623443)
Location 3901 Alamo Street Simi Valley, 93063 Description Join our team! We’re looking for skilled 9-1-1 Police Dispatchers who are passionate about public safety and excel in communi…
ML Infrastructure Engineer
About the Company Virtue AI is at the forefront of AI security. As enterprises increasingly adopt Large Language Models, the need for robust, trustworthy, and safe AI has never been greater. Our mis…
Contract Administrator
IDR is seeking a Contract Administrator to join one of our top clients in West Hollywood, CA. This role is pivotal in managing and executing contract-related activities for the procurement of goods a…
Process and Controls Financial Manager
About Gusto Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and…
Test Engineer, Actuation Systems (Starship)
SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolog…
Software engineer
About The Company Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. We're a Series C company backed by a16z, transforming industries tha…
Software Engineer (AI Performance)
Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefi…
Travel Nurse - ICU
We are seeking a dedicated Travel Nurse - ICU to provide expert care in the Intensive Care Unit in Lodi, CA. Deliver high-quality, compassionate care to critically ill patients in a fast-paced ICU …
Quality Engineer II / Senior - Machining
About The Role ABOUT ROCKET LAB Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and mor…