Software Engineer, Networking
About the Team
The Platform Networking team is responsible for the collective communication stack used in our largest training jobs. Using a combination of C++ and CUDA, we work on novel collective communication techniques that enable efficient training of our flagship models on our largest custom-built supercomputers.
The models we train are key ingredients to the AI research progress at OpenAI and the field as a whole, and we continually incorporate learnings from our entire research org into our training platform.
About the Role
As a Software Engineer, Networking, you will design and implement custom networking collectives that are tightly integrated into our training stack.
We’re looking for people who have a background in low-level, performance-critical software. Experience with collective communication is a bonus.
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will:
Collaborate closely with ML researchers to design and implement efficient collective operations in C++ and CUDA.
Ensure that our largest training jobs take full advantage of the different network transports used in our supercomputers.
Work on simulations to inform our future supercomputer network designs.
You might thrive in this role if you:
Have written distributed algorithms using RDMA in the past.
Are comfortable writing low-level, performance-sensitive CPU and/or GPU code.
Are familiar with network simulation techniques.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
For US-based candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.