Full-Stack Software Engineer, Compute Foundations
OpenAIGenerative AI company
San Francisco, United States$230K - $347K USDMid
Software Engineering
About the role
Build full-stack tools to monitor and improve supercomputer cluster operations and reliability.
- •Build web-based tools and scalable backend services to improve reliability, observability, and operations for OpenAIs largest supercomputing clusters.
- •Key Responsibilities Build full-stack web applications to surface cluster health and job failures in real time.
- •Design data models, APIs, and visualizations for scheduling and resource allocation.
- •Develop scalable backend services for high-volume cluster data with low latency.
- •Collaborate with researchers and infrastructure teams to solve operational problems.
- •Improve reliability, performance, and security of compute operation systems.
- •Requirements Significant full-stack development experience with modern frontend frameworks (e.g., React).
- •Backend experience using languages such as Python, Go, or Node.js.
- •Experience building scalable, high-performance web apps for distributed systems.
- •Knowledge of APIs, distributed data systems, and cloud infrastructure.
Tech stack
ReactPythonGoNode.jsKubernetesDockerAWSGrafanaPrometheusSQL
Match insights
Tech:React, Python, Go, Node.js, Kubernetes
Level:Mid
Location:San Francisco, United States