Senior Software Engineer, Compute Architecture
CoreWeaveGPU Cloud company
Livingston, United StatesSenior
Software Engineering
About the role
Build and operate Go-based services managing GPU data center hardware lifecycle and automation.
- •Build and operate Go-based distributed services that manage hardware lifecycle and automation for large-scale GPU data centers, improving observability and reliability at fleet scale.
- •Key Responsibilities Design, build, and operate Go services for data center infrastructure lifecycle management.
- •Automate data center bring-up, hardware discovery, health monitoring, and remediation.
- •Develop APIs and workflows for BMCs, firmware, server health, and rack infrastructure.
- •Improve observability, alerting, and operational tooling to resolve production issues.
- •Requirements 5+ years building and operating infrastructure or backend systems.
- •Proficiency in Go and building production services and APIs (gRPC/REST).
- •Experience with Kubernetes and containerized production workloads.
- •Familiarity with observability tools such as Prometheus and Grafana.
Tech stack
GogRPCREST APIKubernetesPrometheusGrafana
Match insights
Tech:Go, gRPC, REST API, Kubernetes, Prometheus
Level:Senior
Location:Livingston, United States