Researcher, Context - Agent Post-Training
OpenAIGenerative AI company
San Francisco, United States$250K - $380K USDMid
Data & AI
About the role
Run experiments and improve post-training systems to scale context for agent models.
- •Work on Agent Post-Training to design experiments and improve scaling of compute on context, turning failures into training data and shipping improvements into frontier models.
- •Key Responsibilities Design and run experiments to improve context compute scaling.
- •Own post-training stack improvements: data pipelines, RL, graders, reward signals, and evals.
- •Build evals/environments and convert failures into training data or product fixes.
- •Collaborate with product, infrastructure, and safety teams to decide model run inclusions.
- •Requirements Strong technical fundamentals in ML, systems, statistics, or software engineering.
- •Hands-on experience with LLMs, RL, RLHF/RLAIF, post-training, or production ML systems.
- •Ability to run end-to-end experiments: hypothesis, pipeline, run, analyze, and act.
- •Cross-functional communication across research, product, infra, data, and safety.
Tech stack
LLMsPython
Match insights
Tech:LLMs, Python
Level:Mid
Location:San Francisco, United States