BAIR
2024 BAIR Graduate Directory
8 months ago
Rethinking the Role of PPO in RLHF
11 months ago
Training Diffusion Models with Reinforcement Learning
11 months ago
Goal Representations for Instruction Following
11 months ago