BAIR
2024 BAIR Graduate Directory
10 months ago
Rethinking the Role of PPO in RLHF
12 months ago
Training Diffusion Models with Reinforcement Learning
12 months ago
Goal Representations for Instruction Following
12 months ago