Brian Christian
brianchristian
AI & ML interests
None yet
Recent Activity
published a dataset about 1 month ago
self-model/sycophancy-two-sides-eval published a dataset about 1 month ago
self-model/discrim-eval-templated updated a collection 4 months ago
Reward Models Inherit Value Biases from Pretraining ICLR2026