AI
Starstrek
Stars321123
AI & ML interests
AI
Recent Activity
upvoted a paper 14 minutes ago
Direct Preference Optimization: Your Language Model is Secretly a Reward
Model liked a model 14 minutes ago
VladShash/deepseek-math-7b-lean-prover-dpo-olmo-3 upvoted a collection 16 minutes ago
Stuff I'm going to read