Hwang yechan PRO
SoonOk
AI & ML interests
AI&ML&ReinforcementLearning&DeepRL &DeepLearning
Recent Activity
liked a dataset 28 days ago
ReneeYe/werewolf_game_reasoning upvoted a paper 2 months ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models