LinkedIn

company

Verified

https://www.linkedin.com

AI & ML interests

None defined yet.

Recent Activity

pb09204048 submitted a paper about 10 hours ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

JasonZhu13 published an article about 1 month ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

pb09204048 authored a paper about 1 month ago

Debunk the Myth of SFT Generalization

View all activity

Papers

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

View all Papers

Articles

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

models 0

None public yet

datasets 0

None public yet