Yibo Li

liushiliushi

https://liushiliushi.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 13 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper 28 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Organizations

None yet

upvoted 2 papers 13 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 15 days ago • 141

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 14 days ago • 82

upvoted a paper 28 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 186

liked 2 models 4 months ago

liushiliushi/ConfTuner-Qwen

8B • Updated Sep 19, 2025 • 4 • 2

liushiliushi/ConfTuner-Ministral

Text Generation • 8B • Updated Sep 20, 2025 • 1 • 3

updated a model 4 months ago

liushiliushi/ConfTuner-Ministral

Text Generation • 8B • Updated Sep 20, 2025 • 1 • 3

New activity in liushiliushi/ConfTuner-Ministral 4 months ago

Improve model card: Add pipeline tag, library, description, and usage instructions

#1 opened 4 months ago by

nielsr

authored 3 papers 4 months ago

updated 2 models 4 months ago

liushiliushi/ConfTuner-LLaMA

8B • Updated Sep 19, 2025 • 49

liushiliushi/ConfTuner-Qwen

8B • Updated Sep 19, 2025 • 4 • 2

upvoted a paper 4 months ago

ConfTuner: Training Large Language Models to Express Their Confidence Verbally

Paper • 2508.18847 • Published Aug 26, 2025 • 2

updated a model 7 months ago

liushiliushi/Qwen2.5-7B-Instruct_gpt

8B • Updated Jun 18, 2025 • 1

published 2 models 7 months ago

liushiliushi/ConfTuner-LLaMA

8B • Updated Sep 19, 2025 • 49

liushiliushi/Qwen2.5-7B-Instruct_gpt

8B • Updated Jun 18, 2025 • 1

updated 2 models 7 months ago

liushiliushi/Llama-3.1-8B-Instruct_gpt

8B • Updated Jun 18, 2025

liushiliushi/llama-uncertainty

8B • Updated Jun 18, 2025

published 2 models 7 months ago

liushiliushi/Llama-3.1-8B-Instruct_gpt

8B • Updated Jun 18, 2025

liushiliushi/ConfTuner-Ministral

Text Generation • 8B • Updated Sep 20, 2025 • 1 • 3
