kang

qiyue

AI & ML interests

None yet

Recent Activity

liked a Space 4 days ago

OpenEvals/evaluation-guidebook

upvoted an article 8 days ago

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

upvoted a paper 18 days ago

Kimi Linear: An Expressive, Efficient Attention Architecture

View all activity

Organizations

None yet

liked a Space 4 days ago

Evaluation Guidebook

📝

216

Display benchmark evaluation data for LLMs

upvoted an article 8 days ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

•

111

upvoted a paper 18 days ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 119

upvoted an article about 1 month ago

Article

Diffusers welcomes FLUX-2

Nov 25

•

165

upvoted an article about 2 months ago

Article

What makes good reasoning data

Oct 30

•

upvoted a paper about 2 months ago

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27 • 29

upvoted an article 11 months ago

Article

Open R1: Update #2

Feb 10

•

218

liked a model 12 months ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 901k • • 4.01k

upvoted an article about 1 year ago

Article

Hugging Face welcomes the Aya Expanse family of multilingual models

Oct 24, 2024

•

upvoted a paper over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

liked a model over 1 year ago

mistralai/Mistral-Small-Instruct-2409

22B • Updated Jul 28 • 10.2k • 393

upvoted an article over 1 year ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

Aug 21, 2024

•

upvoted a paper over 1 year ago

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18, 2024 • 17

upvoted 2 articles over 1 year ago

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

Jul 11, 2024

•

Article

The Rise of Agentic Data Generation

Jul 15, 2024

•

liked a dataset over 1 year ago

tasksource/tasksource_dpo_pairs

Viewer • Updated Jul 1, 2024 • 5.13M • 844 • 21

upvoted an article over 1 year ago

Article

Putting RL back in RLHF

Jun 12, 2024

•

109

liked 3 datasets over 1 year ago

kang

AI & ML interests

Recent Activity

Organizations

qiyue's activity

Evaluation Guidebook

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Diffusers welcomes FLUX-2

What makes good reasoning data

Open R1: Update #2

Hugging Face welcomes the Aya Expanse family of multilingual models

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

RegMix: Data Mixture as Regression for Language Model Pre-training

The Rise of Agentic Data Generation

Putting RL back in RLHF