dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

upvoted an article 3 days ago

🐯 Liger GRPO meets TRL

upvoted a paper 3 days ago

Kimi K2.5: Visual Agentic Intelligence

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

Paper • 2602.04442 • Published 1 day ago • 4

upvoted an article 3 days ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

upvoted 3 papers 3 days ago

upvoted a paper 6 days ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 13 days ago • 40

upvoted an article 7 days ago

Article

Small Language Models (SLM): A Comprehensive Overview

Feb 22, 2025

•

128

upvoted an article 9 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.06k

upvoted an article 21 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Jan 2

•

upvoted an article 22 days ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

upvoted a paper about 1 month ago

VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

Paper • 2601.02256 • Published Jan 5 • 33

upvoted a paper about 2 months ago

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 43

upvoted a collection about 2 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 148

upvoted a paper about 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 295

upvoted 2 papers 2 months ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 90

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 119

upvoted a paper 3 months ago

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published Oct 22, 2025 • 69

upvoted a paper 4 months ago

Drawing2CAD: Sequence-to-Sequence Learning for CAD Generation from Vector Drawings

Paper • 2508.18733 • Published Aug 26, 2025 • 10

upvoted a collection 4 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 621

upvoted a paper 4 months ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 54

dfuhoiysOHSVFh82934gfjklb

AI & ML interests

Recent Activity

Organizations

huba-buba's activity

🐯 Liger GRPO meets TRL

Small Language Models (SLM): A Comprehensive Overview

Mixture of Experts Explained

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

From GRPO to DAPO and GSPO: What, Why, and How