Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2502.13923

AI Paper of the Day

A collection of papers that I think are interesting, one added each day

about 10 hours ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 23
OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

Personal Favourites

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

Vision Language Models: 2025 Update

This collection includes all the models, datasets and Spaces mentioned in the blog Vision Language Models: 2025 Update

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 148k • 1.83k
Running

Featured

363

Qwen2.5 Omni 7B Demo

🏆

363

Generate text and speech from text, audio, images, and videos
Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5 • 102k • 1.27k

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Paper • 2503.10615 • Published Mar 13 • 17
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Paper • 2503.10630 • Published Mar 13 • 6
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 88

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429
Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 264

LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update

Paper • 2106.13914 • Published Jun 26, 2021 • 1
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

Paper • 2506.15196 • Published Jun 18 • 3
Ascend HiFloat8 Format for Deep Learning

Paper • 2409.16626 • Published Sep 25, 2024 • 1
Recipes for Pre-training LLMs with MXFP8

Paper • 2506.08027 • Published May 30 • 1

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211
Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 54

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 123
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 34
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14 • 18
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models

Paper • 2502.08130 • Published Feb 12 • 9

AI Paper of the Day

A collection of papers that I think are interesting, one added each day

about 10 hours ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 23
OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

Personal Favourites

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429
Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 264

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update

Paper • 2106.13914 • Published Jun 26, 2021 • 1
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

Paper • 2506.15196 • Published Jun 18 • 3
Ascend HiFloat8 Format for Deep Learning

Paper • 2409.16626 • Published Sep 25, 2024 • 1
Recipes for Pre-training LLMs with MXFP8

Paper • 2506.08027 • Published May 30 • 1

Vision Language Models: 2025 Update

This collection includes all the models, datasets and Spaces mentioned in the blog Vision Language Models: 2025 Update

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 148k • 1.83k
Running

Featured

363

Qwen2.5 Omni 7B Demo

🏆

363

Generate text and speech from text, audio, images, and videos
Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5 • 102k • 1.27k

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211
Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 54

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Paper • 2503.10615 • Published Mar 13 • 17
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Paper • 2503.10630 • Published Mar 13 • 6
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 88

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 123
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 34
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14 • 18
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models

Paper • 2502.08130 • Published Feb 12 • 9

Previous
1
2
3
4
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs