Raul

lefutureman

rally12

AI & ML interests

LLM, CV

Recent Activity

upvoted a collection about 1 month ago

DeepSeek-V4

upvoted a paper about 1 month ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

liked a model 5 months ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Organizations

None yet

upvoted a collection about 1 month ago

DeepSeek-V4

Collection

4 items • Updated Apr 24 • 685

upvoted a paper about 1 month ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Paper • 2603.19312 • Published Mar 13 • 48

liked 2 models 5 months ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated Dec 3, 2025 • 1.69M • 1.11k

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated Jan 29 • 1.9M • 1.62k

upvoted a paper 5 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

liked 2 models 5 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 283k • 2.56k

numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated 14 days ago • 21.3k • 475

liked a Space 8 months ago

The Smol Training Playbook

📚

3.21k

The secrets to building world-class LLMs

liked 2 models 8 months ago

nvidia/DLER-R1-1.5B-Research

2B • Updated Oct 25, 2025 • 173 • 19

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 138k • 1.2k

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.89k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

Babelscape/t5-base-summarization-claim-extractor

0.2B • Updated Jan 22 • 3.65k • 15

liked a dataset about 1 year ago

HPLT/HPLT2.0_cleaned

Updated 8 days ago • 22.4k • 43

liked 2 models over 1 year ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 10.1M • 503

Weyaxi/Qwen-72B-Llama

Text Generation • 72B • Updated Feb 2, 2024 • 81 • 12

liked 3 models about 2 years ago

liked a dataset about 2 years ago

uonlp/CulturaX

Viewer • Updated Dec 16, 2024 • 7.18B • 20k • 641

liked a model about 2 years ago

NYTK/translation-mt5-small-128-en-hu

Translation • Updated Jan 31, 2023 • 14 • 2