ldwang's picture

ldwang

ldwang

·

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

upvoted a paper 2 days ago

SWE-smith: Scaling Data for Software Engineering Agents

liked a model 2 days ago

zai-org/GLM-4.7

liked a dataset 3 days ago

nvidia/Nemotron-Math-v2

View all activity

Organizations

upvoted a paper 2 days ago

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30, 2025 • 12

upvoted a paper 7 days ago

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published 17 days ago • 101

upvoted a paper 15 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 17 days ago • 255

upvoted a collection 23 days ago

Molmo2 Data

Artifacts for the Molmo2 data release • 16 items • Updated 25 days ago • 31

upvoted 3 papers about 1 month ago

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Paper • 2512.02551 • Published Dec 2, 2025 • 12

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 291

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 47

upvoted a paper about 2 months ago

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21, 2025 • 13

upvoted 3 papers 2 months ago

Motif 2 12.7B technical report

Paper • 2511.07464 • Published Nov 7, 2025 • 39

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 128

upvoted a collection 3 months ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 23 days ago • 73

upvoted 2 papers 3 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28, 2025 • 40

upvoted a collection 3 months ago

Reasoning Efficiency Research

Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 1 day ago • 11

upvoted an article 3 months ago

Article

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

+9

Sep 16, 2025

•

47

upvoted a paper 3 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 67

upvoted an article 3 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

296

upvoted a paper 3 months ago

CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20, 2025 • 19

upvoted a collection 3 months ago

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated May 5, 2025 • 15