1 23 5

Tian Shulin

shulin16

https://shulin16.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

upvoted an article 1 day ago

Welcome Gemma 4: Frontier multimodal intelligence on device

upvoted a paper 1 day ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

View all activity

Organizations

upvoted a paper about 18 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 3 days ago • 193

upvoted an article 1 day ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

7 days ago

•

783

upvoted a paper 1 day ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published 3 days ago • 32

upvoted a paper 3 days ago

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published 7 days ago • 66

upvoted 2 papers 7 days ago

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Paper • 2603.26653 • Published 12 days ago • 17

HippoCamp: Benchmarking Contextual Agents on Personal Computers

Paper • 2604.01221 • Published 7 days ago • 27

upvoted a paper 16 days ago

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2603.18118 • Published 21 days ago • 12

upvoted a paper 29 days ago

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published Mar 3 • 62

upvoted a paper 30 days ago

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published Mar 8 • 85

upvoted a paper about 2 months ago

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

upvoted a paper 5 months ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 70

upvoted 4 papers 6 months ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15, 2025 • 11

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Paper • 2510.05094 • Published Oct 6, 2025 • 38

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

Paper • 2509.24897 • Published Sep 29, 2025 • 46

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29, 2025 • 37

upvoted a paper 7 months ago

On the Theoretical Limitations of Embedding-Based Retrieval

Paper • 2508.21038 • Published Aug 28, 2025 • 21

upvoted 2 papers 8 months ago

EgoTwin: Dreaming Body and View in First Person

Paper • 2508.13013 • Published Aug 18, 2025 • 21

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Paper • 2508.13154 • Published Aug 18, 2025 • 62

upvoted a paper 9 months ago

PhysX: Physical-Grounded 3D Asset Generation

Paper • 2507.12465 • Published Jul 16, 2025 • 44

upvoted a paper 10 months ago

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Paper • 2506.13654 • Published Jun 16, 2025 • 43

Tian Shulin

AI & ML interests

Recent Activity

Organizations

shulin16's activity

Welcome Gemma 4: Frontier multimodal intelligence on device