DONGFANG ZIHAO's picture

4 3

DONGFANG ZIHAO

UUUserna

·

UUUserna

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

upvoted a paper 3 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

upvoted a paper 3 months ago

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

View all activity

Organizations

None yet

upvoted a paper 12 days ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published 18 days ago • 18

upvoted 2 papers 3 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29, 2025 • 16

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Paper • 2510.07143 • Published Oct 8, 2025 • 12

upvoted a paper 4 months ago

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Paper • 2509.12989 • Published Sep 16, 2025 • 28

updated a dataset 6 months ago

UUUserna/OSR-Bench

Viewer • Updated Jul 30, 2025 • 4.1k • 1.47k • 3

published a dataset 8 months ago

UUUserna/OSR-Bench

Viewer • Updated Jul 30, 2025 • 4.1k • 1.47k • 3

liked 2 models 10 months ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 2.39M • • 1.42k

OpenGVLab/InternVL2_5-8B

Image-Text-to-Text • 8B • Updated Mar 25, 2025 • 35.8k • 97

liked a model over 1 year ago

ProsusAI/finbert

Text Classification • Updated May 23, 2023 • 2.12M • • 1.06k