MME-Benchmarks

non-profit

Activity Feed

AI & ML interests

Multimodal LLMs

Recent Activity

THUdyh authored a paper about 1 hour ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

THUdyh authored a paper about 1 hour ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

THUdyh authored a paper about 1 hour ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Papers

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

THUdyh

authored 4 papers about 1 hour ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 10 days ago • 39

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

Paper • 2606.27313 • Published 3 days ago • 37

BradyFU

submitted a paper to Daily Papers 5 days ago

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

Paper • 2606.21649 • Published 9 days ago • 31

yifanzhang114

submitted a paper to Daily Papers 13 days ago

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Paper • 2606.14702 • Published 16 days ago • 31

EliYuan00

updated a dataset 16 days ago

MME-Benchmarks/Video-MME-v2

Benchmark • Updated 16 days ago • 3.2k • 4.19k • 43

THUdyh

authored a paper about 1 month ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published May 27 • 75

BradyFU

submitted a paper to Daily Papers 2 months ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 46

THUdyh

updated a dataset 2 months ago

MME-Benchmarks/Video-MME-v2

Benchmark • Updated 16 days ago • 3.2k • 4.19k • 43

yifanzhang114

updated a dataset 3 months ago

MME-Benchmarks/Video-MME-v2

Benchmark • Updated 16 days ago • 3.2k • 4.19k • 43

EliYuan00

authored a paper 3 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

yifanzhang114

authored 4 papers 3 months ago

VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents

Paper • 2603.16289 • Published Mar 17 • 1

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Paper • 2603.29620 • Published Mar 31 • 49

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Paper • 2604.03016 • Published Apr 3 • 37

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

BradyFU

authored 4 papers 3 months ago

Team members 4