arxiv:2508.19205
zhiliang
zzliang
AI & ML interests
multimodal
Recent Activity
upvoted a paper about 1 month ago
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding upvoted a paper about 2 months ago
Online Experiential Learning for Language Models liked a Space 3 months ago
microsoft/VibeVoice-ASR