3 114 6

Donghao Zhou

donghao-zhou

https://correr-zhou.github.io

Correr-Zhou

AI & ML interests

Generative AI

Recent Activity

upvoted a paper 3 days ago

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

upvoted a paper 4 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

upvoted a paper 10 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

View all activity

Organizations

upvoted a paper 3 days ago

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published 7 days ago • 62

upvoted a paper 4 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 7 days ago • 131

upvoted a paper 10 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 11 days ago • 60

upvoted a paper 13 days ago

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published 19 days ago • 21

upvoted a paper 16 days ago

MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

Paper • 2603.17117 • Published 18 days ago • 87

upvoted a paper 26 days ago

WildActor: Unconstrained Identity-Preserving Video Generation

Paper • 2603.00586 • Published Feb 28 • 37

submitted a paper to Daily Papers 30 days ago

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Paper • 2603.02210 • Published Mar 2 • 29

upvoted a paper about 1 month ago

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 183

authored a paper about 1 month ago

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Paper • 2603.02210 • Published Mar 2 • 29

upvoted 2 papers about 1 month ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 102

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Paper • 2603.02210 • Published Mar 2 • 29

updated a dataset about 1 month ago

donghao-zhou/HP-Image-40K

Updated Mar 3 • 46 • 4

published a dataset about 1 month ago

donghao-zhou/HP-Image-40K

Updated Mar 3 • 46 • 4

authored a paper about 1 month ago

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

Paper • 2602.19895 • Published Feb 23 • 13

upvoted 4 papers about 1 month ago

Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Paper • 2602.18422 • Published Feb 20 • 30

upvoted 2 papers about 2 months ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Paper • 2602.06949 • Published Feb 6 • 36

Donghao Zhou

AI & ML interests

Recent Activity

Organizations

donghao-zhou's activity