Boyuan Sun's picture

Boyuan Sun

BBBBCHAN

·

https://bbbbchan.github.io/

BBBBCHAN

AI & ML interests

None yet

Organizations

None yet

authored 8 papers 6 months ago

Depth Anything at Any Condition

Paper • 2507.01634 • Published Jul 2 • 49

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Paper • 2506.21862 • Published Jun 27 • 36

HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context

Paper • 2506.21277 • Published Jun 26 • 14

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Paper • 2501.15111 • Published Jan 25 • 1

Towards RAW Object Detection in Diverse Conditions

Paper • 2411.15678 • Published Nov 24, 2024 • 1

LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding

Paper • 2501.05067 • Published Jan 9 • 1

Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness

Paper • 2501.07978 • Published Jan 14 • 1

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

Paper • 2306.04300 • Published Jun 7, 2023 • 2