Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Boyuan Sun's picture
15 14 5

Boyuan Sun

BBBBCHAN
21world's profile picture natalie5's profile picture lalala125's profile picture
·
https://bbbbchan.github.io/
  • BBBBCHAN

AI & ML interests

None yet

Organizations

None yet

authored 8 papers 6 months ago

Depth Anything at Any Condition

Paper • 2507.01634 • Published Jul 2 • 49

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Paper • 2506.21862 • Published Jun 27 • 36

HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context

Paper • 2506.21277 • Published Jun 26 • 14

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Paper • 2501.15111 • Published Jan 25 • 1

Towards RAW Object Detection in Diverse Conditions

Paper • 2411.15678 • Published Nov 24, 2024 • 1

LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding

Paper • 2501.05067 • Published Jan 9 • 1

Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness

Paper • 2501.07978 • Published Jan 14 • 1

CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation

Paper • 2306.04300 • Published Jun 7, 2023 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs