Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
rayruiyang 's Collections
VST
Haplo-VL

VST

updated Feb 1

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.

Upvote
6

  • rayruiyang/VST-3B-RL

    Image-Text-to-Text • 4B • Updated Nov 11, 2025 • 65 • 3

  • rayruiyang/VST-3B-SFT

    Image-Text-to-Text • 4B • Updated Nov 11, 2025 • 74

  • rayruiyang/VST-7B-SFT

    Image-Text-to-Text • 8B • Updated Nov 11, 2025 • 753

  • rayruiyang/VST-7B-RL

    Image-Text-to-Text • 8B • Updated Nov 11, 2025 • 289

  • Visual Spatial Tuning

    Paper • 2511.05491 • Published Nov 7, 2025 • 53

  • rayruiyang/vst_3d_grounding_benchmark

    Preview • Updated Feb 1 • 39
Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs