Tang Zhenyu

Tzy010822

Tzy010822

AI & ML interests

computer vision

Recent Activity

upvoted an article 3 days ago

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

upvoted a paper 11 days ago

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

upvoted a paper 11 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

View all activity

Organizations

upvoted an article 3 days ago

Article

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

6 days ago

•

upvoted 2 papers 11 days ago

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Paper • 2601.09697 • Published 11 days ago • 8

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 11 days ago • 124

upvoted a paper 13 days ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published 13 days ago • 51

upvoted 3 papers 2 months ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25, 2025 • 40

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Paper • 2511.20561 • Published Nov 25, 2025 • 32

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 69

upvoted a paper 4 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 96

upvoted a paper 5 months ago

GenCompositor: Generative Video Compositing with Diffusion Transformer

Paper • 2509.02460 • Published Sep 2, 2025 • 26

updated a model 9 months ago

Tzy010822/unified_original_cfg

Updated May 8, 2025

published a model 9 months ago

Tzy010822/unified_original_cfg

Updated May 8, 2025

upvoted a paper 10 months ago

NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations

Paper • 2503.23162 • Published Mar 29, 2025 • 10

upvoted a paper over 1 year ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 69

liked a Space over 1 year ago

MeshFormer

🌟

Generate 3D mesh from an image

upvoted 2 papers over 1 year ago

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Paper • 2407.19548 • Published Jul 28, 2024 • 27

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 94

liked a dataset over 1 year ago

ShareGPT4Video/ShareGPT4Video

Viewer • Updated Mar 7, 2025 • 40.2k • 3.95k • 200

upvoted a paper over 1 year ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 74

liked a model almost 2 years ago

internlm/internlm-xcomposer2-4khd-7b

Visual Question Answering • Updated Apr 18, 2024 • 1.71k • 73

updated a model almost 2 years ago

Tzy010822/caption

1B • Updated Apr 20, 2024

Tang Zhenyu

AI & ML interests

Recent Activity

Organizations

Tzy010822's activity

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

MeshFormer