Zikai Zhou

Klayand

https://klayand.github.io/

AI & ML interests

Knowledge Distillation, Generative Models

Organizations

None yet

upvoted 3 papers 4 days ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

Paper • 2606.09076 • Published 7 days ago • 57

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 6 days ago • 40

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 6 days ago • 183

upvoted 2 papers 11 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 14 days ago • 119

Qwen-Image-Flash: Beyond Objective Design

Paper • 2606.03746 • Published 13 days ago • 35

upvoted 2 papers 14 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published 18 days ago • 38

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published 17 days ago • 60

upvoted a paper 17 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 18 days ago • 140

upvoted 3 papers 21 days ago

Geo-Align: Video Generation Alignment via Metric Geometry Reward

Paper • 2605.23903 • Published 24 days ago • 10

StepAudio 2.5 Technical Report

Paper • 2605.23463 • Published 24 days ago • 49

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published 26 days ago • 110

upvoted 6 papers about 1 month ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 86

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 60

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 110

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Paper • 2605.06376 • Published May 7 • 27

D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models

Paper • 2605.05204 • Published May 6 • 27

Lightning Unified Video Editing via In-Context Sparse Attention

Paper • 2605.04569 • Published May 6 • 18

upvoted a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 118

upvoted a paper 2 months ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

upvoted a paper 3 months ago

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published Mar 30 • 58