AccDiffusion
Generate high-resolution images from text prompts
A collection of my Hugging Face Spaces featured as Space of the Week or highlighted by the Hugging Face team.
Apache Licensed Advanced Video Generation Model
Animation Sketches sequence Colorization
Aesthetically Controllable Text-Driven Stylization w/o Train
Audio Gen, Audio Style Transfer and Audio InPainting
Generate detailed prompts describing an image
3D/4D Scenes from a Single Image w/ Controllable Video Diff
Quickly edit the expression of a face
High-Fidelity Simultaneous Speech-To-Speech Translation
Generates audio environment from an image
Get a music sample inspired by the mood of an image
Generate a story from an uploaded image
Kalman-Inspired Feature Propagation for Video Face Super-Res
Audio Conditioned LipSync with Latent Diffusion Models
Generate live captions for your webcam video
Audio-Driven Multi-Person Conversational Video Generation
Magnify subject details and enhance image quality
Repair video frames by removing unwanted elements
Train Free Personalization Diff w/ Stochastic Optimal Control
Text-to-video with Reward Forcing
Get Music from Generated Spectrogram with Diffusion
Deeply interrogate audio file content
Segment objects in a video with click-based masks
Generate music from audio tracks
A Lightweight and Plug-and-Play Identity Control for Video G
Unified Framework for Generalized Video Face Restoration
Colorize grayscale images using automatic captions
Extraction & Reconstruction for Efficient Speech Separation
Generate Talking avatars from Text-to-Speech
Audio-based Lip Sync for Talking Head Video Editing
Generates a sound effect that matches video shot
Combine voice cloning and portrait lipsync animation
Expressive Portrait Animation w/ Hierarchical Motion Attention
Generate music from lyrics and genre tags
Watermark-free ModelScope-based video generation
Zero-Shot Material Transfer from a Single Image