Generate a video from an image with a text prompt
Generate a video by interpolating between two images with a prompt
Audio-Conditioned LipSync with Latent Diffusion Models
Generate a video from images with different modes