VidGen-1M: A Large-Scale Dataset for Text-to-video Generation
Paper
•
2408.02629
•
Published
•
15
We trained an excellent video generation model based on the VidGen-1M, a superior training dataset for text-to-video models. Produced through a coarse-to-fine curation strategy, this dataset guarantees high-quality videos and detailed captions with excellent temporal consistency. When used to train the video generation model