MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation (2401.04468v1)
Abstract: The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field. In this work, we introduce MagicVideo-V2, which integrates a text-to-image model, a video motion generator, a reference image embedding module, and a frame interpolation module into an end-to-end video generation pipeline. Benefiting from these architectural designs, MagicVideo-V2 can generate aesthetically pleasing, high-resolution videos with remarkable fidelity and smoothness. In a large-scale user evaluation, it outperforms leading text-to-video systems such as Runway, Pika 1.0, Morph, Moon Valley, and the Stable Video Diffusion model.
- Gen-2. https://research.runwayml.com/gen2. Accessed: 2023-11-16.
- MoonValley. https://moonvalley.ai/. Accessed: 2023-11-16.
- Morph. https://www.morphstudio.com/. Accessed: 2023-11-16.
- Pika 1.0. https://pika.art/. Accessed: 2023-12-26.
- SVD-XT. https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt. Accessed: 2023-11-27.
- Stable video diffusion: Scaling latent video diffusion models to large datasets, 2023.
- Multiple video frame interpolation via enhanced deformable separable convolution, 2021.
- LDMVFI: Video frame interpolation with latent diffusion models, 2023.
- Emu Video: Factorizing text-to-video generation by explicit image conditioning, 2023.
- AnimateDiff: Animate your personalized text-to-image diffusion models without specific tuning, 2023.
- VideoPoet: A large language model for zero-shot video generation, 2023.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- Extracting motion and appearance via inter-frame attention for efficient video frame interpolation. In CVPR, 2023a.
- Adding conditional control to text-to-image diffusion models, 2023b.
- MagicVideo: Efficient video generation with latent diffusion models, 2023.