"A beautiful woman, with a pink and platinum-colored ombre mohawk, facing the camera, wearing a composition of bubble wrap, cyberpunk jacket." | "A fat rabbit wearing a purple robe walking through a fantasy landscape." | "A girl is writing something on a book. Oil painting style." | "A girl with a hairband performing a song with her guitar on a warm evening at a local market, children's story book." |
"A group of mongooses scuttle about, set against a desert backdrop, bathed in bright and warm earth tones." | "A lone traveller walks in a misty forest." | "A medieval witch making a poison." | "A monkey making latte art." |
"A panda standing on a surfboard, in the ocean in sunset, 4k, high resolution." | "A polar bear is playing guitar." | "A strong American cowboy with dark skin stands in front of a chair." | "A young, beautiful girl in a pink dress is playing piano gracefully." |
"An old-fashioned windmill surrounded by flowers, 3D design." | "At a tranquil lake, a white swan gracefully glides on the surface, its reflection dancing on the water, seen in a medium shot." | "Hulk wearing virtual reality goggles, 4k, high resolution." | "Ironman flying over a burning city, very detailed surroundings, cities are blazing, shiny iron man suit, realistic, 4k ultra high defi." |
"A giant dragon sitting in a snow covered landscape, breathing fire." | "A large blob of exploding splashing rainbow paint, with an apple emerging, 8k." | "A panda taking a selfie." | "A walking figure made out of water." |
"An elephant wearing a birthday hat walking on the beach." | "Flag of the US on top of a tall white mountain." | "Robot emerging from a large column of billowing black smoke, high quality." | "Teddy bears holding hands, walking down rainy 5th ave." |
MagicVideo-V2 | SVD-XT | Pika 1.0 | ||
Gen-2 |
"Traveler walking alone in the misty forest at sunset." |
"LEGO, standing Darth Vader super mario." |
"1910s sitcom of everyday life and routines in society." |
"Ironman flying over a burning city, very detailed surroundings, cities are blazing, shiny iron man suit, realistic, 4k ultra high defi." |
"Muppet walking down the street in a red shirt, cinematic, 8k." |
"A little boy is riding a bike on a park path, the wheels crunching on the gravel." |
"In the swamp, a crocodile stealthily surfaces, revealing only its eyes and the tip of its nose as it moves forward." |
"A fox dressed in suit dancing in park." |
"A fat rabbit wearing a purple robe walking through a fantasy landscape." |
"A panda standing on a surfboard, in the ocean in sunset, 4k, high resolution." |
"Burning chicken running around on fire." |
"Flying through an intense battle between pirate ships in a stormy ocean." |
The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field. In this work, we introduce MagicVideo-V2 that integrates the text-to-image model, video motion generator, reference image embedding module and frame interpolation module into an end-to-end video generation pipeline. Benefiting from these architecture designs, MagicVideo-V2 can generate an aesthetically pleasing, high-resolution video with remarkable fidelity and smoothness. It demonstrates superior performance over leading Text-to-Video systems such as Runway, Pika 1.0, Morph, Moon Valley and Stable Video Diffusion model via user evaluation at large scale.