Built by Shengshu Technology with Tsinghua University Vidu is made to speed up video creation for all sorts of uses - film animation ads you name it.
Vidu AI works in three ways:
Text to Video. Type in a description and get a matching video.
Image to Video. Feed it a picture and watch it move.
Reference to Video. Keeps characters objects and backgrounds consistent using a reference.
One of its big claims? Speed. Vidu AI says it can spit out a video in lower resolution in just 10 seconds. And it's not just about being fast—it understands meaning quite so the results actually match what you asked for in your prompt. Plus it's designed to make motion look natural no stiff robotic movements.
The latest update from Vidu, bringing forth the multiple elements videos powered by Q1 model is on another level. The crispiness, the fidelity, the details preservation is amagzing and a big leap from company's previous quality.
Use Vidu invite code 1bc88288 for bonus credits + new user perks!
Educators and TrainersCreative ProfessionalsContent CreatorsMedia and Film MakersMarketing and Branding SpecialistsDevelopers and Tech CreatorsSmall Business OwnersEntertainment and Performance ArtistsProfessional Content Creators
* Unspecified amount of the bonus credits monthly renewal
Standard
lowest
10.00
800
4.00
Allowance per month: 3.33 minutes.
* High-resolution video generation. 8s video generation. 50 references/month. Fast channel generation. Credits Purchase. Commercial use allowed. Download without watermark. Generate 4 videos at once. Access to the Prompt Editing Pro Mode
Premium
medium
35.00
4000
4.00
Allowance per month: 16.67 minutes.
* High-resolution video generation. 8s video generation. 100 references/month. Fast channel generation. Credits Purchase. Commercial use allowed. Download without watermark. Generate 4 videos at once. Access to the Prompt Editing Pro Mode. Early feature access
Ultimate
top
95.00
8000
4.00
Allowance per month: 33.33 minutes.
* High-resolution video generation. 8s video generation. 100 references/month. Ultra-Fast channel generation. Credits Purchase. Commercial use allowed. Download without watermark. Generate 4 videos at once. Access to the Prompt Editing Pro Mode. Early feature access. Generate unlimited videos in off-peak mode
Where multiple modes are available, the calculations are done for the most advanced (and costly) ones.
Pricing can change, make sure to check relevant links for any updates to the subscription plans.
A cute, high-quality miniature figurine Christmas ornament inspired by the attached subject / pet reference image, hanging from a beautifully decorated Christmas tree.
The subject is clearly a small premium toy-like figurine, not a real animal — slightly stylized proportions (subtle chibi influence: gently rounded head, simplified paws, softened edges), while accurately preserving the subject’s unique facial features, markings, ear shape, and expression so the likeness remains instantly recognizable.
The figurine is made from clearly artificial materials — painted resin / polymer clay / molded plastic, with visible handcrafted texture, tiny brush strokes, soft seams, and a smooth satin finish.
Fur is sculpted, not real: simplified grooves and embossed shapes instead of individual hairs.
The ornament hangs by a small metallic hook attached to the figurine’s head but concealed with a smal red ribbon bow, making it unmistakably a decorative object.
Scale is obvious: the figurine is palm-sized, surrounded by oversized pine needles, fairy lights, and glass baubles to reinforce its miniature nature.
Cinematic macro product photography, shallow depth of field with warm festive bokeh.
Soft studio-style holiday lighting — warm key light, gentle fill, subtle rim light to outline the figurine’s silhouette.
Shot at ornament eye-level, 85mm macro lens look, ultra-clean focus on the figurine while the background tree softly blurs.
Explicit constraints:
– not a real animal
– not lifelike fur or skin
– no biological realism
– clearly a toy, figurine, or collectible ornament
Style: cute but premium, Pixar-adjacent holiday décor, collectible toy photography
Mood: cozy, magical, wholesome, festive
Detail level: high, but intentionally stylized
@image1 This subject @image2 presented as a premium collectible figurine on a designer’s desk. Scene: • Center: a realistic 1/6-scale figurine of the subject on a clear round stand, natural museum pose one foot slightly forward. Keep their current outfit, hair, and accessories exactly as in the source. • Right: an upright glossy retail box showing the same subject as box art matching outfit and look, with brand AI Creators Tools; clean typography and a small authenticity sticker. • Left: a widescreen monitor displaying the subject as a grayscale digital sculpt/turntable in a 3D app UI that clearly matches the figurine. • Desk props: keyboard, mouse pad, a couple of notes; tidy and minimal. Lighting & look: • Bright natural studio daylight from windows, soft shadows, subtle tabletop reflections. • Photoreal materials plastics, crisp print on the box, no duplicates or mismatches between figure, box, and monitor. Style tags: photoreal, product photography, studio lighting, sharp focus, clean composition
So yes, if you upload an additional clear headshot of your character it improves facial features clarity. Box size is now too small again (because model is huge)
@image1This subject presented as a premium collectible figurine on a designer’s desk. Scene: • Center: a realistic 1/6-scale figurine of the subject on a clear round stand, natural museum pose one foot slightly forward. Keep their current outfit, hair, and accessories exactly as in the source. • Right: an upright glossy retail box showing the same subject as box art matching outfit and look, with brand AI Creators Tools; clean typography and a small authenticity sticker. • Left: a widescreen monitor displaying the subject as a grayscale digital sculpt/turntable in a 3D app UI that clearly matches the figurine. • Desk props: keyboard, mouse pad, a couple of notes; tidy and minimal. Lighting & look: • Bright natural studio daylight from windows, soft shadows, subtle tabletop reflections. • Photoreal materials plastics, crisp print on the box, no duplicates or mismatches between figure, box, and monitor. Style tags: photoreal, product photography, studio lighting, sharp focus, clean composition
An improvement over Q1 that the box isn't much smaller than the figurine itself. Her facial features aren't too crisp, likely could be helped with an additional reference image with her closeup portrait.
@image1 A couple sits at a small white iron table outside café. They hold hands and look at each other. The shot stays steady with a light film-like grain. It starts focused on the couple, then shifts to the back. A man with a suitcase walks into view. His face shows shock. That changes the mood fast. The street has striped awnings, café chairs, and a busy but quiet flow of people. You hear street sounds, some footsteps, and soft clinks of dishes. No traffic noise or music. While all this happens, the woman says, “He’s in New York till Friday, darling.” The man says, “So I can have you all to myself.”
@image1 is walking forward on a street from@image2, then slightly leans forward to look straight into the camera, waves and says: "Hi, I'm just testing Vidu's new video model!"
Soundscape: Footsteps, sea waves.
Generated on October 22, 2025:
Test with 2 images and speech/sound in text prompt. Resemblance is there, text on t-shirt and fabric quality - all preserved. But refuses to speak the full sentence, this is the 2nd attempt and same result.
A muscular man stands confidently, arms crossed, wearing a bold purple sweatshirt emblazoned with "PUNCH ME". Text "WW.AICREATORS.TOOLS" on the wall behind him.
Suddenly, a fist enters frame and strikes his head near his cheek from the left.
The impact unfolds, a shockwave running through his body.
The camera captures his expression shifts—surprise shifting into resilience, his eyes blazing with intensity.
Ambient light catches the sweat glistening on his skin, crafting a gritty, dynamic atmosphere. Kinetic, dynamic, cinematic.
Done many variations of this prompt but this is the closest I could get to a realistic punch. Mostly his head doesn't move at all on impact and the fist barely touches him.
A smoky backroom where four capybaras dressed in 1940s gangster attire sit around a poker table under a brass hanging lamp. Cigars smolder in the mouths of the two of them: capybara on the right and capybara with slicked fur in the back, tucked snugly beside their large front incisors, poker chips clatter, and a portrait of a glamorous capybara in a silky dress hangs slightly askew on the dark wooden wall. The camera cuts to a close-up of one capybara with slicked fur and a thick cigar - his eyes narrowing with suspicion as he studies the cards and his opponents through the haze. The camera lingers on the glowing cigar tip, then cuts back to a wide shot of the table as the capybaras exchange cards, chips sliding across the felt under the golden light, the tension thick and cinematic.
This subject presented as a premium collectible figurine on a designer’s desk. Scene:
• Center: a realistic 1/6-scale figurine of the subject on a clear round stand, natural museum pose one foot slightly forward. Keep their current outfit, hair, and accessories exactly as in the source.
• Right: an upright glossy retail box showing the same subject as box art matching outfit and look, with brand AI Creators Tools; clean typography and a small authenticity sticker.
• Left: a widescreen monitor displaying the subject as a grayscale digital sculpt/turntable in a 3D app UI that clearly matches the figurine.
• Desk props: keyboard, mouse pad, a couple of notes; tidy and minimal. Lighting & look:
• Bright natural studio daylight from windows, soft shadows, subtle tabletop reflections.
• Photoreal materials plastics, crisp print on the box, no duplicates or mismatches between figure, box, and monitor. Style tags: photoreal, product photography, studio lighting, sharp focus, clean composition@image1
Realistic, preserves likeness. Main issue is text. Note that only newly introduced text is a problem, model correctly copies the text present on reference image.
A close-up medium shot portrait against a bright blue sky with white puffy clouds, stylized with a modern, fashion-influenced aesthetic. The subject, a woman, wears a light-grey t-shirt with print that reads "AICREATORS.TOOLS" in purple and lime colors with rainbow paint splashes around. On her head she wears a dark-grey baseball cap with silver details. Her long blonde hair falls freely from underneath the cap. Her expression is cheeky and flirtatious, gazing directly at the camera with a light smirk and a playful wink. She holds a little dog in her hands. The lighting is natural and bright, creating a high-contrast image against the vibrant sky, rendered with sharp focus and natural color tones.
Vidu AI's Q2 image model is now live. It supports text to image, reference to image and basic image edits.
It generates fast, around 5 seconds, with 4K quality and strong consistency.
Members can make as many images as they want until Dec 31. New users can enter code VIDUQ2RTI to get extra credits.
model
November 29, 2025
Up to 40% OFF yearly plans on Vidu right now starting $4.8 per month billed annually.
promo
November 19, 2025
10-Image video creation with seamless creative transitions. Make videos using up to 10 keyframes.
feature
October 27, 2025
Vidu AI’s Halloween templates are free till Nov 1. Anyone can use them to turn photos into spooky, funny or cute Halloween images or videos.
promo
October 22, 2025
Vidu Q2 Reference-to-Video model is now live.
Better consistency, faster generation, more affordable pricing. It supports adding sound effects, lip-synced voices and timbres matching with references.
model
Useful Links
No additional links available for this tool.
This page was last updated on December 1, 2025 at 11:08 PM