This is an AI video generation comparison for an image-to-video prompt:
High-quality, cinematic footage of a dialogue with humorous vibe. Two dogs: a Miniature Schnauzer on the left and a chocolate-brown Labrador on the right sit in a professional podcast studio. On the walls, there are framed pictures of various dogs with vain and noble expressions and a large certificate-award reading "Certified Good Boy*" in bold and below, smaller "*Self-Certified". The Miniature Schnauzer dog on left starts by saying enthusiastically while turning its head slightly towards...
Tested: November 9, 2025
The lines match the speakers. Slightly robotic, but not bad. The Lab doesn't glance even briefly at the Schnauzer, though.
Tested: November 9, 2025
Pretty good. The lines match the speakers, and the dogs are very naturally animated.
Tested: November 9, 2025
The lines match the speakers, and the dogs are very well animated, with a clear, cheerful demeanor.
Tested: November 9, 2025
The lines match the speakers, and the dogs appear quite believable. There's a small mishap at the end, when the inside of the Lab's open mouth shows small deformities.
Tested: November 9, 2025
Wan only generates 5- or 10-second clips, so it had to squeeze these lines into 5 seconds. I'll upload the 10s version once it's ready.
Tested: November 10, 2025
If you'd like to get rid of captions, a JSON format like this works well on Grok too. Use my custom GPT to convert your prompt for free (see the link to the original).
Tested: November 12, 2025
This is a very good overall generation with animal lip-sync.
Does the Schnauzer appear to speak first with enthusiastic expression and slight head turn toward the Labrador?
Does the Labrador respond next with an excited tone and a brief glance at the Schnauzer?
Is the lip-sync (or muzzle motion) aligned with each dog’s dialogue segment?
Do both dogs laugh or show clear laughter expressions at the end?
Is the humor tone conveyed naturally through timing, reaction, and delivery?
Is the camera steady, focused, and does it capture both characters evenly during their exchange?
Are there no unnatural distortions or animation errors (mouth warps, misaligned eyes, body clipping)?
Are the framed dog portraits and the “Certified Good Boy* / *Self-Certified” award clearly readable and humorously placed on the wall?
Check out the results from Pollo AI (Sora 2) vs FLOW (Veo 3 Fast) vs GROK (Grok Imagine v0.9) vs Fal AI (LTX-2) vs Wan (Online Platform) (Wan2.5 Preview) vs GROK (Grok Imagine v0.9) vs Pollo AI (Pollo 2.0) for similar or identical prompts side-by-side.