Wan (Open-source)

Wan by Alibaba’s Wan team is an open-source AI suite for generating videos from text and images. It handles motion physics, text rendering, and more—leading the VBench benchmark. Free to use under Apache 2.0.

Visit This Site

Overview

Wan is an open-source AI video tool built by Alibaba’s Wan team. It transforms text and images into high-quality videos while handling motion dynamics, physics, and text rendering in both Chinese and English.

The best part is that it’s free AND supports Consumer-grade GPUs: The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with almost all consumer-grade GPUs. It can generate a 5-second 480P video on an RTX 4090 in about 4 minutes. Mind you, the size of the model is gigantic, dozens of gigabytes, so must have a lot of disk space available.

Wan is fully open-source under the Apache 2.0 license. No hidden fees, no subscriptions—just grab the code and start creating.

You can start using Wan right away. Check out their GitHub for installation and usage instructions. Pre-trained models are also available on Hugging Face and ModelScope for easy integration.

There's GGUF versions for Comfy UI. Wan2.2 is now supported in ComfyUI from Day 0!

Features include cinematic control, complex motion handling & precise semantics—all with MoE architecture.
Available in FP16 & FP8, with 5B models running on 8GB VRAM via auto-offloading.

Links

Video Effects, Templates, Transformations

Educators and Trainers Creative Professionals Content Creators Media and Film Makers Marketing and Branding Specialists Developers and Tech Creators Nonprofit and Advocacy Creators Small Business Owners Entertainment and Performance Artists Professional Content Creators

This tool offers the following AI models:

This tool is free to use when installed locally and is offered under Apache License 2.0.

People are testing the WAN 2.1 I2V model using compressed GGUF files in ComfyUI, mainly on RTX 3060 cards. Here's what they're seeing:

Performance on RTX 3060

416x416, 25 steps → About 9 minutes for 2 seconds of video
512x512, 25 steps → Around 13.5 minutes for 2 seconds

Key Resources & Process

GGUF compressed models are up on Hugging Face
Basic setup guide available [here]
More details on the ComfyUI example page [here]

Hardware Tips

Hardware used: 12GB VRAM, 48GB RAM (extra RAM helps a lot)
Some users get by with 16-32GB RAM

Choosing Compression Levels

Q4_0 was used, but higher levels (bigger files) give better quality

How It Compares

More stable than SkyReels. Less "melting" effect than some other tools.

[ Reddit ]

Prompt:

Animate this pencil drawing of a cute surreal magical adorable scene, let penguin walk turning around itself while chasing butterflies. Make sure all the butterflies are animated.

Compare Tools

Generated on July 31, 2025:

The penguin is a bit hyperactive and butterflies sleepy.

Wan2.2 TI2V-5B

Prompt:

The sky is alive with energy as two giant flying turtles soar majestically above a blurred landscape of mountains and rivers, carrying serene monks in meditation atop their shells. The turtles' eyes suddenly blaze with an intense blue glow, crackling bolts of electricity erupting into the air, illuminating the turbulent sky with flashes of supernatural power. As the lightning flickers, an intense wind bursts forth, thrashing the monks' flowing robes, creating a dynamic and chaotic atmosphere. The turtles split apart in a dramatic arc, veering left and right as they abandon the scene, leaving the monks momentarily isolated in the vast expanse. The focus shifts smoothly to the empty valley below, where mist thrums with energy, swirling dynamically

Compare Tools

Generated on May 23, 2025:

Wan 2.1 Plus has done a great job

Prompt:

Double dolly shot: A fierce and determined woman stands in the center of a dusty Western street, her expression focused and intense as she points a shotgun towards an unseen foe. The camera glides backward, while the town around her drifts ominously, buildings and debris swirling in the distance. She remains unnaturally steady, her hair swirling in a slight breeze, amplifying the tension of the moment. This visual dissonance captures her psychological struggle, symbolizing her inner strength. The harsh sunlight casts long shadows, enhancing the gritty texture of her rugged attire and the arid landscape.

Compare Tools

Generated on May 22, 2025:

Wan 2.1 14b quantized image to video 480p Q4_K_S gguf. Online version censors this image and won't generate anything wit it. No double dolly visible here anyway.

Prompt:

Surreal scene of a delicately ornate porcelain mug with an embossed gold floral pattern along the rim and upper edges, featuring a highly realistic detailed nature scene wrapping around the central surface of the mug. In the middle of the design, a flamingo stands gracefully in a small reflective water pond, surrounded by lush green reeds, blooming wildflowers, and delicate pink and white blossoms. The painted lake water ripples and overflows, spilling realistically from the lower edge of the cup onto the table. The pool of water on the table grows larger and larger, soon surrounding the cup. Several yellow butterflies flutter around the flamingo—some in mid-air, some perched on flowers. The background within the design has a soft pastel sky with a dreamy, painterly gradient from light blue to warm beige. The mug is placed on a neutral grey surface, and a few tiny flower petals are scattered around its base, reinforcing the ethereal and serene atmosphere. The handle is elegantly curved and highlighted with subtle golden detailing.

Compare Tools

Generated on May 19, 2025:

Wanx 2.1 Plus, generated on their online platform, with sound added there too

Prompt:

A natural beauty blonde woman with a confident and assertive pose is in the center looking at the viewer with fierce focused annoyed gaze, wearing a slightly dirty Western-inspired outfit, including a Western-style hat, a dress with a vest-like undergarment, and a feminine yet strong aesthetic. She is pointing a large 8-Gauge Shotgun (aka 8-bore) straight in front of herself, at the viewer, with the massive barrel dominating the foreground and appearing exaggerated in size due to dramatic perspective, creating an intense visual focus. Her skin is slightly dirty. The style is a realistic photograph with a focus on fashion and a cinematic quality, a still from a Western-themed film. The background consists of a series of buildings, a hint of a mountain range, and a rustic atmosphere.

Compare Tools

Generated on May 17, 2025:

Image output — Generated by Wan2.1 Plus. This is best of 4 outputs. Good effort, but barrel is obviously not the best. Cinematic vibe - great.

Prompt:

An elderly lady is driving her electric wheelchair forward at 140 km/h on a winding mountain road. Due to the high speed, her scarf quickly flaps in front of the camera lens, which is 10 meters behind the wheelchair and moves with the vehicle. The camera is shot from behind; the elements at the side of the road are blurred due to the high speed. The lady is driving very fast and overtaking other cars. This realistic shot was taken with 35mm film.

Compare Tools

Generated on May 17, 2025:

Wan2.1 Plus text to video produced a really nice result. Sound effect also by Wan.

Prompt:

A man lunges forward, throwing a lightning-fast jab at a goat in a red hoodie. The goat ducks effortlessly, its horns grazing the air, then counters with a powerful left hook that sends sweat flying. Their gloves collide with a thunderous impact as the camera whips around them in a tight 360-degree spin. Neon lights flicker overhead while dust and spit explode into slow motion. Every strike lands like a cinematic thunderclap.

Compare Tools

Generated on May 3, 2025:

Wanx 2.1 through Pollo.ai

Prompt:

A sleek humanoid robot performs a mesmerizing dance in a stark, minimalistic futuristic room. The seamless white walls glow softly with pulsing lines of light, creating a sharp contrast against the robot’s dark figure. The camera starts with a static wide shot, gradually moving into a slow orbit around the robot, capturing its fluid, precise movements. The room’s dynamic lighting synchronizes with the rhythm of the dance, casting glowing patterns and subtle shadows on the floor and walls. The overall scene combines elegance, sophistication, and a high-tech sci-fi aesthetic.

Compare Tools

Generated on February 28, 2025:

Image to Video done in Comfy UI's Wan2.1-I2V-14B-480P-gguf version Q4

Prompt:

The scene begins with a close-up of striking red high heels, sharp and polished, walking away on a fractured asphalt road. The camera remains low to the ground, fully focused on the legs as they walk with deliberate confidence. The camera steadily tracks the legs from behind, capturing their motion as they stride through a desolate, post-apocalyptic street.

Compare Tools

Generated on February 28, 2025:

1.3 billion Wan2.1

Prompt:

Close-up shot of a woman’s tear-filled eyes as she pleads during a heated argument with her partner, seen from his back. The camera slowly zooms in on the tears streaking her flushed cheeks, the soft glow of kitchen lights barely illuminating the scene behind her.

Compare Tools

Generated on February 28, 2025:

Test in local Comfy UI install of 1.3 billion Wan t2v model

Prompt:

A sausage dog wearing stylish wind goggles drives a gleaming chrome motorcycle, its long ears flapping wildly in the breeze, and its mouth open in…

to see full prompt.

Compare Tools

Generated on February 28, 2025:

Test in local Comfy UI install of 1.3 billion Wan model

Prompt:

A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage.

to see full prompt.

Generated on February 26, 2025:

Fal.ai published video example for WAN image-2-video

Prompt:

A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red…

to see full prompt.

Generated on February 26, 2025:

Fal.ai published video example for WAN t2v

Rating:

Favorite

Latest Wan (Open-source) News

August 26, 2025

WAN 2.2-S2V, an open-source cinematic speech-to-video model, is set to launch soon. The model generates video content based on audio input and is designed with a focus on visual fidelity.

model

July 31, 2025

WanGP 7.5 now works with Wan 2.2. Most things from Wan 2.1 still work fine like low VRAM use, extra long videos, image making, and MMaudio. It needs more RAM now since the model’s twice as big but still uses the same VRAM.

model

July 30, 2025

ComfyUI dropped a new patch for Wan 2.2. It cuts VAE decoding memory use by about 10%. The 5B I2V model got a big boost too. They also added new setup templates for the 14B models. You can grab the latest version on Git, Portable, or Desktop.

feature

July 29, 2025

ComfyUI adds native Wan2.2 support.
Available in FP16 & FP8, with 5B models running on 8GB VRAM via auto-offloading.

feature

July 28, 2025

Wan 2.2 is out now! Featuring a faster model, open-source access, and upgraded tools from Tongyi Lab at Alibaba Group. Improved prompt following, motion control, and visual detailing make it ideal for cinematic AI creation.

model

Useful Links

Wan 2.2 First/Last Frame Video Fast

Other

Hugging Face spaces for Wan 2.2 Fast with start and end frames capability.

Added on: September 1, 2025

Instareal WAN 2.2

Version

From the creators of Instagirl, this is the next step in their pursuit of perfect photorealism. Instareal is a specialized foundational LoRA for Wan 2.2

Added on: August 18, 2025

Daxamur's WAN 2.2 Workflows

Workflow

Workflow pre configured for the best balance for speed / quality

Added on: August 16, 2025

ComfyUI Wan2.2 Fun Inp Start-End Frame Video Generation Example

Workflow

How to use ComfyUI to complete the Wan2.2 Fun Inp start-end frame video generation example

Added on: August 14, 2025

Wan 2.2 Image ComfyUI workflow

Workflow

Quantized versions for low VRAM using GGUF models

Added on: August 6, 2025

QuantStack's Quantized Wan 2.2

Version

Compressed GGUF models collection for Wan 2.2

Added on: August 3, 2025

Wan2.2 ComfyUI Official Native Workflow Example

Tutorial

Official usage guide for Alibaba Cloud Tongyi Wanxiang 2.2 video generation model in ComfyUI

Added on: July 30, 2025

Wan 2.2 with 8GB VRAM

Workflow

The OP used Wan 2.2 image-to-video GGUF model Q6 with image-to-video ligtx2v lora. They shared how they tweaked the ComfyUI default workflow.

Added on: July 30, 2025

Drone-style push‑in motion LoRA for Wan 2.1

LoRA

A new LoRA for Wan 2.1 brings realistic drone-style push‑in motion 🎥 Trained on 100 clips and refined through 40+ versions, it includes a ComfyUI workflow and is triggered by “Push‑in camera.” Perfect for adding cinematic movement to your videos.

Added on: July 17, 2025

Fast 4 steps Wan 2.1 I2V (14B) with CausVid LoRA

Version

CausVid is a distilled version of Wan 2.1 to run faster in just 4-8 steps, extracted as LoRA by Kijai and is compatible with ðŸ§¨ diffusers

Added on: May 23, 2025

WanGP by DeepBeepMeep

Version

Open and advanced large-scale video generative models for the GPU poor

Added on: May 11, 2025

ComfyUI Workflows + Guide for Wan2.1

Workflow

Using native ComfyUI nodes and using the kijai wan wrapper nodes allowing for more features.

Added on: March 19, 2025

Viral Effects LoRAs for Wan2.1 14B 480p I2V

LoRA

A wide range of open-sourced effects - such as Assassin, Jungle, Pirate Captain, Baby, Princess, Painting, Warrior, Samurai, Snow White, Bride, Mona Lisa, Zen, VIP, Puppy, Classy, and Disney Princess-all available on Hugging Face under the Apache 2.0 license trained on Wan2.1 14B I2V 480p model.

Added on: March 19, 2025

This page was last updated on August 26, 2025 at 6:13 AM

Wan (Open-source)

Overview

Tags

Links

What can it do?

Video Effects, Templates, Transformations

Who is it for?

AI models offered

How much does it cost?

Community feedback and reviews

Wan (Open-source) examples

Latest Wan (Open-source) News

Useful Links