Mochi (Genmo AI)

Mochi is an AI-powered video generator from Genmo. It turns text prompts into videos, aiming for smooth motion and strong control over characters and settings.

Overview

Mochi creates videos from text input, giving users control over movement, expressions, and environment.

It is available both as a free tool, if you run it locally, and as a freemium platform (Genmo) with limited daily use. The Free Plan allows 2 fast generations per day and up to 30 videos per month, with a watermark included.

ComfyUI wrapper nodes are also available for the Mochi video generator.

Tags

Freemium, Apache License 2.0, Web-based, #Video & Animation

  • Text-2-Video

Educators and Trainers, Creative Professionals, Content Creators, Media and Film Makers, Marketing and Branding Specialists, Developers and Tech Creators, Nonprofit and Advocacy Creators, Small Business Owners, Entertainment and Performance Artists, Professional Content Creators

Plan Name | Tier Type | Cost per Month
Free | free | 0.00
* 2 fast generations per day. Up to 30 videos monthly. Watermark included.
Lite | lowest | 10.00
* More creativity with the Lite Plan: no watermarks, commercial rights, and 4x more videos. Up to 8 fast generations per day. Up to 80 videos monthly.
Standard | top | 30.00
* Up to 32 fast generations per day. Up to 180 videos monthly. Watermark-free creations with Stealth Mode for privacy and highest-priority generations.

Where multiple modes are available, the calculations are done for the most advanced (and costly) ones.

Pricing can change, so check the relevant links for any updates to the subscription plans.
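To compare the plans, it can help to divide each monthly price by its monthly video cap. The snippet below is a minimal sketch using only the figures from the pricing table above; the `PLANS` dictionary and the `cost_per_video` helper are illustrative names, and the calculation assumes the monthly cap is used in full. The table does not state a currency, so none is shown.

```python
# Plan figures copied from the pricing table; the currency is not stated there.
PLANS = {
    "Free": {"monthly_cost": 0.00, "videos_per_month": 30, "fast_per_day": 2},
    "Lite": {"monthly_cost": 10.00, "videos_per_month": 80, "fast_per_day": 8},
    "Standard": {"monthly_cost": 30.00, "videos_per_month": 180, "fast_per_day": 32},
}


def cost_per_video(plan_name):
    """Monthly cost divided by the monthly video cap (assumes the cap is fully used)."""
    plan = PLANS[plan_name]
    return plan["monthly_cost"] / plan["videos_per_month"]


for name in PLANS:
    print(f"{name}: {cost_per_video(name):.3f} per video at the monthly cap")
```

Under that assumption, Lite works out to 0.125 per video and Standard to roughly 0.167 per video, so the higher tier pays for volume, priority, and Stealth Mode rather than a lower per-video price.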

Community Reactions to Mochi 1 on a 3060 12GB

Users are excited about running Mochi 1 on consumer GPUs like the RTX 3060 12GB, especially since it previously needed high-end hardware. The workflow tweaks shared by the OP (@Jonseed) made it possible to generate 61 frames (2.5 seconds) in about 17 minutes, plus 1 minute for VAE decoding.

  • VRAM Matters. The 3060’s 12GB of VRAM makes it possible to run Mochi locally, though lower-VRAM cards struggle.
  • Speed vs Quality. Generating 61 frames took about 17 minutes on a 3060, but faster GPUs like a 4090 reportedly still take 40 minutes in some setups. Meanwhile, a hypothetical RTX 5090 could do it in 3 minutes.
  • Optimizations Help. Users found that using Kijai’s Mochi VAE Decoder and adjusting tiling settings improved performance and reduced memory errors.
  • Excitement for the Future. Many are amazed at how quickly hardware requirements dropped, from needing 4 H100 GPUs to running on a mid-range consumer card in just weeks.
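The timings quoted in the thread are easier to compare when converted to seconds per frame and to a slowdown relative to real time. The sketch below does only that arithmetic; the numbers are the community-reported figures from this section, not benchmarks, and the function and variable names are illustrative.

```python
# Figures reported in the thread summary; community anecdotes, not benchmarks.
FRAMES = 61
CLIP_SECONDS = 2.5

# Reported wall-clock minutes per 61-frame clip.
REPORTED_MINUTES = {
    "RTX 3060 12GB, sampling only": 17,
    "RTX 3060 12GB, sampling + VAE decode": 17 + 1,
    "RTX 4090, some setups": 40,
    "RTX 5090, hypothetical": 3,
}


def seconds_per_frame(frames, minutes):
    """Average compute seconds spent on each generated frame."""
    return minutes * 60 / frames


def slowdown_vs_realtime(clip_seconds, minutes):
    """Seconds of compute needed per second of finished video."""
    return minutes * 60 / clip_seconds


for setup, minutes in REPORTED_MINUTES.items():
    spf = seconds_per_frame(FRAMES, minutes)
    ratio = slowdown_vs_realtime(CLIP_SECONDS, minutes)
    print(f"{setup}: {spf:.1f} s/frame, {ratio:.0f}x slower than real time")
```

On the 3060 figures, sampling alone comes to about 16.7 seconds per frame, and the full 18 minutes including VAE decoding means 432 seconds of compute for each second of video.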

Source: Reddit

Mochi 1 Tutorial with SwarmUI

A tutorial on using Mochi 1 with SwarmUI has people talking, especially since it was tested on an RTX 3060 12GB and works well. The video consists of 64 AI-generated clips, each 5 seconds long at 24 FPS, and the tutorial is open-access.

  • Performance Insights. The RTX 3060 12GB handles 49-frame clips in about 5 minutes, but a 4090 can do the same in under 2 minutes.
  • Cloud vs Local. While the tutorial claims it was tested on a 3060, many clips were actually generated in the cloud, which some found a bit misleading.
  • Excitement for Accessibility. Compared to online video generators, running Mochi locally is much faster and more customizable, making it an exciting option for AI video enthusiasts.
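The clip counts and timings above can be turned into totals with a few lines of arithmetic. This is a rough sketch based only on the figures quoted in this section; the helper names are illustrative, and the speedup is a lower bound because the 4090 time is given only as "under 2 minutes".

```python
# Figures quoted in this section for the SwarmUI tutorial video.
CLIPS = 64
CLIP_SECONDS = 5
FPS = 24


def total_frames(clips, clip_seconds, fps):
    """Total number of frames across all clips in the showcase video."""
    return clips * clip_seconds * fps


def total_runtime_seconds(clips, clip_seconds):
    """Combined playback length of all clips, in seconds."""
    return clips * clip_seconds


def min_speedup(slow_minutes, fast_minutes_upper_bound):
    """Lower bound on the speedup when the faster time is only an upper bound."""
    return slow_minutes / fast_minutes_upper_bound


print(total_frames(CLIPS, CLIP_SECONDS, FPS))      # frames in the full video
print(total_runtime_seconds(CLIPS, CLIP_SECONDS))  # seconds of footage
print(min_speedup(5, 2))                           # 4090 vs 3060 on 49-frame clips
```

That gives 7,680 frames and 320 seconds (5 minutes 20 seconds) of footage, with the 4090 at least 2.5 times faster than the 3060 on 49-frame clips. Note that 49 frames at 24 FPS is only about 2 seconds, so the quoted timings describe shorter clips than the 5-second ones in the video.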

Source: Reddit

Prompt: A sausage dog wearing stylish wind goggles drives a gleaming chrome motorcycle, its long ears flapping wildly in the breeze, and its mouth open in an excited, playful expression. The dog looks thrilled as it grips the handlebars tightly. In the front basket of the motorcycle, a ginger-and-white cat sits energetically, its fur tousled by the wind, with wide, excited eyes and an open-mouthed expression of joy. The background features a vast countryside road stretching into the distance, lined with golden fields and distant mountains under warm golden sunlight. The entire scene exudes quirky, dynamic energy with a fun and cinematic vibe.

Generated on February 12, 2025:

Mochi 1 model through Genmo AI playground

Prompt: A highly intense and cinematic scene of a blonde female soldier in camouflage bandana with piercing eyes in the foreground, partially submerged in muddy water, aiming a modern assault rifle directly at the viewer and firing. Her face and head are soaked, with mud and grime on her shoulders emphasizing her rugged, battle-worn appearance. Behind her, a group of heavily armed soldiers, all in tactical gear and helmets, are advancing through the water, partially blurred to suggest depth. The background features a dense, misty jungle with faint outlines of trees, adding a humid, gritty atmosphere. The lighting is diffused, with overcast conditions casting a moody, natural glow on the scene. The color palette is dominated by cold earthy tones such as greens, browns, and greys, enhanced by the reflective surfaces of water and wet gear. The overall mood is tense and cinematic, capturing a sense of danger and urgency.

Generated on February 13, 2025:

Cinematic style test with Mochi 1 model through Genmo AI playground

Prompt: Raw footage of a 22-year-old influencer screaming from excitement while flying with a parachute in the sky to impress his followers. Loose closeup on his screaming face, fisheye lens, crisp and candid raw footage

Generated on February 13, 2025:

Realistic selfie footage test with Mochi 1 model through Genmo AI playground

Prompt: The scene begins with a close-up of striking red high heels, sharp and polished, walking away on a fractured asphalt road. The camera remains low to the ground, fully focused on the legs as they walk with deliberate confidence. The camera steadily tracks the legs from behind, capturing their motion as they stride through a desolate, post-apocalyptic street. [...]

Generated on February 18, 2025:

Mochi 1, generated in the Genmo AI playground. The "walking away" part of the prompt was ignored.

Prompt: Close-up shot of a woman’s tear-filled eyes as she pleads during a heated argument with her partner, seen from his back. The camera slowly zooms in on the tears streaking her flushed cheeks, the soft glow of kitchen lights barely illuminating the scene behind her.

Generated on February 18, 2025:

Mochi 1 emotional coherence prompt test

This page was last updated on February 18, 2025 at 2:36 PM