
Seedance

A model that supports multi-shot video generation from both text and image prompts. It achieves breakthroughs in semantic understanding and prompt following, and can create 1080p videos with smooth motion, rich details, and cinematic aesthetics.

Introduction

Seedance is ByteDance’s first‑generation video foundation model designed for both text‑to‑video (T2V) and image‑to‑video (I2V) generation. The 1.0 release emphasizes smooth, stable motion and precise prompt following, producing cinematic 1080p clips and natively supporting multi‑shot storytelling with consistent subjects, style, and atmosphere.

Key capabilities

  • Native multi‑shot narrative generation with strong subject/style consistency.
  • High spatiotemporal fluidity and structural stability for complex actions and multi‑agent scenes.
  • Dual‑modality prompting (text and image) with accurate interpretation of diverse styles, from photorealism to illustration.
  • A unified architecture and post‑training stack (fine‑grained SFT, video‑specific RLHF) with optimizations for fast inference.

The official technical report for Seedance 1.0 was posted in June 2025, detailing the data curation, architecture, training, and acceleration techniques behind the model. Seedance is accessible through ByteDance’s Seed portal with options to try the model and obtain API access, supporting creative workflows from storyboards and ads to multi‑shot narrative shorts.
