Comfy-Org · comfyui-wiki · Jan 25, 2026
diff --git a/tutorials/flux/flux-2-dev.mdx b/tutorials/flux/flux-2-dev.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Dev"
 
 import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/TzTS74Ii23A?si=f2NFmhNbU2VI3PwX" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 ## About FLUX.2
 
 [FLUX.2](https://bfl.ai/blog/flux-2) is a next-generation image model from [Black Forest Labs](https://blackforestlabs.ai/), delivering up to 4MP photorealistic output with far better lighting, skin, fabric, and hand detail. It adds reliable multi-reference consistency (up to 10 images), improved editing precision, better visual understanding, and professional-class text rendering.

diff --git a/tutorials/flux/flux-2-klein.mdx b/tutorials/flux/flux-2-klein.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Klein"
 
 import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/Y9foxm9OYEU?si=FeueXTTBoIkydjk7" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 ## About FLUX.2 [klein]
 
 FLUX.2 [Klein] is the fastest model in the Flux family, unifying text-to-image and image editing in one compact architecture. It’s designed for interactive workflows, immediate previews, and latency-critical applications, with distilled variants delivering end-to-end inference around one second while keeping strong quality for single- and multi-reference editing.

diff --git a/tutorials/image/z-image/z-image-turbo.mdx b/tutorials/image/z-image/z-image-turbo.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Z-Image"
 
 import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/HvaEsnyOfSw?si=bsOQzcL0vsPcaE83" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 **Z-Image (造相)** is a powerful and highly efficient image generation model with **6B** parameters, developed by Alibaba's Tongyi Lab. It uses a **Scalable Single-Stream DiT** (S3-DiT) architecture where text, visual semantic tokens, and image VAE tokens are concatenated at the sequence level to serve as a unified input stream, maximizing parameter efficiency.
 
 **Model Variants**:

diff --git a/tutorials/video/ltx/ltx-2.mdx b/tutorials/video/ltx/ltx-2.mdx
@@ -5,6 +5,8 @@ description: "A DiT-based audio-video foundation model for synchronized video an
 
 import UpdateReminder from "/snippets/tutorials/update-reminder.mdx";
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/7uaU4Rm7fEo?si=tune56PDf9QfD-JY" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 [LTX-2](https://huggingface.co/Lightricks/LTX-2) is a 19B parameter DiT-based audio-video foundation model by Lightricks. It generates synchronized video and audio in a single pass, creating cohesive experiences where motion, dialogue, background noise, and music are produced together.
 
 <UpdateReminder/>

diff --git a/zh-CN/tutorials/flux/flux-2-dev.mdx b/zh-CN/tutorials/flux/flux-2-dev.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Dev"
 
 import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/TzTS74Ii23A?si=f2NFmhNbU2VI3PwX" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 ## 关于 FLUX.2
 
 [FLUX.2](https://bfl.ai/blog/flux-2) 是 [Black Forest Labs](https://blackforestlabs.ai/) 推出的下一代图像模型，可生成高达 4MP 的照片级真实输出，在光照、皮肤、织物和手部细节方面有显著提升。它支持可靠的多参考一致性（最多 10 张图像）、改进的编辑精度、更好的视觉理解能力以及专业级的文字渲染。

diff --git a/zh-CN/tutorials/flux/flux-2-klein.mdx b/zh-CN/tutorials/flux/flux-2-klein.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Klein"
 
 import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/Y9foxm9OYEU?si=FeueXTTBoIkydjk7" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 ## 关于 FLUX.2 [Klein]
 
 FLUX.2 [Klein] 是 Flux 系列中目前(2026年1月15日)最快的模型，将文生图与图像编辑统一在紧凑架构中。它面向交互式工作流、即时预览与低延迟场景；蒸馏版本可在约 1 秒内完成端到端推理，并在单图与多图参考编辑中保持出色画质。

diff --git a/zh-CN/tutorials/image/z-image/z-image-turbo.mdx b/zh-CN/tutorials/image/z-image/z-image-turbo.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Z-Image"
 
 import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/HvaEsnyOfSw?si=bsOQzcL0vsPcaE83" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 **Z-Image（造相）** 是阿里巴巴通义实验室开发的一个强大且高效的图像生成模型，拥有 **6B** 参数。它采用 **可扩展单流 DiT**（S3-DiT）架构，将文本、视觉语义 token 和图像 VAE token 在序列级别进行拼接，作为统一的输入流，最大化参数效率。
 
 **模型变体**：

diff --git a/zh-CN/tutorials/video/ltx/ltx-2.mdx b/zh-CN/tutorials/video/ltx/ltx-2.mdx
@@ -5,6 +5,8 @@ description: "基于 DiT 的音视频基础模型，支持同步生成视频和
 
 import UpdateReminder from "/snippets/zh/tutorials/update-reminder.mdx";
 
+<iframe width="560" height="315" src="https://www.youtube.com/embed/7uaU4Rm7fEo?si=tune56PDf9QfD-JY" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
+
 [LTX-2](https://huggingface.co/Lightricks/LTX-2) 是 Lightricks 推出的 190 亿参数 DiT 音视频基础模型。它可以在单次生成中同步产出视频和音频，将动作、对话、背景音效和音乐融为一体。
 
 <UpdateReminder/>