2 changes: 2 additions & 0 deletions tutorials/flux/flux-2-dev.mdx
Original file line number Diff line number Diff line change
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Dev"

import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'

<iframe width="560" height="315" src="https://www.youtube.com/embed/TzTS74Ii23A?si=f2NFmhNbU2VI3PwX" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

## About FLUX.2

[FLUX.2](https://bfl.ai/blog/flux-2) is a next-generation image model from [Black Forest Labs](https://blackforestlabs.ai/), delivering up to 4MP photorealistic output with far better lighting, skin, fabric, and hand detail. It adds reliable multi-reference consistency (up to 10 images), improved editing precision, better visual understanding, and professional-class text rendering.
2 changes: 2 additions & 0 deletions tutorials/flux/flux-2-klein.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Klein"

import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'

<iframe width="560" height="315" src="https://www.youtube.com/embed/Y9foxm9OYEU?si=FeueXTTBoIkydjk7" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

## About FLUX.2 [klein]

FLUX.2 [Klein] is the fastest model in the Flux family, unifying text-to-image and image editing in one compact architecture. It’s designed for interactive workflows, immediate previews, and latency-critical applications, with distilled variants delivering end-to-end inference around one second while keeping strong quality for single- and multi-reference editing.
2 changes: 2 additions & 0 deletions tutorials/image/z-image/z-image-turbo.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Z-Image"

import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'

<iframe width="560" height="315" src="https://www.youtube.com/embed/HvaEsnyOfSw?si=bsOQzcL0vsPcaE83" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

**Z-Image (造相)** is a powerful and highly efficient image generation model with **6B** parameters, developed by Alibaba's Tongyi Lab. It uses a **Scalable Single-Stream DiT** (S3-DiT) architecture where text, visual semantic tokens, and image VAE tokens are concatenated at the sequence level to serve as a unified input stream, maximizing parameter efficiency.

**Model Variants**:
2 changes: 2 additions & 0 deletions tutorials/video/ltx/ltx-2.mdx
@@ -5,6 +5,8 @@ description: "A DiT-based audio-video foundation model for synchronized video an

import UpdateReminder from "/snippets/tutorials/update-reminder.mdx";

<iframe width="560" height="315" src="https://www.youtube.com/embed/7uaU4Rm7fEo?si=tune56PDf9QfD-JY" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

[LTX-2](https://huggingface.co/Lightricks/LTX-2) is a 19B parameter DiT-based audio-video foundation model by Lightricks. It generates synchronized video and audio in a single pass, creating cohesive experiences where motion, dialogue, background noise, and music are produced together.

<UpdateReminder/>
2 changes: 2 additions & 0 deletions zh-CN/tutorials/flux/flux-2-dev.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Dev"

import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'

<iframe width="560" height="315" src="https://www.youtube.com/embed/TzTS74Ii23A?si=f2NFmhNbU2VI3PwX" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

## 关于 FLUX.2

[FLUX.2](https://bfl.ai/blog/flux-2) 是 [Black Forest Labs](https://blackforestlabs.ai/) 推出的下一代图像模型,可生成高达 4MP 的照片级真实输出,在光照、皮肤、织物和手部细节方面有显著提升。它支持可靠的多参考一致性(最多 10 张图像)、改进的编辑精度、更好的视觉理解能力以及专业级的文字渲染。
2 changes: 2 additions & 0 deletions zh-CN/tutorials/flux/flux-2-klein.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Flux.2 Klein"

import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'

<iframe width="560" height="315" src="https://www.youtube.com/embed/Y9foxm9OYEU?si=FeueXTTBoIkydjk7" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

## 关于 FLUX.2 [Klein]

FLUX.2 [Klein] 是 Flux 系列中目前(2026年1月15日)最快的模型,将文生图与图像编辑统一在紧凑架构中。它面向交互式工作流、即时预览与低延迟场景;蒸馏版本可在约 1 秒内完成端到端推理,并在单图与多图参考编辑中保持出色画质。
2 changes: 2 additions & 0 deletions zh-CN/tutorials/image/z-image/z-image-turbo.mdx
@@ -6,6 +6,8 @@ sidebarTitle: "Z-Image"

import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'

<iframe width="560" height="315" src="https://www.youtube.com/embed/HvaEsnyOfSw?si=bsOQzcL0vsPcaE83" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

**Z-Image(造相)** 是阿里巴巴通义实验室开发的一个强大且高效的图像生成模型,拥有 **6B** 参数。它采用 **可扩展单流 DiT**(S3-DiT)架构,将文本、视觉语义 token 和图像 VAE token 在序列级别进行拼接,作为统一的输入流,最大化参数效率。

**模型变体**:
2 changes: 2 additions & 0 deletions zh-CN/tutorials/video/ltx/ltx-2.mdx
@@ -5,6 +5,8 @@ description: "基于 DiT 的音视频基础模型,支持同步生成视频和

import UpdateReminder from "/snippets/zh/tutorials/update-reminder.mdx";

<iframe width="560" height="315" src="https://www.youtube.com/embed/7uaU4Rm7fEo?si=tune56PDf9QfD-JY" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>

[LTX-2](https://huggingface.co/Lightricks/LTX-2) 是 Lightricks 推出的 190 亿参数 DiT 音视频基础模型。它可以在单次生成中同步产出视频和音频,将动作、对话、背景音效和音乐融为一体。

<UpdateReminder/>