Skip to content

Conversation

@junqiangwu
Copy link

@junqiangwu junqiangwu commented Dec 11, 2025

What does this PR do?

This PR introduces the LongCat-Image Series into the diffusers library. The LongCat-Image Series comprises powerful and highly efficient 6B-parameter models, featuring both a state-of-the-art image generation model and a versatile image editing model. Designed to be friendly for consumer-grade hardware, this series demonstrates strong capabilities in high-fidelity image synthesis and precise, controllable image editing. The technical report and checkpoints for both models have been released.

Thanks for the support of @yiyixuxu @apolinario.

Copy link
Collaborator

@yiyixuxu yiyixuxu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks a lot for the PR!
looks really good to me, I left some very small feedbacks, I think we can merge this soon:)

we also need doc & tests but can work on that in a follow-up PR if you prefer. Let us know

Copy link
Collaborator

@yiyixuxu yiyixuxu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks! I left a few more comments


def _encode_prompt( self, prompt ):
all_tokens = []
for clean_prompt_sub, matched in split_quotation(prompt[0]):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are we only support single prompt? if so maybe can add a check in check_inputs

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We’ve updated the code so that the text-to-image pipeline now accepts multiple prompts, while the edit pipeline still takes a single prompt. Both behaviors are enforced via checks in check_inputs.

@yiyixuxu
Copy link
Collaborator

I think you need to add the doc page into the index here https://github.com/huggingface/diffusers/blob/main/docs/source/en/_toctree.yml

also can you run make style and make fix-copies?

@junqiangwu
Copy link
Author

Thanks for the feedback — we’ve updated the code accordingly and run make style and make fix-copies in this commit. Please let us know if you spot anything else that should be addressed. @yiyixuxu

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants