[Model] Add Qwen-Image-Edit #196

SamitHuang · 2025-12-04T10:39:36Z

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

Add Qwen-Image-Edit, as described #187

Test Plan

python image_edit.py \
    --image qwen_bear.png \
    --prompt "Let this mascot dance under the moon, surrounded by floating stars and poetic bubbles such as 'Be Kind'" \
    --output output_image_edit.png \
    --num_inference_steps 50 \
    --cfg_scale 4.0

Test Result

Input image:

Edited image:

prompt: "Let this mascot dance under the moon, surrounded by floating stars and poetic bubbles such as 'Be Kind'"

prompt: "Add a white art board written with colorful text 'vLLM-Omni' on grassland. Add a paintbrush in the bear's hands. position the bear standing in front of the art board as if painting"

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

Signed-off-by: samithuang <[email protected]>

chatgpt-codex-connector · 2025-12-04T10:39:40Z

The account who enabled Codex for this repo no longer has access to Codex. Please contact the admins of this repo to enable Codex again.

Signed-off-by: samithuang <[email protected]>

hsliuustc0106 · 2025-12-04T23:22:25Z

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

Add Qwen-Image-Edit, as described #187

TODOs:

align output with diffusers

run pre-processing in DiffusionEngine (or API Server after online serving supported)

Test Plan
python image_edit.py \
    --image qwen_bear.png \
    --prompt "Let this mascot dance under the moon, surrounded by floating stars and poetic bubbles such as 'Be Kind'" \
    --output output_image_edit.png \
    --num_inference_steps 50 \
    --cfg_scale 4.0
Test Result

Input image:
Edited image:
Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".

The test plan, such as providing test command.

The test results, such as pasting the results comparison before and after, or e2e results

(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

(Optional) Release notes update. If your change is user facing, please update the release notes draft.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

could you modify the png for qwen x vllm-omni

SamitHuang · 2025-12-05T07:51:47Z

@hsliuustc0106 added

Signed-off-by: samithuang <[email protected]>

chatgpt-codex-connector · 2025-12-05T09:16:53Z

The account who enabled Codex for this repo no longer has access to Codex. Please contact the admins of this repo to enable Codex again.

ZJY0516 · 2025-12-05T09:35:44Z

vllm_omni/diffusion/registry.py

        )
+
+
+def get_diffusion_pre_process_func(od_config: OmniDiffusionConfig):


Can we unify get_diffusion_pre_process_func and get_diffusion_post_process_func? They have some common code

good suggestion, common code is extracted into a function

ZJY0516 · 2025-12-05T09:36:33Z

vllm_omni/diffusion/diffusion_engine.py

+            postprocess_start_time = time.time()
+            result = self.post_process_func(output.output)
+            postprocess_time = time.time() - postprocess_start_time
+            logger.info(f"Post-processing completed in {postprocess_time:.4f} seconds")


Do we really need this log?

it's used for performance profiling. just two lines of info log. i think it doesn't matter

vllm_omni/diffusion/models/qwen_image/pipeline_qwen_image_edit.py

Signed-off-by: samithuang <[email protected]>

ZJY0516 · 2025-12-08T03:16:49Z

vllm_omni/diffusion/models/qwen_image/pipeline_qwen_image_edit.py

+    def load_weights(self):
+        self.load_transformer()
+
+    def load_transformer(self):


use weight loader introduced in #157

Signed-off-by: samithuang <[email protected]>

Signed-off-by: Samit <[email protected]>

Gaohan123

LGTM, thanks!

SamitHuang added 3 commits December 4, 2025 10:00

init qwenimage edit

51d30fe

Signed-off-by: samithuang <[email protected]>

add example

295257e

Signed-off-by: samithuang <[email protected]>

fix image latent concat

ba95a6b

Signed-off-by: samithuang <[email protected]>

SamitHuang requested a review from hsliuustc0106 as a code owner December 4, 2025 10:39

SamitHuang marked this pull request as draft December 4, 2025 10:39

SamitHuang mentioned this pull request Dec 4, 2025

[RFC]: DiT model and feature support enhancement #85

Open

40 tasks

update result

25698f7

Signed-off-by: samithuang <[email protected]>

hsliuustc0106 assigned SamitHuang Dec 4, 2025

hsliuustc0106 added the new model add new model label Dec 4, 2025

SamitHuang added 3 commits December 5, 2025 07:53

small fix

a2d07af

Signed-off-by: samithuang <[email protected]>

move pre-processing to DiffusionEngine

3bbe010

Signed-off-by: samithuang <[email protected]>

linting

c1d10fb

Signed-off-by: samithuang <[email protected]>

SamitHuang requested a review from ZJY0516 December 5, 2025 09:16

SamitHuang marked this pull request as ready for review December 5, 2025 09:16

SamitHuang changed the title ~~[Model][WIP] Add Qwen-Image-Edit~~ [Model] Add Qwen-Image-Edit Dec 5, 2025

ZJY0516 reviewed Dec 5, 2025

View reviewed changes

SamitHuang added 4 commits December 6, 2025 03:41

clean code

3e1e963

Signed-off-by: samithuang <[email protected]>

Merge branch 'main' into qwenimage_edit

6cbc1a6

linting

b95369b

Signed-off-by: samithuang <[email protected]>

update supported models

775ea31

Signed-off-by: samithuang <[email protected]>

ZJY0516 reviewed Dec 8, 2025

View reviewed changes

use new weight loader

a938e76

Signed-off-by: samithuang <[email protected]>

ZJY0516 approved these changes Dec 8, 2025

View reviewed changes

SamitHuang requested a review from Gaohan123 December 8, 2025 03:31

Merge branch 'main' into qwenimage_edit

5a4b19f

Signed-off-by: Samit <[email protected]>

Gaohan123 approved these changes Dec 8, 2025

View reviewed changes

Merge branch 'main' into qwenimage_edit

682bf13

Gaohan123 enabled auto-merge (squash) December 8, 2025 06:32

Merge branch 'main' into qwenimage_edit

ffdf798

Gaohan123 merged commit e8ea478 into vllm-project:main Dec 8, 2025
4 checks passed

		)


		def get_diffusion_pre_process_func(od_config: OmniDiffusionConfig):

[Model] Add Qwen-Image-Edit #196

[Model] Add Qwen-Image-Edit #196

Conversation

SamitHuang commented Dec 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

chatgpt-codex-connector bot commented Dec 4, 2025

Uh oh!

hsliuustc0106 commented Dec 4, 2025

Purpose

Test Plan

Test Result

Uh oh!

SamitHuang commented Dec 5, 2025

Uh oh!

chatgpt-codex-connector bot commented Dec 5, 2025

Uh oh!

ZJY0516 Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

SamitHuang Dec 6, 2025

Choose a reason for hiding this comment

Uh oh!

ZJY0516 Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

SamitHuang Dec 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ZJY0516 Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

Gaohan123 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

SamitHuang commented Dec 4, 2025 •

edited

Loading

SamitHuang Dec 6, 2025 •

edited

Loading