Z-Image-Turbo from_single_file fix #12888
Open
+9
−5
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What does this PR do?
CHECKPOINT_KEY_NAMESchanged fromcap_embedder.0.weighttolayers.0.adaLN_modulation.0.weightandmodel.diffusion_model.layers.0.adaLN_modulation.0.weight. This helps better distinguish vslumina, andmodel.diffusion_model.variant is also required for some checkpoints.convert_z_image_transformer_checkpoint_to_diffusers{"model.diffusion_model.": ""}is added toZ_IMAGE_KEYS_RENAME_DICTto remove the prefixnorm_final.weightkey is removed, this appears to be an erroneous additional weight, we havenorm_finalasnn.LayerNorminall_final_layer.2-1(FinalLayermodule) howeverelementwise_affine=FalsesoweightisNone, also if we check the weight itself in an example checkpoint (unstableRevolution_Bf16.safetensors) we see somenantensor([1.8447e+19, 1.9886e+31, nan, ..., 7.3242e-02, 4.6587e-20, 1.3733e-02], dtype=torch.bfloat16)and other extreme essentially junk values.Repro
Fixes #12887
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
cc @vladmandic