Config structure prevents AutoModel compatibility (Bottleneck for T5Gemma-2 migration)

#1
by PhatcatDK - opened

This encoder-only extraction still carries the decoder settings in its config file, so unless transformers is explicitly told not to use the decoder, it may look for modules that don't exist.
Even with `is_encoder_decoder` set to `false`, the presence of the decoder configuration block causes `AutoModel` to attempt a full encoder-decoder initialization and fail. This forces users to load the checkpoint through the specific `T5GemmaEncoderModel` class rather than the standard `AutoModel` factory, which breaks cross-compatibility with newer T5Gemma-2-based architectures.
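To illustrate the kind of cleanup being asked for, here is a minimal sketch of stripping the decoder block out of a config dict before loading. The field names (`encoder`, `decoder`, `is_encoder_decoder`) are assumptions modeled on the nested T5Gemma config layout, not a verified fix for this repo:

```python
import json

# Hypothetical excerpt of the repo's config.json: the encoder-only
# extraction still carries a `decoder` block alongside the encoder one.
config = {
    "model_type": "t5gemma",
    "is_encoder_decoder": False,
    "encoder": {"hidden_size": 2048, "num_hidden_layers": 24},
    "decoder": {"hidden_size": 2048, "num_hidden_layers": 24},
}

def strip_decoder_fields(cfg: dict) -> dict:
    """Return a copy of the config without the decoder block, so a
    config-driven loader has nothing decoder-related to instantiate."""
    cleaned = {k: v for k, v in cfg.items() if k != "decoder"}
    cleaned["is_encoder_decoder"] = False  # make the intent explicit
    return cleaned

cleaned = strip_decoder_fields(config)
print(json.dumps(cleaned, indent=2))
```

In practice the cleaned dict would be written back to `config.json` (or folded into the model's config class) before loading, so that `AutoModel` never sees a decoder to build.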

Owner

Yes, the code needs to be refactored to ensure better compatibility and versatility. I will try to work on this in the coming weeks.
I want to note that using this with T5Gemma2 or other models will require creating a new adapter model from scratch, as they are incompatible with the previous version.

I stand corrected: `AutoModel` refuses to load the T5Gemma encoder at all, referring users to the specific `T5GemmaEncoder` class instead.

PhatcatDK changed discussion status to closed


I am aware; please take a look at the GitHub repo for the ComfyUI node for LLM-to-SDXL. I have already made some suggestions there and laid the groundwork for T5Gemma2.
