google/flan-t5-large, xl, and xxl have 780M, 3B, and 11B parameters, respectively.
Infinity uses flan-t5-xl (3B) by default, but my GPU is a single RTX 4090 with 24 GB of VRAM, which hits an OOM error when encoding the texts.
I want to use google/flan-t5-large as the text encoder instead, but I get an error saying the tensor sizes don't match the Infinity-2B model's state_dict.
Is the Infinity-2B model only compatible with flan-t5-xl and xxl because of their 2048-dimensional output?
How can I use the Infinity-2B model together with google/flan-t5-large?
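For context, here is a minimal sketch of the shape mismatch and one conceivable workaround: a learned linear projection from flan-t5-large's 1024-dim hidden states up to the 2048 dims the checkpoint expects. This is only an illustration with assumed dimensions (1024 for flan-t5-large, 2048 for flan-t5-xl); such an adapter would also need training or fine-tuning, since the pretrained cross-attention weights were fit to flan-t5-xl's embedding space.

```python
import torch
import torch.nn as nn

LARGE_DIM = 1024  # assumed d_model of flan-t5-large
XL_DIM = 2048     # assumed d_model of flan-t5-xl (what the checkpoint expects)

# Hypothetical adapter: map large-encoder embeddings into the xl-sized space.
project = nn.Linear(LARGE_DIM, XL_DIM)

# Stand-in for flan-t5-large encoder output: (batch, seq_len, hidden)
text_emb = torch.randn(1, 32, LARGE_DIM)
projected = project(text_emb)
print(projected.shape)  # torch.Size([1, 32, 2048])
```

Without a projection (or retraining), loading a 1024-dim encoder against a state_dict built for 2048-dim inputs will fail exactly as described above.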