T5Gemma failing on provided example #39522

@jadermcs

Description

System Info

  • transformers version: 4.53.2
  • Platform: Linux-6.14.0-23-generic-x86_64-with-glibc2.41
  • Python version: 3.13.3
  • Huggingface_hub version: 0.33.4
  • Safetensors version: 0.5.3
  • Accelerate version: 1.8.1
  • Accelerate config:
    • compute_environment: LOCAL_MACHINE
    • distributed_type: NO
    • mixed_precision: bf16
    • use_cpu: False
    • debug: False
    • num_processes: 1
    • machine_rank: 0
    • num_machines: 1
    • gpu_ids: all
    • rdzv_backend: static
    • same_network: True
    • main_training_function: main
    • enable_cpu_affinity: True
    • downcast_bf16: no
    • tpu_use_cluster: False
    • tpu_use_sudo: False
    • tpu_env: []
    • dynamo_config: {'dynamo_backend': 'INDUCTOR'}
  • DeepSpeed version: not installed
  • PyTorch version (accelerator?): 2.7.1+cu128 (CUDA)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using distributed or parallel set-up in script?:
  • Using GPU in script?:
  • GPU type: NVIDIA GeForce RTX 5060 Ti

Who can help?

@ArthurZucker and @itazap

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Run the example from the T5Gemma docs page.

echo -e "Question: Why is the sky blue? Answer:" | transformers run --task text2text-generation --model google/t5gemma-s-s-ul2 --device 0

Expected behavior

When I run the command above, it fails with:

File ".venv/lib/python3.13/site-packages/transformers/configuration_utils.py", line 209, in __getattribute__
    return super().__getattribute__(key)
           ~~~~~~~~~~~~~~~~~~~~~~~~^^^^^
AttributeError: 'T5GemmaConfig' object has no attribute 'vocab_size'

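Indeed, vocab_size is not a direct attribute of T5GemmaConfig. It is a sub-attribute of the nested encoder and decoder configs, so any code that reads config.vocab_size on the top-level object raises this error.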
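
To illustrate the failure mode, here is a minimal sketch of a composite config whose vocab_size only exists on the nested encoder/decoder objects, plus a fallback lookup. The classes `SubConfig` and `NestedConfig`, the helper `resolve_vocab_size`, and the value 1000 are hypothetical stand-ins made up for this example. They are not the transformers classes and not necessarily how a fix in the library would look.

```python
from dataclasses import dataclass, field


@dataclass
class SubConfig:
    """Hypothetical stand-in for a nested encoder or decoder config."""
    vocab_size: int = 1000  # placeholder value, not the real model's vocabulary size


@dataclass
class NestedConfig:
    """Hypothetical stand-in for a composite config with no top-level vocab_size."""
    encoder: SubConfig = field(default_factory=SubConfig)
    decoder: SubConfig = field(default_factory=SubConfig)


def resolve_vocab_size(config):
    """Return vocab_size from the config itself, else from its decoder, else its encoder."""
    candidates = (
        config,
        getattr(config, "decoder", None),
        getattr(config, "encoder", None),
    )
    for holder in candidates:
        size = getattr(holder, "vocab_size", None)
        if size is not None:
            return size
    raise AttributeError(f"{type(config).__name__} has no vocab_size at any level")


cfg = NestedConfig()
print(hasattr(cfg, "vocab_size"))  # the top-level object lacks the attribute
print(resolve_vocab_size(cfg))     # the fallback finds it on the decoder
```

With the real library, the analogous workaround would presumably be to read the value from the nested config (for example the decoder's) rather than from the top-level object, but I have not verified that against the pipeline code path that raises here.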
