Proposal to revise multimodality statement
#12
by
dkleine
- opened
The current sentence in the model card
Gemma 3 models are multimodal, handling text and image input and generating text output
appears overly broad as not all Gemma 3 model sizes support image input (the smaller 270M and 1B variants are text-only).