Proposal to revise multimodality statement

#12

by dkleine - opened 16 days ago

16 days ago

The current sentence in the model card

Gemma 3 models are multimodal, handling text and image input and generating text output

appears overly broad as not all Gemma 3 model sizes support image input (the smaller 270M and 1B variants are text-only).

Renu11

Google org 8 days ago

Hi @dkleine Thank you for bringing this to our attention. We will forward this feedback to the relevant team to clarify the description.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment