Update default tokenization behavior to "longest" in README
#2
by MichaelR207 - opened
When you use the default code in the README, the tokenizer padding is set to "max_length". This causes an OOM error even on an H100, because every sequence is padded to 131072 tokens, the maximum context length for Llama 3.2 3B. A much more reasonable behavior is to pad only to the length of the longest sequence in the batch, which is accomplished by setting "padding": "longest" in the tokenizer kwargs.
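For reference, a minimal sketch of the change, assuming the README tokenizes with standard `transformers` kwargs (the model ID below is a placeholder for illustration, not necessarily this repo's checkpoint):

```python
from transformers import AutoTokenizer

# Placeholder model ID; substitute the checkpoint used in this repo's README.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-3B")

texts = ["How do I sort a list in Python?", "Explain attention in one sentence."]

# Before: "padding": "max_length" pads every sequence to the model's full
# 131072-token context window, which exhausts memory even on an H100.
# After: "longest" pads only to the longest sequence in the batch.
tokenizer_kwargs = {"padding": "longest", "truncation": True, "return_tensors": "pt"}

inputs = tokenizer(texts, **tokenizer_kwargs)
print(inputs["input_ids"].shape)  # (2, longest_batch_length), not (2, 131072)
```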
Thank you!
Ray2333 changed pull request status to merged