Typos

#80
by iandanforth - opened

"knowledge and technics" -> "knowledge and techniques"

"here to changes that" -> "here to change that"

"simplest to the most raffined one" -> "simplest to the most refined"

"We'll assumes you" -> "We'll assume you"

"how deep learning model are trained" -> "how deep learning models are trained"

"to fully understand how how performing LLMs" -> "to fully understand how high performing LLMs" (guessing at the intent here)

"what it’s advantages and limits are" -> "what its advantages and limits are"

iandanforth changed discussion title from Typo to Typos

In the cheatsheet:
"ep: context parallelism" → "ep: expert parallelism"

thanks for this amazing book HF !

"When training a neural network model, one store several items in memory:" → When training a neural network model, one stores several items in memory:

Nanotron Research org
edited 2 days ago

Awesome! Thanks a lot for the pull request

Is the formula bst=bs∗seq correct? bs = bst * seq seems like the correct formula.

and are roughtly familiar -> and are roughly familiar

Nanotron Research org

Is the formula bst=bs∗seq correct? bs = bst * seq seems like the correct formula.

depends on how you define "bst" and "bs". We chose to define "bst" as batch size in tokens, which would be bs*seq (batch size in samples times sample length)

Small typos:

  • "Using the Pytorch profiler we can understand how memory is allocated througho ut training" -> "Using the Pytorch profiler we can understand how memory is allocated throughout training"
  • "Why does the first step looks different:" -> "Why does the first step look different:"
  • The TeX type text is not visible here: image.png

Sign up or log in to comment