Why does a 7B model take up about 55 GB of RAM?

#67
by omersajid - opened

Why does a 7B model take up about 50-55 GB of RAM?


parallel_size: int = 16
Changing this to 1 brought peak usage down to ~16 GB.

In the generation_inference.py file?

Yeah :3

Line 60, I believe.
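The reason that one parameter matters so much: parallel_size is the number of images generated in a single batch, so activation and KV-cache memory grow roughly linearly with it (and image generation with classifier-free guidance typically runs two streams per image, doubling the effective batch). A rough back-of-envelope sketch, using hypothetical layer/head/sequence numbers rather than values read from the Janus code:

```python
# Rough illustration: KV-cache memory scales linearly with parallel_size.
# All model dimensions below are hypothetical placeholders.
def kv_cache_bytes(parallel_size, layers=30, heads=32, head_dim=128,
                   seq_len=600, bytes_per_el=2, cfg_factor=2):
    # cfg_factor=2: classifier-free guidance runs a conditional and an
    # unconditional stream per image, doubling the effective batch.
    batch = parallel_size * cfg_factor
    # factor of 2 for keys plus values
    return 2 * batch * layers * heads * head_dim * seq_len * bytes_per_el

gb = 1024 ** 3
print(f"parallel_size=16: {kv_cache_bytes(16) / gb:.1f} GiB")
print(f"parallel_size=1:  {kv_cache_bytes(1) / gb:.1f} GiB")
```

With these placeholder numbers, dropping parallel_size from 16 to 1 cuts this buffer by 16x, which matches the observed drop in peak RAM.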

Yeah, but for me it's taking more than 30 GB just to load the model.
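That load-time figure is consistent with the weights alone: 7B parameters in fp32 are about 26 GiB before any buffers or activations, so loading in full precision can easily exceed 30 GB, while casting to bf16/fp16 roughly halves it. A quick sanity check:

```python
# Memory for the model weights alone at different precisions.
params = 7e9  # "7B" parameters

for name, bytes_per_param in [("fp32", 4), ("fp16/bf16", 2), ("int8", 1)]:
    print(f"{name}: {params * bytes_per_param / 1024**3:.1f} GiB")
```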

Hi, can you please share how you set up the model? I can't seem to load it with transformers; it complains about not knowing the model.


You can find instructions here: https://github.com/deepseek-ai/Janus?tab=readme-ov-file#janus-pro
They worked for me with no issues


thanks!
