remove reference to deprecated transformers code
#74 opened 1 day ago
by
winglian
Update README.md
#73 opened 2 days ago
by
SamimSaikia
DeepSeek R1 answer ChatGPT ??
4
#72 opened 2 days ago
by
valerebron
ValueError: Unrecognized configuration class <class 'transformers_modules.configuration_deepseek.DeepseekV3Config'> to build an AutoTokenizer.
1
#69 opened 3 days ago
by
ajtakto
Parallelized script
#67 opened 3 days ago
by
ajtakto
I am getting an error message while executing pip install -r requirements.txt
4
#64 opened 8 days ago
by
yu19920006607
Does deepseek allow adding new data?
#63 opened 12 days ago
by
JoshuaBontor
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#61 opened 16 days ago
by
cuichenx
captcha not loading on edge
#60 opened 18 days ago
by
leo-smi
Upload shreya.zip
#59 opened 18 days ago
by
Msdthala
Upload IMG_20250111_184317.jpg
#58 opened 19 days ago
by
Sajalhero
Auxiliary-loss-free expert routing
#56 opened 20 days ago
by
qing9
AI Games
#55 opened 21 days ago
by
ChickenUJHAYIUSGU
Upload IMG_0509 4.HEIC
#54 opened 21 days ago
by
borhanrabbany
how to inference with mtp?
#53 opened 22 days ago
by
duanyu
Does it support ollama
2
#52 opened 22 days ago
by
sminbb
Create gngn
#49 opened 22 days ago
by
axingd
Missing tool call in system prompt
#48 opened 23 days ago
by
bchenfireworks
Update config.json
#47 opened 23 days ago
by
STATIKwitak
Rename figures/benchmark.png to figures/𓇋𓀀𓍿.png
#46 opened 23 days ago
by
STATIKwitak
Rename figures/benchmark.png to figures/𓇋𓀀𓍿.png
#45 opened 23 days ago
by
STATIKwitak
Upload IMG_0295.HEIC
#42 opened 24 days ago
by
Umarkhan499
vLLM on A100s
6
#41 opened 25 days ago
by
fsaudm
When do you plan to integrate Huggingface Transformer?
#40 opened 25 days ago
by
echooooooooo
Deciphering messages
1
#39 opened 25 days ago
by
DoctorDonald
Update README.md
#38 opened 26 days ago
by
chaitanyayerroju
Update README.md
1
#37 opened 28 days ago
by
TomGrc
Training problem
3
#29 opened 28 days ago
by
DonGan13
Update README.md
1
#28 opened 28 days ago
by
Wisnet
Update README.md
2
#27 opened 28 days ago
by
Aikun7777777
Failed to run the model with 4 nodes of 8 4090
17
#25 opened 29 days ago
by
aisensiy
kill openai,come on
#24 opened 29 days ago
by
chaochaoli
Update modeling_deepseek.py
1
#23 opened 30 days ago
by
erichartford
is_torch_greater_or_equal_than_1_13 deprecated
#22 opened 30 days ago
by
erichartford
Request: DOI
#21 opened about 1 month ago
by
TheDandyMan
Has anyone tried running this model on Ollama?
6
#20 opened about 1 month ago
by
Yuxin362
vLLM on A100s
4
#19 opened about 1 month ago
by
fsaudm
Fine-tuning roadmap
4
#18 opened about 1 month ago
by
RonanMcGovern
CUDA out of memory error during fp8 to bf16 model conversion + fix
1
#17 opened about 1 month ago
by
sszymczyk
when llm leaderboard?
3
#14 opened about 1 month ago
by
blazespinnaker
Update README.md
#13 opened about 1 month ago
by
BANblongz
Please make V3-lite
3
#12 opened about 1 month ago
by
rombodawg
minimum vram?
11
#9 opened about 1 month ago
by
CHNtentes
Update README.md
#7 opened about 1 month ago
by
Spestly
Converted bf16 Model on Hugging Face
2
#5 opened about 1 month ago
by
OpenSourceRonin
Update README.md
#3 opened about 1 month ago
by
reach-vb
Smaller version for Home User GPUs
10
#2 opened about 1 month ago
by
apcameron
How can we thank you enough, whale bros?
10
#1 opened about 1 month ago
by
KrishnaKaasyap