MrDragonFox

AI & ML interests

None yet

Recent Activity

Organizations

DeepGHS, SynthoCraft Ai, FoxEngineAi, Social Post Explorers, Mistral AI Game Jam

MrDragonFox's activity

Just limit vLLM to one GPU and run the rest on another one, or use -gmu.
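A minimal sketch of that tip, assuming "-gmu" refers to vLLM's `--gpu-memory-utilization` option and that GPU pinning is done through the `CUDA_VISIBLE_DEVICES` environment variable. The helper name `build_vllm_launch` and the model ID `my-org/my-model` are hypothetical placeholders, not something from the original reply. The code only builds the environment and command line; it does not start a server.

```python
import os
import shlex


def build_vllm_launch(model, gpu_index=0, memory_fraction=0.5):
    """Build the environment and argv for a vLLM server pinned to one GPU.

    Hypothetical helper for illustration only.
    """
    if not 0.0 < memory_fraction <= 1.0:
        raise ValueError("memory_fraction must be in (0, 1]")

    # Copy the current environment and expose only one GPU to the process.
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_index)

    # Cap the share of that GPU's memory that vLLM is allowed to reserve.
    cmd = [
        "vllm", "serve", model,
        "--gpu-memory-utilization", str(memory_fraction),
    ]
    return env, cmd


if __name__ == "__main__":
    # "my-org/my-model" is a placeholder model ID (assumption).
    env, cmd = build_vllm_launch("my-org/my-model", gpu_index=0, memory_fraction=0.5)
    print("CUDA_VISIBLE_DEVICES=" + env["CUDA_VISIBLE_DEVICES"], shlex.join(cmd))
    # To actually launch: subprocess.Popen(cmd, env=env)
```

With vLLM pinned to GPU 0 this way, the other workload can be started with `CUDA_VISIBLE_DEVICES=1`, or share the same card if the memory fraction leaves enough room.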

The 8B repo is empty and the dataset is empty too. Well, it's a little off from SOTA. Tbh GLM-4-Voice had better results, but it's certainly an "OK" PoC.

The GitHub repo is empty and there is no paper.

replied to mitkox's post 23 days ago

With 250 GB of RAM used ^^ it's probably running at a 2-bit quant.

New activity in deepseek-ai/DeepSeek-V3 about 1 month ago

When GGUF?

25
#6 opened about 1 month ago by
ChuckMcSneed
New activity in byroneverson/glm-4-9b-chat-abliterated about 1 month ago

GLM

4
#2 opened about 2 months ago by
MrDragonFox
New activity in THUDM/glm-4-voice-9b about 2 months ago

nnsight output logits nan

#1 opened about 2 months ago by
MrDragonFox
New activity in mistralai/Mistral-Large-Instruct-2411 about 2 months ago

Disappointing

3
#11 opened about 2 months ago by
ChuckMcSneed
reacted to danielhanchen's post with 🔥 2 months ago
New activity in kyutai/mimi 4 months ago

Training code

2
#1 opened 4 months ago by
ChristophSchuhmann
New activity in mistral-community/pixtral-12b-240910 5 months ago

Any Inference code?

12
#6 opened 5 months ago by
DongfuJiang
New activity in G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b 5 months ago

really good model

2
#2 opened 5 months ago by
gileneo