DreamGen
DreamGenX
AI & ML interests
None yet
Recent Activity
new activity
about 24 hours ago
Qwen/QwQ-32B:Is this model native 128K context length, or YaRN extended?
new activity
2 days ago
nvidia/Llama-3_3-Nemotron-Super-49B-v1:trust_remote_code=True
Organizations
DreamGenX's activity
Is this model native 128K context length, or YaRN extended?
7
#28 opened 16 days ago
by
danielhanchen

trust_remote_code=True
1
#2 opened 2 days ago
by
DreamGenX

Lots of wrong ground truths
#1 opened 23 days ago
by
DreamGenX

How about a 3 way merge with a distillation from Mistral Large? :D
#7 opened 5 months ago
by
DreamGenX

Plans for updates
3
#1 opened 6 months ago
by
dosb
Any documents, paper explain how can you construct this awefull model?
2
#7 opened 9 months ago
by
anhnh2002

Check new, much better, version of this model
12
#5 opened 11 months ago
by
DreamGenX

System and followup turns are missing.
6
#2 opened 10 months ago
by
DreamGenX

Leaderboard stuck?
1
#754 opened 10 months ago
by
DreamGenX

Benchmarks
5
#5 opened 11 months ago
by
ChuckMcSneed

Please rerun failspy/llama-3-70B-Instruct-abliterated -- eval failed
1
#737 opened 11 months ago
by
DreamGenX

License?
2
#3 opened 11 months ago
by
DreamGenX

Best model for RP I have ever tried
12
#2 opened 11 months ago
by
Franchu
Update post-processor to add bos
5
#41 opened 11 months ago
by
pcuenq

Model won't stop generating [llama.cpp / koboldcpp]
12
#3 opened 11 months ago
by
DreamGenX

problem, he repeats
2
#2 opened 11 months ago
by
ClaudioItaly
Curious about Fine-tuning Methods
3
#4 opened 11 months ago
by
ElliottDyson
Special token issues
#1 opened 11 months ago
by
DreamGenX

Adding `safetensors` variant of this model
#1 opened 11 months ago
by
DreamGenX

Upload 18 files
2
#2 opened 12 months ago
by
rAIfle