ICML2023
AI & ML interests
None defined yet.
Recent Activity
View all activity
ICML2023's activity
ameerazam08Β
posted
an
update
about 11 hours ago
Delta-VectorΒ
posted
an
update
4 days ago
Post
559
For anyone that enjoys Magnum models, I just dropped a 12B that is the first (or second?) stepping stone into Magnum V5
Delta-Vector/rei-12b-6795505005c4a94ebdfdeb39
Delta-Vector/rei-12b-6795505005c4a94ebdfdeb39
Post
1574
R1 is out! And with a lot of other R1 releated models...
hystsΒ
updated
a
Space
20 days ago
vwxyzjnΒ
authored
5
papers
25 days ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Paper
β’
2403.17031
β’
Published
β’
4
A2C is a special case of PPO
Paper
β’
2205.09123
β’
Published
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Paper
β’
2410.18252
β’
Published
β’
5
TΓLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
β’
2411.15124
β’
Published
β’
58
2 OLMo 2 Furious
Paper
β’
2501.00656
β’
Published
β’
15
mbrackΒ
authored
a
paper
about 1 month ago
Post
7468
Google drops Gemini 2.0 Flash Thinking
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
KameshrΒ
authored
a
paper
about 2 months ago
Post
8477
QwQ-32B-Preview is now available in anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
Post
3910
Post
2872
anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
xzyaoΒ
authored
a
paper
2 months ago
Lupin1998Β
authored
a
paper
4 months ago