AI & ML interests

Building breakthrough AI to solve the world's biggest problems.

Recent Activity

hqfang  updated a model about 1 hour ago
allenai/MolmoAct-7B-D-Pretrain-RT-1-0812
hqfang  updated a model about 1 hour ago
allenai/MolmoAct-7B-D-Pretrain-0812
hqfang  updated a model about 1 hour ago
allenai/MolmoAct-7B-O-0812

allenai's collections 24

Tulu V2.5 Suite
A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more!
OLMo 2 Preview Post-trained Models
These models' tokenizers did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied. Resolved in the latest versions.