Update README.md
Browse files
README.md
CHANGED
@@ -31,6 +31,8 @@ MolmoAct is trained on a subset of OXE and MolmoAct Dataset, a dataset with 10k
|
|
31 |
|
32 |
This checkpoint is a **preview** of the MolmoAct release. All artifacts used in creating MolmoAct (data, training code, evaluations, intermediate checkpoints) will be made available at a later date, furthering our commitment to open-source AI development and reproducibility.
|
33 |
|
|
|
|
|
34 |
Quick links:
|
35 |
- π [All Models](https://huggingface.co/collections/allenai/molmoact-689697591a3936fba38174d7)
|
36 |
- π [All Data](https://huggingface.co/collections/allenai/molmoact-data-mixture-6897e583e13b6c2cf3ea2b80)
|
@@ -64,7 +66,7 @@ ckpt = "allenai/MolmoAct-7B-D-Pretrain-RT-1-0812"
|
|
64 |
processor = AutoProcessor.from_pretrained(
|
65 |
ckpt,
|
66 |
trust_remote_code=True,
|
67 |
-
torch_dtype="
|
68 |
device_map="auto",
|
69 |
padding_side="left",
|
70 |
)
|
@@ -73,7 +75,7 @@ processor = AutoProcessor.from_pretrained(
|
|
73 |
model = AutoModelForImageTextToText.from_pretrained(
|
74 |
ckpt,
|
75 |
trust_remote_code=True,
|
76 |
-
torch_dtype="
|
77 |
device_map="auto",
|
78 |
)
|
79 |
|
|
|
31 |
|
32 |
This checkpoint is a **preview** of the MolmoAct release. All artifacts used in creating MolmoAct (data, training code, evaluations, intermediate checkpoints) will be made available at a later date, furthering our commitment to open-source AI development and reproducibility.
|
33 |
|
34 |
+
**Update:** Checkpoints are now stored in FP32 (previously BF16). The model was trained in FP32, so publishing FP32 weights aligns with training and enables fine-tuning or continued training directly from this repo. For inference, you can still run BF16 by casting at load, which is what we did for evaluations. See more in the [instructions](#quick-start) below.
|
35 |
+
|
36 |
Quick links:
|
37 |
- π [All Models](https://huggingface.co/collections/allenai/molmoact-689697591a3936fba38174d7)
|
38 |
- π [All Data](https://huggingface.co/collections/allenai/molmoact-data-mixture-6897e583e13b6c2cf3ea2b80)
|
|
|
66 |
processor = AutoProcessor.from_pretrained(
|
67 |
ckpt,
|
68 |
trust_remote_code=True,
|
69 |
+
torch_dtype="bfloat16",
|
70 |
device_map="auto",
|
71 |
padding_side="left",
|
72 |
)
|
|
|
75 |
model = AutoModelForImageTextToText.from_pretrained(
|
76 |
ckpt,
|
77 |
trust_remote_code=True,
|
78 |
+
torch_dtype="bfloat16",
|
79 |
device_map="auto",
|
80 |
)
|
81 |
|