Update README.md (#3)
Commit: af077e000916bc4aee445e5e8c11599b63a83a9d
Co-authored-by: Heloise Chomet <[email protected]>
# ViSNet

## Reference

Yusong Wang, Tong Wang, Shaoning Li, Xinheng He, Mingyu Li, Zun Wang, Nanning Zheng, Bin Shao, and Tie-Yan Liu.
Enhancing geometric representations for molecules with equivariant vector-scalar interactive message passing.
Nature Communications, 15(1), January 2024. ISSN: 2041-1723.
URL: https://dx.doi.org/10.1038/s41467-023-43720-2.
## How to Use

For complete usage instructions and more information, please refer to our [documentation](https://instadeep.github.io/mlip).
## Model architecture

| Parameter           | Value     | Description                                                               |
|---------------------|-----------|---------------------------------------------------------------------------|
| `num_layers`        | `4`       | Number of ViSNet layers.                                                  |
| `num_channels`      | `128`     | Number of channels.                                                       |
| `l_max`             | `2`       | Highest harmonic order included in the spherical harmonics series.        |
| `num_heads`         | `8`       | Number of heads in the attention block.                                   |
| `num_rbf`           | `32`      | Number of radial basis functions in the embedding block.                  |
| `trainable_rbf`     | `False`   | Whether to add learnable weights to the radial embedding basis functions. |
| `activation`        | `silu`    | Activation function for the output block.                                 |
| `attn_activation`   | `silu`    | Activation function for the attention block.                              |
| `vecnorm_type`      | `None`    | Type of the vector norm.                                                  |
| `atomic_energies`   | `average` | Treatment of the atomic energies.                                         |
| `avg_num_neighbors` | `None`    | Mean number of neighbors.                                                 |

For more information about ViSNet hyperparameters,
please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/models/visnet.html#mlip.models.visnet.config.VisnetConfig).
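As a quick reference, the table above can be collected into a plain Python dictionary. This is only a hand-written sketch: the key names follow the table, and whether they map one-to-one onto the `VisnetConfig` fields should be checked against the documentation linked above.

```python
# Hyperparameters from the model-architecture table. Whether these map
# exactly onto mlip's VisnetConfig should be verified against the docs.
visnet_hyperparameters = {
    "num_layers": 4,
    "num_channels": 128,
    "l_max": 2,
    "num_heads": 8,
    "num_rbf": 32,
    "trainable_rbf": False,
    "activation": "silu",
    "attn_activation": "silu",
    "vecnorm_type": None,
    "atomic_energies": "average",
    "avg_num_neighbors": None,
}
```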
## Training

Training is performed over 220 epochs, with an exponential moving average (EMA) decay rate of 0.99.
The model employs a Huber loss function with scheduled weights for the energy and force components.
Initially, the energy term is weighted at 40 and the force term at 1000.
At epoch 115, these weights are flipped.
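The weight flip can be sketched as a simple step schedule. This is illustrative only; the helper names and the exact way mlip combines the per-term Huber losses are our assumptions, not the library's implementation.

```python
def loss_weights(epoch: int, flip_epoch: int = 115) -> tuple[float, float]:
    """Return (energy_weight, force_weight) for a given epoch.

    Before `flip_epoch`, the energy term is weighted 40 and the force
    term 1000; from `flip_epoch` on, the weights are swapped.
    """
    return (40.0, 1000.0) if epoch < flip_epoch else (1000.0, 40.0)


def weighted_huber_loss(huber_energy: float, huber_forces: float,
                        epoch: int) -> float:
    """Combine per-term Huber losses with the scheduled weights
    (illustrative combination, not mlip's exact loss code)."""
    w_e, w_f = loss_weights(epoch)
    return w_e * huber_energy + w_f * huber_forces
```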
We use our default MLIP optimizer in v1.0.0 with the following settings:
| Parameter                         | Value    | Description                                          |
|-----------------------------------|----------|------------------------------------------------------|
| `init_learning_rate`              | `0.0001` | Initial learning rate.                               |
| `peak_learning_rate`              | `0.0001` | Peak learning rate.                                  |
| `final_learning_rate`             | `0.0001` | Final learning rate.                                 |
| `weight_decay`                    | `0`      | Weight decay.                                        |
| `warmup_steps`                    | `4000`   | Number of optimizer warm-up steps.                   |
| `transition_steps`                | `360000` | Number of optimizer transition steps.                |
| `grad_norm`                       | `500`    | Gradient norm used for gradient clipping.            |
| `num_gradient_accumulation_steps` | `1`      | Steps to accumulate before taking an optimizer step. |

For more information about the optimizer,
please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/training/optimizer.html#mlip.training.optimizer_config.OptimizerConfig).
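To illustrate how the schedule parameters interact, here is a generic warmup-then-decay learning-rate function. The linear shape is an assumption (the exact schedule family used by the MLIP optimizer is described in the docs linked above); with `init == peak == final`, as in the table, the schedule reduces to a constant learning rate.

```python
def learning_rate(step: int, init_lr: float = 1e-4, peak_lr: float = 1e-4,
                  final_lr: float = 1e-4, warmup_steps: int = 4000,
                  transition_steps: int = 360000) -> float:
    """Linear warmup from init_lr to peak_lr over `warmup_steps`, then
    linear transition to final_lr over `transition_steps` (sketch only).

    With init_lr == peak_lr == final_lr, as configured for this model,
    the result is a constant learning rate at every step.
    """
    if step < warmup_steps:
        frac = step / warmup_steps
        return init_lr + frac * (peak_lr - init_lr)
    frac = min((step - warmup_steps) / transition_steps, 1.0)
    return peak_lr + frac * (final_lr - peak_lr)
```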
## Dataset

| Parameter               | Value | Description                                 |
|-------------------------|-------|---------------------------------------------|
| `graph_cutoff_angstrom` | `5`   | Graph cutoff distance (in Å).               |
| `max_n_node`            | `32`  | Maximum number of nodes allowed in a batch. |
| `max_n_edge`            | `288` | Maximum number of edges allowed in a batch. |
| `batch_size`            | `16`  | Number of graphs in a batch.                |

This model was trained on the [SPICE2_curated dataset](https://huggingface.co/datasets/InstaDeepAI/SPICE2-curated).
For more information about dataset configuration,
please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/data/dataset_configs.html#mlip.data.configs.GraphDatasetBuilderConfig).
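The node and edge caps above constrain how graphs can be packed into a batch. The greedy packer below is a sketch of that idea under our own assumptions; it is not the mlip batching code, which is described in the documentation linked above.

```python
def pack_graphs(graphs, max_n_node=32, max_n_edge=288, batch_size=16):
    """Greedily pack (n_node, n_edge) pairs into batches that respect
    the caps above (illustrative sketch, not mlip's implementation).

    A new batch is started whenever adding the next graph would exceed
    the node cap, the edge cap, or the graphs-per-batch limit.
    """
    batches, current, nodes, edges = [], [], 0, 0
    for n_node, n_edge in graphs:
        over_limit = (len(current) == batch_size
                      or nodes + n_node > max_n_node
                      or edges + n_edge > max_n_edge)
        if current and over_limit:
            batches.append(current)
            current, nodes, edges = [], 0, 0
        current.append((n_node, n_edge))
        nodes += n_node
        edges += n_edge
    if current:
        batches.append(current)
    return batches
```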
## License summary

1. The Licensed Models are **only** available under this License for Non-Commercial Purposes.