ghn3 / README.md
bknyaz's picture
upd readme
55dacc7
|
raw
history blame
577 Bytes
---
license: mit
---
# Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?
**To appear at [ICML 2023](https://icml.cc/Conferences/2023)**
[Boris Knyazev](http://bknyaz.github.io/), [Doha Hwang](https://mila.quebec/en/person/doha-hwang/), [Simon Lacoste-Julien](http://www.iro.umontreal.ca/~slacoste/)
https://arxiv.org/abs/2303.04143
- See the list of pretrained GHN models [here](https://huggingface.co/SamsungSAILMontreal/ghn3/tree/main).
- See code examples at [github.com/SamsungSAILMontreal/ghn3](https://github.com/SamsungSAILMontreal/ghn3)