|
--- |
|
license: mit |
|
--- |
|
|
|
# Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models? |
|
|
|
**To appear at [ICML 2023](https://icml.cc/Conferences/2023)** |
|
|
|
[Boris Knyazev](http://bknyaz.github.io/), [Doha Hwang](https://mila.quebec/en/person/doha-hwang/), [Simon Lacoste-Julien](http://www.iro.umontreal.ca/~slacoste/) |
|
|
|
https://arxiv.org/abs/2303.04143 |
|
|
|
- See the list of pretrained GHN models [here](https://huggingface.co/SamsungSAILMontreal/ghn3/tree/main). |
|
|
|
- See code examples at [github.com/SamsungSAILMontreal/ghn3](https://github.com/SamsungSAILMontreal/ghn3) |