Update HART-SURYA_README.md
Browse files- HART-SURYA_README.md +10 -1
HART-SURYA_README.md
CHANGED
|
@@ -1,6 +1,15 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
I examined the patchifying and positional encoding and tokenization methods employed in the Surya model.
|
| 2 |
|
| 3 |
-
I posted
|
| 4 |
|
| 5 |
I find the input/tokenization method to be surprisingly simple, and perhaps inefficient. It seems that the transformer model is receiving square images centered about the sun and containing much dark space around the sun that contains no information. Although a transformer model is smart and can figure things out, it is usually better to optimize the inputs to reduce noise and remove extraneous data that only consumes compute and power (garbage in, garbage out) and adds nothing to the best results.
|
| 6 |
|
|
|
|
| 1 |
+
This HART-SURYA proposal is made in response to the article that I received mention of in an email today.
|
| 2 |
+
See e.g.,
|
| 3 |
+
https://science.nasa.gov/science-research/artificial-intelligence-model-heliophysics/?utm_source=whatsupinai.beehiiv.com&utm_medium=newsletter&utm_campaign=pixel-as-the-true-ai-first-smartphone-nasa-ibm-reveal-surya-gemini-replaces-assistant-on-nest-devices
|
| 4 |
+
https://science.data.nasa.gov/features-events/inside-surya-solar-ai-model
|
| 5 |
+
See
|
| 6 |
+
https://huggingface.co/nasa-ibm-ai4science/Surya-1.0/blob/main/README.md
|
| 7 |
+
and
|
| 8 |
+
https://github.com/NASA-IMPACT/Surya/blob/main/README.md
|
| 9 |
+
|
| 10 |
I examined the patchifying and positional encoding and tokenization methods employed in the Surya model.
|
| 11 |
|
| 12 |
+
I posted the following proposals for Surya model upgrades at https://github.com/NASA-IMPACT/Surya/issues/21
|
| 13 |
|
| 14 |
I find the input/tokenization method to be surprisingly simple, and perhaps inefficient. It seems that the transformer model is receiving square images centered about the sun and containing much dark space around the sun that contains no information. Although a transformer model is smart and can figure things out, it is usually better to optimize the inputs to reduce noise and remove extraneous data that only consumes compute and power (garbage in, garbage out) and adds nothing to the best results.
|
| 15 |
|