TuringsSolutions committed
Commit afd2e5c · verified · 1 Parent(s): 241277b

Update README.md

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -9,5 +9,19 @@ app_file: app.py
  pinned: false
  license: mit
  ---
+ # Dataset Energy & Entropy Analyzer (Gradio)
 
+ A lightweight app that analyzes a CSV and reports:
+ - Global compressibility (gzip ratio)
+ - Per-column entropy (numeric via quantile-binning; categorical via counts)
+ - Monotone runs and run-entropy (numeric columns)
+ - Sortedness fraction (numeric columns)
+ - 2D Pareto maxima (first two numeric columns)
+ - kd-partition entropy approximation (first two numeric columns)
+ - Overall **Harvestable Energy** score (0–1)
+
+ ## How it relates to "Harvestable Energy" & range-partition entropy
+ - Lower entropy and higher compressibility imply more exploitable structure → higher harvestable energy.
+ - kd-entropy approximates how many bits are needed to split spatial data into simple blocks (a proxy for range-partition entropy).
+ - Run-entropy captures how many monotone runs are present (adaptive sorting lens).
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
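
Several of the metrics named in the added README text can be illustrated with a minimal Python sketch. The README gives no formulas and the commit does not show `app.py`, so everything below is an assumption rather than the app's actual code: the function names are hypothetical, the gzip ratio is taken as compressed size over original size, runs are maximal non-decreasing runs, run-entropy uses the usual adaptive-sorting form (sum of `(r/n) * log2(n/r)` over run lengths `r`), and sortedness is the fraction of adjacent pairs already in order. Quantile-binning, the kd-partition entropy, and the combined score are left out because their definitions cannot be inferred from the diff.

```python
import gzip
import math
from collections import Counter


def gzip_ratio(raw: bytes) -> float:
    """Compressed size / original size (assumed definition; lower = more compressible)."""
    if not raw:
        return 1.0
    return len(gzip.compress(raw)) / len(raw)


def shannon_entropy(values) -> float:
    """Shannon entropy (bits) of the empirical distribution of categorical values."""
    counts = Counter(values)
    n = sum(counts.values())
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def monotone_runs(xs) -> list:
    """Lengths of maximal non-decreasing runs (assumption: ascending runs only)."""
    if not xs:
        return []
    runs, length = [], 1
    for prev, cur in zip(xs, xs[1:]):
        if cur >= prev:
            length += 1
        else:
            runs.append(length)
            length = 1
    runs.append(length)
    return runs


def run_entropy(xs) -> float:
    """Entropy (bits) of the run-length profile: sum of (r/n) * log2(n/r)."""
    runs = monotone_runs(xs)
    n = sum(runs)
    return sum((r / n) * math.log2(n / r) for r in runs) if n else 0.0


def sortedness_fraction(xs) -> float:
    """Fraction of adjacent pairs already in non-decreasing order."""
    if len(xs) < 2:
        return 1.0
    in_order = sum(1 for a, b in zip(xs, xs[1:]) if a <= b)
    return in_order / (len(xs) - 1)


def pareto_maxima_2d(points) -> list:
    """Points not dominated in both coordinates (maximizing x and y)."""
    maxima, best_y = [], -math.inf
    # Sweep from largest x to smallest; keep a point only if it raises the best y.
    for x, y in sorted(points, key=lambda p: (-p[0], -p[1])):
        if y > best_y:
            maxima.append((x, y))
            best_y = y
    return maxima
```

Under these assumed definitions, a fully sorted column has a single run, a run-entropy of 0 bits, and a sortedness fraction of 1.0, which matches the README's reading that fewer runs mean more exploitable structure.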