add readme
README.md
CHANGED
@@ -34,24 +34,35 @@ At minium, the metric requires the original time-series and the generated time-s
|
|
34 |
### Inputs
|
35 |
- **predictions**: (list of list of list of float or numpy.ndarray): The generated time-series. The shape of the array should be `(num_generation, seq_len, num_features)`.
|
36 |
- **references**: (list of list of list of float or numpy.ndarray): The original time-series. The shape of the array should be `(num_reference, seq_len, num_features)`.
|
|
|
37 |
|
38 |
### Output Values
|
39 |
|
40 |
-
|
41 |
|
42 |
-
|
43 |
|
44 |
#### Values from Popular Papers
|
45 |
-
*Give examples, preferably with links to leaderboards or publications, to papers that have reported this metric, along with the values they have reported.*
|
46 |
|
47 |
### Examples
|
48 |
-
*Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.*
|
49 |
|
50 |
## Limitations and Bias
|
51 |
-
|
52 |
|
53 |
## Citation
|
54 |
-
*Cite the source where this metric was introduced.*
|
55 |
|
56 |
## Further References
|
57 |
-
*Add any useful further references.*
|
|
|
34 |
### Inputs
|
35 |
- **predictions**: (list of list of list of float or numpy.ndarray): The generated time-series. The shape of the array should be `(num_generation, seq_len, num_features)`.
|
36 |
- **references**: (list of list of list of float or numpy.ndarray): The original time-series. The shape of the array should be `(num_reference, seq_len, num_features)`.
|
37 |
+
- **batch_size**: (int, optional): The batch size used when computing the metric. The per-batch cost of the pairwise computation grows quadratically with this value. Default is None.
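One way to read the quadratic effect of `batch_size` is that every generated instance is compared with every reference instance, so a batch of `b` predictions against a batch of `b` references yields a `b x b` block of MSE values. The sketch below illustrates that reading with NumPy only; `batched_pairwise_mse` is a hypothetical helper written for this illustration, not the metric's actual implementation.

```python
import numpy as np

def batched_pairwise_mse(predictions, references, batch_size=None):
    """Hypothetical helper: (n, m) matrix whose [i, j] entry is MSE(p_i, r_j)."""
    p = np.asarray(predictions, dtype=float)  # (num_generation, seq_len, num_features)
    r = np.asarray(references, dtype=float)   # (num_reference, seq_len, num_features)
    n, m = len(p), len(r)
    if batch_size is None:
        # Assumption: None means "process everything in one block".
        batch_size = max(n, m, 1)
    out = np.empty((n, m))
    for i in range(0, n, batch_size):
        for j in range(0, m, batch_size):
            # Each block holds up to batch_size * batch_size pairs at once.
            diff = p[i:i + batch_size, None] - r[None, j:j + batch_size]
            out[i:i + batch_size, j:j + batch_size] = (diff ** 2).mean(axis=(2, 3))
    return out
```

Under this reading, a smaller `batch_size` lowers peak memory at the cost of more loop iterations, while the resulting matrix is unchanged.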
|
38 |
|
39 |
### Output Values
|
40 |
|
41 |
+
Let prediction instances be $P = \{p_1, p_2, \ldots, p_n\}$ and reference instances be $R = \{r_1, r_2, \ldots, r_m\}$.
|
42 |
|
43 |
+
- **matching_mse**: (float): For each generated instance, the MSE to the reference instance with the lowest MSE, averaged over all generated instances. Intuitively, this is similar to precision in classification. As an equation: $\frac{1}{n} \sum_{i=1}^{n} \min_{j} \mathrm{MSE}(p_i, r_j)$.
|
44 |
+
- **covered_mse**: (float): For each reference instance, the MSE to the generated instance with the lowest MSE, averaged over all reference instances. Intuitively, this is similar to recall in classification. As an equation: $\frac{1}{m} \sum_{j=1}^{m} \min_{i} \mathrm{MSE}(p_i, r_j)$.
|
45 |
+
- **harmonic_mean**: (float): Harmonic mean of the matching_mse and covered_mse. This is similar to F1-score in classification.
|
46 |
+
- **index_mse**: (float): Average of the MSE between the generated instance and the reference instance with the same index. As an equation: $\frac{1}{n} \sum_{i=1}^{n} \mathrm{MSE}(p_i, r_i)$.
|
47 |
+
- **matching_mse_features**: (list of float): matching_mse computed individually for each feature.
|
48 |
+
- **covered_mse_features**: (list of float): covered_mse computed individually for each feature.
|
49 |
+
- **harmonic_mean_features**: (list of float): harmonic_mean computed individually for each feature.
|
50 |
+
- **index_mse_features**: (list of float): index_mse computed individually for each feature.
|
51 |
+
- **macro_matching_mse**: (float): Average of the matching_mse_features.
|
52 |
+
- **macro_covered_mse**: (float): Average of the covered_mse_features.
|
53 |
+
- **macro_harmonic_mean**: (float): Average of the harmonic_mean_features.
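The output definitions above can be sketched in NumPy as follows. This is an illustrative reading of the formulas, not the metric's own code: `matching_scores` is a hypothetical function name, returning 0 for the harmonic mean when both inputs are 0 is an assumption, and so is returning `None` for the index-based values when the numbers of predictions and references differ.

```python
import numpy as np

def matching_scores(predictions, references):
    """Hypothetical sketch of the outputs listed above."""
    p = np.asarray(predictions, dtype=float)  # (n, seq_len, num_features)
    r = np.asarray(references, dtype=float)   # (m, seq_len, num_features)
    n, m = len(p), len(r)
    sq = (p[:, None] - r[None, :]) ** 2       # (n, m, seq_len, num_features)
    mse = sq.mean(axis=(2, 3))                # (n, m): MSE(p_i, r_j)
    mse_feat = sq.mean(axis=2)                # (n, m, num_features)

    def hmean(a, b):
        # Harmonic mean 2ab / (a + b); defined as 0 when a + b == 0 (assumption).
        a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
        total = a + b
        safe = np.where(total > 0, total, 1.0)
        return np.where(total > 0, 2 * a * b / safe, 0.0)

    matching = mse.min(axis=1).mean()                   # best reference per prediction
    covered = mse.min(axis=0).mean()                    # best prediction per reference
    matching_f = mse_feat.min(axis=1).mean(axis=0)      # (num_features,)
    covered_f = mse_feat.min(axis=0).mean(axis=0)       # (num_features,)
    harmonic_f = hmean(matching_f, covered_f)

    same = n == m  # index-based values need a one-to-one pairing (assumption)
    idx = np.arange(n)
    return {
        "matching_mse": float(matching),
        "covered_mse": float(covered),
        "harmonic_mean": float(hmean(matching, covered)),
        "index_mse": float(np.diag(mse).mean()) if same else None,
        "matching_mse_features": matching_f.tolist(),
        "covered_mse_features": covered_f.tolist(),
        "harmonic_mean_features": harmonic_f.tolist(),
        "index_mse_features": mse_feat[idx, idx].mean(axis=0).tolist() if same else None,
        "macro_matching_mse": float(matching_f.mean()),
        "macro_covered_mse": float(covered_f.mean()),
        "macro_harmonic_mean": float(harmonic_f.mean()),
    }
```

For example, with predictions `[[[0.], [0.]], [[1.], [1.]]]` and references `[[[0.], [0.]], [[3.], [3.]]]`, the pairwise MSE matrix is `[[0, 9], [1, 4]]`, so `matching_mse` is 0.5 and `covered_mse` is 2.0.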
|
54 |
|
55 |
#### Values from Popular Papers
|
56 |
+
<!-- *Give examples, preferably with links to leaderboards or publications, to papers that have reported this metric, along with the values they have reported.* -->
|
57 |
|
58 |
### Examples
|
59 |
+
<!-- *Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.* -->
|
60 |
|
61 |
## Limitations and Bias
|
62 |
+
This metric assumes that the generated time-series should match the original time-series, which does not hold in every scenario. It may therefore be unsuitable for evaluating time-series generation models that are not required to reproduce the original time-series.
|
63 |
|
64 |
## Citation
|
65 |
+
<!-- *Cite the source where this metric was introduced.* -->
|
66 |
|
67 |
## Further References
|
68 |
+
<!-- *Add any useful further references.* -->
|