bowdbeg commited on
Commit
efa4c13
·
1 Parent(s): a63936d

add readme

Browse files
Files changed (1) hide show
  1. README.md +18 -7
README.md CHANGED
@@ -34,24 +34,35 @@ At minium, the metric requires the original time-series and the generated time-s
34
  ### Inputs
35
  - **predictions**: (list of list of list of float or numpy.ndarray): The generated time-series. The shape of the array should be `(num_generation, seq_len, num_features)`.
36
  - **references**: (list of list of list of float or numpy.ndarray): The original time-series. The shape of the array should be `(num_reference, seq_len, num_features)`.
 
37
 
38
  ### Output Values
39
 
40
- *Explain what this metric outputs and provide an example of what the metric output looks like. Modules should return a dictionary with one or multiple key-value pairs, e.g. {"bleu" : 6.02}*
41
 
42
- *State the range of possible values that the metric's output can take, as well as what in that range is considered good. For example: "This metric can take on any value between 0 and 100, inclusive. Higher scores are better."*
 
 
 
 
 
 
 
 
 
 
43
 
44
  #### Values from Popular Papers
45
- *Give examples, preferrably with links to leaderboards or publications, to papers that have reported this metric, along with the values they have reported.*
46
 
47
  ### Examples
48
- *Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.*
49
 
50
  ## Limitations and Bias
51
- *Note any known limitations or biases that the metric has, with links and references if possible.*
52
 
53
  ## Citation
54
- *Cite the source where this metric was introduced.*
55
 
56
  ## Further References
57
- *Add any useful further references.*
 
34
  ### Inputs
35
  - **predictions**: (list of list of list of float or numpy.ndarray): The generated time-series. The shape of the array should be `(num_generation, seq_len, num_features)`.
36
  - **references**: (list of list of list of float or numpy.ndarray): The original time-series. The shape of the array should be `(num_reference, seq_len, num_features)`.
37
+ - **batch_size**: (int, optional): The batch size for computing the metric. This affects quadratically. Default is None.
38
 
39
  ### Output Values
40
 
41
+ Let prediction instances be $P = \{p_1, p_2, \ldots, p_n\}$ and reference instances be $R = \{r_1, r_2, \ldots, r_m\}$.
42
 
43
+ - **matching_mse**: (float): Average of the MSE between the generated instance and the reference instance with the lowest MSE. Intuitively, This is similar to precision in classification. In the equation, $\frac{1}{n} \sum_{i=1}^{n} \min_{j} \mathrm{MSE}(p_i, r_j)$.
44
+ - **covered_mse**: (float): Average of the MSE between the reference instance and the with the lowest MSE. Intuitively, This is similar to recall in classification. In the equation, $\frac{1}{m} \sum_{j=1}^{m} \min_{i} \mathrm{MSE}(p_i, r_j)$.
45
+ - **harmonic_mean**: (float): Harmonic mean of the matching_mse and covered_mse. This is similar to F1-score in classification.
46
+ - **index_mse**: (float): Average of the MSE between the generated instance and the reference instance with the same index. In the equation, $\frac{1}{n} \sum_{i=1}^{n} \mathrm{MSE}(p_i, r_i)$.
47
+ - **matching_mse_features**: (list of float): matching_mse computed individually for each feature.
48
+ - **covered_mse_features**: (list of float): covered_mse computed individually for each feature.
49
+ - **harmonic_mean_features**: (list of float): harmonic_mean computed individually for each feature.
50
+ - **index_mse_features**: (list of float): index_mse computed individually for each feature.
51
+ - **macro_matching_mse**: (float): Average of the matching_mse_features.
52
+ - **macro_covered_mse**: (float): Average of the covered_mse_features.
53
+ - **macro_harmonic_mean**: (float): Average of the harmonic_mean_features.
54
 
55
  #### Values from Popular Papers
56
+ <!-- *Give examples, preferrably with links to leaderboards or publications, to papers that have reported this metric, along with the values they have reported.* -->
57
 
58
  ### Examples
59
+ <!-- *Give code examples of the metric being used. Try to include examples that clear up any potential ambiguity left from the metric description above. If possible, provide a range of examples that show both typical and atypical results, as well as examples where a variety of input parameters are passed.* -->
60
 
61
  ## Limitations and Bias
62
+ This metric is based on the assumption that the generated time-series should match the original time-series. This may not be the case in some scenarios. The metric may not be suitable for evaluating time-series generation models that are not required to match the original time-series.
63
 
64
  ## Citation
65
+ <!-- *Cite the source where this metric was introduced.* -->
66
 
67
  ## Further References
68
+ <!-- *Add any useful further references.* -->