---
title: README
emoji: πŸ†
colorFrom: indigo
colorTo: green
sdk: static
pinned: false
---

# MONSTER

***MONSTER: Monash Scalable Time Series Evaluation Repository***

[arXiv:2502.15122](https://arxiv.org/abs/2502.15122) / [HF Papers](https://huggingface.co/papers/2502.15122) (preprint)  
[GitHub](https://github.com/Navidfoumani/monster) (code)

> <div align="justify">We introduce MONSTER&mdash;the <b>MON</b>ash <b>S</b>calable <b>T</b>ime Series <b>E</b>valuation <b>R</b>epository&mdash;a collection of large datasets for time series classification. The field of time series classification has benefitted from common benchmarks set by the UCR and UEA time series classification repositories. However, the datasets in these benchmarks are small, with median sizes of 217 and 255 examples, respectively. In consequence they favour a narrow subspace of models that are optimised to achieve low classification error on a wide variety of smaller datasets, that is, models that minimise variance, and give little weight to computational issues such as scalability. Our hope is to diversify the field by introducing benchmarks using larger datasets. We believe that there is enormous potential for new progress in the field by engaging with the theoretical and practical challenges of learning effectively from larger quantities of data.</div>

Please cite as:
```bibtex
@article{dempster_etal_2025,
  author  = {Dempster, Angus and Foumani, Navid Mohammadi and Tan, Chang Wei and Miller, Lynn and Mishra, Amish and Salehi, Mahsa and Pelletier, Charlotte and Schmidt, Daniel F and Webb, Geoffrey I},
  title   = {MONSTER: Monash Scalable Time Series Evaluation Repository},
  year    = {2025},
  journal = {arXiv:2502.15122},
}
```

## Downloading Data

### <tt>hf_hub_download</tt>

```python
import numpy as np
from huggingface_hub import hf_hub_download

# download a single array file from the dataset repository
path = hf_hub_download(repo_id="monster-monash/Pedestrian", filename="Pedestrian_X.npy", repo_type="dataset")

# memory-map the array so it is not loaded into RAM all at once
X = np.load(path, mmap_mode="r")
```
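Because the array is memory-mapped, you can (continuing from the snippet above) check its shape and pull out individual examples without reading the whole file into memory. A minimal sketch; the axis order shown in the comment is an assumption, not guaranteed for every dataset:

```python
print(X.shape, X.dtype)  # e.g. (num_examples, num_channels, length); axis order may vary by dataset

x0 = np.asarray(X[0])    # materialise a single example as a regular in-memory array
```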

### <tt>load_dataset</tt>

```python
from datasets import load_dataset

# load a single fold (here, fold 0) of the Pedestrian dataset
dataset = load_dataset("monster-monash/Pedestrian", "fold_0", trust_remote_code=True)
```
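The returned object is a `DatasetDict`. Split names and column names vary between datasets, so a safe first step is to enumerate whatever splits and features are present rather than assume specific names; a minimal sketch using only the standard `datasets` API:

```python
from datasets import load_dataset

dataset = load_dataset("monster-monash/Pedestrian", "fold_0", trust_remote_code=True)

# inspect the available splits and their schemas before relying on specific column names
for split_name, split in dataset.items():
    print(split_name, len(split), split.features)
```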

(More to come...)

<div align="center">πŸ¦–</div>