---
license: apache-2.0
---

# EMBER2024 Classifiers

This repository contains 14 LightGBM classifiers trained for different malware analysis tasks, including malicious/benign file detection, malware family identification, and malware behavior prediction. All of the classifiers were trained and evaluated on the EMBER2024 dataset.

Paper about the classifiers: [https://arxiv.org/pdf/2506.05074](https://arxiv.org/pdf/2506.05074)

GitHub repo for the EMBER2024 dataset: [https://github.com/FutureComputing4AI/EMBER2024/](https://github.com/FutureComputing4AI/EMBER2024/)
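
As a rough illustration of how one of these LightGBM classifiers might be used, the sketch below loads a saved model with the standard `lightgbm.Booster` API and scores pre-extracted feature vectors. The model path, the helper names `score_features` and `label_scores`, and the 0.5 decision threshold are assumptions for illustration and do not come from this repository; feature extraction is assumed to have been done already with the tooling in the EMBER2024 GitHub repo.

```python
def score_features(model_path, feature_rows):
    """Score pre-extracted feature vectors with a saved LightGBM model.

    model_path is a hypothetical path to one of the model files.
    feature_rows is a 2D array-like with one feature vector per file.
    """
    # Imported lazily so the pure-Python helper below works without them.
    import lightgbm as lgb
    import numpy as np

    booster = lgb.Booster(model_file=str(model_path))
    features = np.asarray(feature_rows, dtype=np.float32)

    # Check the input width against what the model was trained with.
    if features.ndim != 2 or features.shape[1] != booster.num_feature():
        raise ValueError(
            f"expected rows of {booster.num_feature()} features, "
            f"got array with shape {features.shape}"
        )
    return booster.predict(features)


def label_scores(scores, threshold=0.5):
    """Turn binary-classifier scores into 0/1 labels.

    The 0.5 threshold is an assumed default, not a recommended
    operating point for these classifiers.
    """
    return [1 if score >= threshold else 0 for score in scores]
```

For a binary detection model, `label_scores(score_features(path, rows))` would give one 0/1 label per file; a multiclass model returns one row of class scores per file instead, so it needs an argmax rather than a threshold.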

## Citing

If you use EMBER2024 in your own research, please cite it using:

```
@inproceedings{joyce2025ember,
      title={EMBER2024 - A Benchmark Dataset for Holistic Evaluation of Malware Classifiers},
      author={Robert J. Joyce and Gideon Miller and Phil Roth and Richard Zak and Elliott Zaresky-Williams and Hyrum Anderson and Edward Raff and James Holt},
      year={2025},
      booktitle={Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining},
}
```