---
library_name: popV
license: cc-by-4.0
tags:
- biology
- genomics
- single-cell
- anndata_version:0.11.3
- python_version:3.11.11
- popV
- 'tissue: diverse'
---

Popular Vote (popV) model for automated cell type annotation of single-cell RNA-seq data. This repository provides pretrained models
that you can plug into your own analysis.
Follow our [tutorial](https://github.com/YosefLab/popV/blob/main/tabula_sapiens_tutorial.ipynb) to learn how to use the model for cell type annotation.

# Model description

Tabula Sapiens is a benchmark, first-draft human cell atlas of over 1.1M cells from 28 organs of 24 normal human subjects. This work is the product of the Tabula Sapiens Consortium. Taking the organs from the same individual controls for genetic background, age, environment, and epigenetic effects, and it allows detailed analysis and comparison of cell types that are shared between tissues.

**Link to CELLxGENE**:
Explore the [data](https://cellxgene.cziscience.com/e/a68b64d8-aee3-4947-81b7-36b8fe5a44d2.cxg/) interactively in the CELLxGENE browser, where the source data can also be downloaded.

**Training Code URL**:
Not provided by uploader.

# Metrics

We report accuracies for each of the experts and for the ensemble model (the Consensus Prediction column). The validation set accuracies are
computed on a random 10% subset of the data that was held out from training.

| Cell Type | N cells |  celltypist |  knn bbknn |  knn harmony |  knn on scvi |  onclass |  scanvi |  svm |  xgboost | Consensus Prediction |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| fibroblast | 8260 | 0.82 | 0.79 | 0.88 | 0.69 | 0.00 | 0.61 | 0.70 | 0.73 | 0.89 |
| stromal cell of ovary | 3509 | 0.97 | 0.94 | 0.89 | 0.60 | 0.00 | 0.50 | 0.96 | 0.96 | 0.99 |
| mesenchymal stem cell | 2266 | 0.93 | 0.75 | 0.88 | 0.33 | 0.00 | 0.40 | 0.90 | 0.89 | 0.95 |
| smooth muscle cell | 1815 | 0.79 | 0.66 | 0.85 | 0.70 | 0.00 | 0.64 | 0.77 | 0.81 | 0.83 |
| pericyte | 1105 | 0.72 | 0.63 | 0.83 | 0.66 | 0.00 | 0.49 | 0.71 | 0.74 | 0.78 |
| skeletal muscle satellite stem cell | 670 | 0.95 | 0.95 | 0.96 | 0.76 | 0.00 | 0.70 | 0.95 | 0.96 | 0.97 |
| blood vessel smooth muscle cell | 580 | 0.88 | 0.49 | 0.92 | 0.67 | 0.00 | 0.43 | 0.88 | 0.86 | 0.95 |
| regular atrial cardiac myocyte | 621 | 0.80 | 0.43 | 0.74 | 0.30 | 0.00 | 0.10 | 0.90 | 0.94 | 0.92 |
| fibroblast of breast | 541 | 0.97 | 0.00 | 0.71 | 0.14 | 0.00 | 0.10 | 0.90 | 0.80 | 0.93 |
| vascular associated smooth muscle cell | 474 | 0.72 | 0.04 | 0.62 | 0.48 | 0.00 | 0.19 | 0.64 | 0.73 | 0.73 |
| myofibroblast cell | 414 | 0.56 | 0.29 | 0.80 | 0.39 | 0.00 | 0.44 | 0.56 | 0.60 | 0.77 |
| thymic fibroblast type 2 | 401 | 0.69 | 0.00 | 0.43 | 0.07 | 0.00 | 0.23 | 0.61 | 0.65 | 0.54 |
| mesenchymal stem cell of adipose tissue | 359 | 0.35 | 0.00 | 0.64 | 0.27 | 0.00 | 0.22 | 0.34 | 0.38 | 0.50 |
| stromal cell | 289 | 0.79 | 0.02 | 0.49 | 0.30 | 0.00 | 0.37 | 0.75 | 0.80 | 0.94 |
| thymic fibroblast type 1 | 339 | 0.66 | 0.01 | 0.60 | 0.17 | 0.00 | 0.12 | 0.57 | 0.55 | 0.63 |
| ventricular cardiac muscle cell | 245 | 0.45 | 0.11 | 0.61 | 0.38 | 0.00 | 0.47 | 0.79 | 0.86 | 0.80 |
| fibroblast of cardiac tissue | 186 | 0.96 | 0.54 | 0.92 | 0.07 | 0.00 | 0.48 | 0.81 | 0.75 | 0.95 |
| alveolar adventitial fibroblast | 124 | 0.80 | 0.40 | 0.63 | 0.14 | 0.00 | 0.32 | 0.57 | 0.80 | 0.79 |
| adventitial cell | 106 | 0.25 | 0.00 | 0.13 | 0.07 | 0.00 | 0.11 | 0.30 | 0.46 | 0.51 |
| theca cell | 120 | 0.69 | 0.27 | 0.88 | 0.61 | 0.00 | 0.63 | 0.81 | 0.68 | 0.88 |
| keratocyte | 91 | 0.58 | 0.04 | 0.36 | 0.05 | 0.00 | 0.10 | 0.41 | 0.44 | 0.58 |
| myometrial cell | 101 | 0.87 | 0.00 | 0.88 | 0.16 | 0.00 | 0.36 | 0.64 | 0.72 | 0.87 |
| mural cell | 57 | 0.57 | 0.54 | 0.76 | 0.27 | 0.00 | 0.27 | 0.53 | 0.55 | 0.88 |
| Mueller cell | 50 | 0.92 | 0.63 | 0.97 | 0.72 | 0.00 | 0.59 | 0.97 | 0.97 | 0.98 |
| fast muscle cell | 53 | 0.63 | 0.70 | 0.83 | 0.72 | 0.00 | 0.85 | 0.83 | 0.88 | 0.91 |
| pancreatic stellate cell | 38 | 0.43 | 0.00 | 0.52 | 0.16 | 0.00 | 0.00 | 0.80 | 0.81 | 0.73 |
| tendon cell | 40 | 0.28 | 0.14 | 0.52 | 0.11 | 0.00 | 0.11 | 0.31 | 0.39 | 0.64 |
| muscle cell | 36 | 0.78 | 0.00 | 0.69 | 0.63 | 0.00 | 0.73 | 0.92 | 0.96 | 0.96 |
| Schwann cell | 33 | 0.71 | 0.75 | 0.79 | 0.72 | 0.00 | 0.20 | 0.70 | 0.74 | 0.82 |
| melanocyte | 16 | 0.00 | 0.67 | 0.79 | 0.71 | 0.00 | 0.20 | 0.71 | 0.70 | 0.91 |
| hepatic stellate cell | 29 | 0.24 | 0.00 | 0.70 | 0.19 | 0.00 | 0.16 | 0.84 | 0.84 | 0.71 |
| bronchial smooth muscle cell | 12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.12 | 0.20 | 0.28 | 0.29 |
| tongue muscle cell | 11 | 0.00 | 0.43 | 0.67 | 0.13 | 0.00 | 0.10 | 0.59 | 0.33 | 0.71 |
| slow muscle cell | 14 | 0.00 | 0.55 | 0.67 | 0.20 | 0.00 | 0.83 | 0.87 | 0.87 | 0.90 |
| connective tissue cell | 9 | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.02 | 0.33 | 0.22 | 0.40 |
| adipocyte | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| glial cell | 1 | 0.00 | 0.00 | 1.00 | 1.00 | 0.00 | 0.29 | 0.40 | 0.11 | 1.00 |
| Leydig cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| mesenchymal cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
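
As a rough illustration of how a table like the one above could be assembled, the sketch below computes, for each true cell type, the fraction of cells that each expert labels correctly, and adds a consensus column. It is a minimal sketch under stated assumptions, not the popV implementation: the column names, the toy labels, and the helper functions `majority_vote` and `per_cell_type_accuracy` are hypothetical, and a plain majority vote stands in for whatever consensus rule popV actually applies.

```python
import pandas as pd


def majority_vote(df, method_cols):
    """Most frequent label across the expert columns for each cell.

    Assumption: a plain majority vote; ties are resolved arbitrarily.
    """
    return df[method_cols].apply(lambda row: row.value_counts().idxmax(), axis=1)


def per_cell_type_accuracy(df, label_col, method_cols):
    """Per true cell type, the fraction of cells each method labels correctly."""
    correct = df[method_cols].eq(df[label_col], axis=0)
    table = correct.groupby(df[label_col]).mean()
    counts = df[label_col].value_counts().reindex(table.index)
    table.insert(0, "N cells", counts)
    return table


# Hypothetical toy predictions; in practice this would be the held-out cells.
cells = pd.DataFrame(
    {
        "cell_type": ["fibroblast", "fibroblast", "pericyte", "pericyte"],
        "svm": ["fibroblast", "pericyte", "pericyte", "pericyte"],
        "xgboost": ["fibroblast", "fibroblast", "pericyte", "fibroblast"],
        "scanvi": ["fibroblast", "fibroblast", "fibroblast", "pericyte"],
    }
)
methods = ["svm", "xgboost", "scanvi"]
cells["consensus"] = majority_vote(cells, methods)
table = per_cell_type_accuracy(cells, "cell_type", methods + ["consensus"])
print(table.round(2))
```

In this toy example each expert mislabels one cell, yet the vote recovers every true label, which mirrors the pattern in many rows above where the consensus matches or exceeds the best single expert.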

The training set accuracies are computed on the same data that was used to fit the models.

| Cell Type | N cells |  celltypist |  knn bbknn |  knn harmony |  knn on scvi |  onclass |  scanvi |  svm |  xgboost | Consensus Prediction |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| fibroblast | 74880 | 0.82 | 0.71 | 0.93 | 0.77 | 0.00 | 0.84 | 0.70 | 0.73 | 0.92 |
| stromal cell of ovary | 31494 | 0.97 | 0.45 | 0.91 | 0.73 | 0.00 | 0.96 | 0.96 | 0.96 | 0.99 |
| mesenchymal stem cell | 21230 | 0.93 | 0.61 | 0.94 | 0.51 | 0.00 | 0.93 | 0.90 | 0.90 | 0.97 |
| smooth muscle cell | 16574 | 0.79 | 0.59 | 0.92 | 0.79 | 0.00 | 0.86 | 0.78 | 0.81 | 0.89 |
| pericyte | 9773 | 0.71 | 0.52 | 0.90 | 0.76 | 0.00 | 0.75 | 0.72 | 0.76 | 0.83 |
| skeletal muscle satellite stem cell | 5498 | 0.94 | 0.93 | 0.98 | 0.85 | 0.00 | 0.95 | 0.96 | 0.95 | 0.98 |
| blood vessel smooth muscle cell | 5407 | 0.89 | 0.13 | 0.96 | 0.77 | 0.00 | 0.94 | 0.88 | 0.87 | 0.96 |
| regular atrial cardiac myocyte | 5190 | 0.78 | 0.12 | 0.90 | 0.76 | 0.00 | 0.98 | 0.91 | 0.95 | 0.97 |
| fibroblast of breast | 4729 | 0.98 | 0.00 | 0.76 | 0.38 | 0.00 | 0.95 | 0.90 | 0.78 | 0.99 |
| vascular associated smooth muscle cell | 4544 | 0.71 | 0.01 | 0.79 | 0.63 | 0.00 | 0.74 | 0.66 | 0.72 | 0.82 |
| myofibroblast cell | 4021 | 0.59 | 0.09 | 0.90 | 0.62 | 0.00 | 0.60 | 0.58 | 0.63 | 0.82 |
| thymic fibroblast type 2 | 3802 | 0.69 | 0.00 | 0.67 | 0.30 | 0.00 | 0.88 | 0.63 | 0.67 | 0.89 |
| mesenchymal stem cell of adipose tissue | 3413 | 0.36 | 0.00 | 0.77 | 0.48 | 0.00 | 0.48 | 0.37 | 0.42 | 0.54 |
| stromal cell | 3139 | 0.80 | 0.00 | 0.69 | 0.46 | 0.00 | 0.80 | 0.78 | 0.82 | 0.94 |
| thymic fibroblast type 1 | 2930 | 0.62 | 0.00 | 0.79 | 0.29 | 0.00 | 0.85 | 0.59 | 0.55 | 0.87 |
| ventricular cardiac muscle cell | 2353 | 0.43 | 0.00 | 0.79 | 0.74 | 0.00 | 0.98 | 0.82 | 0.90 | 0.93 |
| fibroblast of cardiac tissue | 1751 | 0.96 | 0.26 | 0.98 | 0.46 | 0.00 | 0.93 | 0.79 | 0.74 | 0.95 |
| alveolar adventitial fibroblast | 987 | 0.82 | 0.25 | 0.74 | 0.33 | 0.00 | 0.81 | 0.60 | 0.85 | 0.94 |
| adventitial cell | 938 | 0.25 | 0.00 | 0.63 | 0.28 | 0.00 | 0.45 | 0.31 | 0.45 | 0.68 |
| theca cell | 918 | 0.65 | 0.04 | 0.92 | 0.78 | 0.00 | 0.64 | 0.76 | 0.64 | 0.81 |
| keratocyte | 883 | 0.59 | 0.01 | 0.54 | 0.16 | 0.00 | 0.46 | 0.41 | 0.47 | 0.73 |
| myometrial cell | 816 | 0.82 | 0.00 | 0.87 | 0.62 | 0.00 | 0.64 | 0.59 | 0.72 | 0.87 |
| mural cell | 557 | 0.58 | 0.42 | 0.83 | 0.43 | 0.00 | 0.60 | 0.59 | 0.60 | 0.93 |
| Mueller cell | 445 | 0.89 | 0.30 | 0.96 | 0.87 | 0.00 | 0.85 | 0.99 | 0.96 | 0.98 |
| fast muscle cell | 442 | 0.58 | 0.63 | 0.89 | 0.82 | 0.00 | 0.89 | 0.85 | 0.92 | 0.95 |
| pancreatic stellate cell | 386 | 0.39 | 0.00 | 0.83 | 0.30 | 0.00 | 0.78 | 0.80 | 0.79 | 0.81 |
| tendon cell | 380 | 0.30 | 0.02 | 0.80 | 0.43 | 0.00 | 0.31 | 0.35 | 0.40 | 0.71 |
| muscle cell | 370 | 0.77 | 0.01 | 0.89 | 0.90 | 0.00 | 0.92 | 0.98 | 0.99 | 0.99 |
| Schwann cell | 279 | 0.61 | 0.42 | 0.88 | 0.79 | 0.00 | 0.32 | 0.70 | 0.82 | 0.86 |
| melanocyte | 215 | 0.00 | 0.52 | 0.93 | 0.71 | 0.00 | 0.45 | 0.76 | 0.82 | 0.96 |
| hepatic stellate cell | 193 | 0.26 | 0.00 | 0.79 | 0.45 | 0.00 | 0.67 | 0.80 | 0.74 | 0.84 |
| bronchial smooth muscle cell | 207 | 0.00 | 0.00 | 0.59 | 0.34 | 0.00 | 0.47 | 0.47 | 0.56 | 0.76 |
| tongue muscle cell | 187 | 0.00 | 0.05 | 0.87 | 0.37 | 0.00 | 0.62 | 0.87 | 0.51 | 0.94 |
| slow muscle cell | 160 | 0.00 | 0.50 | 0.82 | 0.57 | 0.00 | 0.87 | 0.93 | 0.95 | 0.96 |
| connective tissue cell | 142 | 0.00 | 0.00 | 0.46 | 0.23 | 0.00 | 0.31 | 0.54 | 0.36 | 0.77 |
| adipocyte | 33 | 0.00 | 0.00 | 0.96 | 0.86 | 0.00 | 0.37 | 0.93 | 0.86 | 0.97 |
| glial cell | 19 | 0.00 | 0.00 | 0.79 | 0.50 | 0.00 | 0.41 | 0.41 | 0.18 | 0.61 |
| Leydig cell | 9 | 0.00 | 0.00 | 0.00 | 0.13 | 0.00 | 0.12 | 0.86 | 0.20 | 0.46 |
| mesenchymal cell | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.21 | 0.57 | 0.67 |



# References

The Tabula Sapiens Consortium. Tabula Sapiens reveals transcription factor expression, senescence effects, and sex-specific features in cell types from 28 human organs and tissues. bioRxiv. doi: https://doi.org/10.1101/2024.12.03.626516