Update README.md
Browse files
README.md
CHANGED
@@ -45,25 +45,25 @@ If you find our work useful, please give us credit by citing:
|
|
45 |
|
46 |
| Task | Type | Dataset | Non-LLM SoTA<sup>1</sup> | GPT-3.5<sup>2</sup> | GPT-4<sup>2</sup> | Jellyfish-13B| Jellyfish-7B | Jellyfish-8B |
|
47 |
| ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- |
|
48 |
-
| Entity Matching | Seen | Fodors-Zagats | 100 | 100 | 100 | 100 | 100 |
|
49 |
-
| Entity Matching | Seen | Beer | 94.37| 96.30 | 100 | 96.77 | 96.55|
|
50 |
-
| Entity Matching | Seen | iTunes-Amazon | 97.06| 96.43 | 100 | 98.11 | 96.30|
|
51 |
-
| Entity Matching | Seen | DBLP-ACM | 98.99| 96.99 | 97.44 | 98.98 | 98.88|
|
52 |
-
| Entity Matching | Seen | DBLP-GoogleScholar | 95.60| 76.12 | 91.87 | 98.51 | 95.15|
|
53 |
-
| Entity Matching | Seen | Amazon-Google | 75.58| 66.53 | 74.21 | 81.34 | 80.83 |
|
54 |
-
| Entity Matching | Unseen | Walmart-Amazon | 86.76| 86.17 | 90.27 | 89.42 | 85.64 |
|
55 |
-
| Entity Matching | Unseen | Abt-Buy | 89.33 | -- | 92.77 | 89.58 | 82.38 |
|
56 |
-
| Data Imputation | Seen | Restaurant | 77.20| 94.19 | 97.67 | 94.19 | 88.37 |
|
57 |
-
| Data Imputation | Seen | Buy | 96.50| 98.46 | 100 | 100 | 96.62 |
|
58 |
-
| Data Imputation | Unseen | Filpkart | 68.00 | -- | 89.94 | 81.68 | 79.44|
|
59 |
-
| Data Imputation | Unseen | Phone | 86.70| -- | 90.79 | 87.21 | 85.00|
|
60 |
-
| Error Detection | Seen | Hosptial | 94.40| 90.74 | 90.74 | 95.59 | 96.27 |
|
61 |
-
| Error Detection | Seen | Adult | 99.10| 92.01 | 92.01 | 99.33 | 91.96 |
|
62 |
-
| Error Detection | Unseen | Flights | 81.00 | -- | 83.48 | 82.52 | 66.92 |
|
63 |
-
| Error Detection | Unseen | Rayyan | 79.00| -- | 81.95 | 90.65 | 69.82 |
|
64 |
-
| Schema Matching | Seen | Sythea | 38.50| 57.14 | 66.67 | 36.36 | 44.44 |
|
65 |
-
| Schema Matching | Seen | MIMIC | 20.00| -- | 40.00 | 40.00 | 40.00 |
|
66 |
-
| Schema Matching | Unseen | CMS | 50.00| -- | 19.35 | 59.29 | 13.79 |
|
67 |
|
68 |
_For GPT-3.5 and GPT-4, we used the few-shot approach on all datasets. However, for Jellyfish-13B and Jellyfish-Interpreter, the few-shot approach is disabled on seen datasets and enabled on unseen datasets._
|
69 |
_Accuracy as the metric for data imputation and the F1 score for other tasks._
|
|
|
45 |
|
46 |
| Task | Type | Dataset | Non-LLM SoTA<sup>1</sup> | GPT-3.5<sup>2</sup> | GPT-4<sup>2</sup> | Jellyfish-13B| Jellyfish-7B | Jellyfish-8B |
|
47 |
| ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- |
|
48 |
+
| Entity Matching | Seen | Fodors-Zagats | 100 | 100 | 100 | 100 | 100 | 92.68 |
|
49 |
+
| Entity Matching | Seen | Beer | 94.37| 96.30 | 100 | 96.77 | 96.55| 96.30 |
|
50 |
+
| Entity Matching | Seen | iTunes-Amazon | 97.06| 96.43 | 100 | 98.11 | 96.30| 92.00 |
|
51 |
+
| Entity Matching | Seen | DBLP-ACM | 98.99| 96.99 | 97.44 | 98.98 | 98.88| 98.76 |
|
52 |
+
| Entity Matching | Seen | DBLP-GoogleScholar | 95.60| 76.12 | 91.87 | 98.51 | 95.15| 93.20 |
|
53 |
+
| Entity Matching | Seen | Amazon-Google | 75.58| 66.53 | 74.21 | 81.34 | 80.83 | 74.49 |
|
54 |
+
| Entity Matching | Unseen | Walmart-Amazon | 86.76| 86.17 | 90.27 | 89.42 | 85.64 | 89.97 |
|
55 |
+
| Entity Matching | Unseen | Abt-Buy | 89.33 | -- | 92.77 | 89.58 | 82.38 | 92.54 |
|
56 |
+
| Data Imputation | Seen | Restaurant | 77.20| 94.19 | 97.67 | 94.19 | 88.37 | 87.21 |
|
57 |
+
| Data Imputation | Seen | Buy | 96.50| 98.46 | 100 | 100 | 96.62 | 92.31 |
|
58 |
+
| Data Imputation | Unseen | Filpkart | 68.00 | -- | 89.94 | 81.68 | 79.44| 90.17 |
|
59 |
+
| Data Imputation | Unseen | Phone | 86.70| -- | 90.79 | 87.21 | 85.00| 83.92 |
|
60 |
+
| Error Detection | Seen | Hosptial | 94.40| 90.74 | 90.74 | 95.59 | 96.27 | 80.72|
|
61 |
+
| Error Detection | Seen | Adult | 99.10| 92.01 | 92.01 | 99.33 | 91.96 | 81.72|
|
62 |
+
| Error Detection | Unseen | Flights | 81.00 | -- | 83.48 | 82.52 | 66.92 | 75.18 |
|
63 |
+
| Error Detection | Unseen | Rayyan | 79.00| -- | 81.95 | 90.65 | 69.82 | 91.54 |
|
64 |
+
| Schema Matching | Seen | Sythea | 38.50| 57.14 | 66.67 | 36.36 | 44.44 | 27.27 |
|
65 |
+
| Schema Matching | Seen | MIMIC | 20.00| -- | 40.00 | 40.00 | 40.00 | 34.04|
|
66 |
+
| Schema Matching | Unseen | CMS | 50.00| -- | 19.35 | 59.29 | 13.79 | 56.72|
|
67 |
|
68 |
_For GPT-3.5 and GPT-4, we used the few-shot approach on all datasets. However, for Jellyfish-13B and Jellyfish-Interpreter, the few-shot approach is disabled on seen datasets and enabled on unseen datasets._
|
69 |
_Accuracy as the metric for data imputation and the F1 score for other tasks._
|