High-quality pretraining and instruction datasets for law, mathematics, and science.
Casey
casey-martin
AI & ML interests
Biomedical Tool Usage
Graph Learning
Ecophysiology
Recent Activity
liked a dataset about 21 hours ago: SmallDoge/SmallThoughts
liked a dataset about 21 hours ago: mlfoundations-dev/SCP_40k-claude-3-7-sonnet-16k
liked a dataset 1 day ago: open-r1/ioi-cots
Collections: 1
Models: none public yet
Datasets: 10
casey-martin/Seal-Tools • Viewer • Updated • 14.1k • 156
casey-martin/GeneGPT • Preview • Updated • 98
casey-martin/math_notebooks • Viewer • Updated • 18.1k • 88
casey-martin/CommonLit-Ease-of-Readability • Viewer • Updated • 4.72k • 89 • 1
casey-martin/multilingual-mathematical-autoformalization • Viewer • Updated • 666k • 368 • 2
casey-martin/MedInstruct • Preview • Updated • 58 • 7
casey-martin/qald_9_plus • Viewer • Updated • 15.8k • 239 • 1
casey-martin/vquanda • Viewer • Updated • 5k • 85 • 3
casey-martin/protocols_io • Updated • 37
casey-martin/oa_cpp_annotate_gen • Viewer • Updated • 104k • 82 • 2