data-papers Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14 • 55