filtering the samples

#4
by ehartford - opened

how did you choose the 10k samples?

We sampled uniformly across each data source category. Data sour categories are detailed here - https://huggingface.co/datasets/teknium/OpenHermes-2.5?row=5.

aravindputrevu changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment