Running 1.06k 1.06k FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
jinaai/jina-embeddings-v4-vllm-retrieval Visual Document Retrieval • 4B • Updated 16 days ago • 70.4k • 21