ldwang's Collections
MiscSpaces

updated Nov 6
Upvote
1

  • Running
    587

    Scaling test-time compute

    📈

    Implement test-time compute scaling for math problems


  • Running
    Featured
    1.23k

    FineWeb: decanting the web for the finest text data at scale

    🍷

    Generate high-quality text data for LLMs using FineWeb


  • Running
    3.6k

    The Ultra-Scale Playbook

    🌌

    The ultimate guide to training LLM on large GPU Clusters


  • Running
    212

    FineVision: Open Data is All You Need

    A new open-source dataset for training VLMs


  • Running
    19

    Megatron Memory Estimator

    Estimate GPU memory usage for Megatron models


  • Running on Zero
    19

    Smol2Operator Demo

    🐒

    Smol2Operator Demo: GUI Agent Model


  • Running on CPU Upgrade
    Featured
    2.68k

    The Smol Training Playbook

    📚

    The secrets to building world-class LLMs


  • Running
    72

    Unlocking On-Policy Distillation for Any Model Family

    Apply on-policy distillation to any model family
