view article Article Building an African Cultural Dataset with SmoLAgents: Experimental By Svngoku ⢠Feb 7 ⢠2
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other ⢠Oct 14, 2024 ⢠77
IrokoBench Collection a human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM ⢠6 items ⢠Updated May 31, 2024 ⢠18
Arcee's MergeKit: A Toolkit for Merging Large Language Models Paper ⢠2403.13257 ⢠Published Mar 20, 2024 ⢠20
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. ⢠34 items ⢠Updated 4 days ago ⢠28
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Paper ⢠2401.08417 ⢠Published Jan 16, 2024 ⢠35
Open LLM Leaderboard best models â¤ď¸âđĽ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: ⢠65 items ⢠Updated 24 minutes ago ⢠555
Trained Models đď¸ Collection They may be small, but they're training like giants! ⢠8 items ⢠Updated Dec 3, 2024 ⢠19
EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation Paper ⢠2310.08185 ⢠Published Oct 12, 2023 ⢠8
TinyGSM: achieving >80% on GSM8k with small language models Paper ⢠2312.09241 ⢠Published Dec 14, 2023 ⢠39
ChatGPT-Mini Collection A collection of fine-tuned GPT-2 models each designed to deploy a ChatGPT-like model at home. These models can also be deployed on an old computer. ⢠8 items ⢠Updated Nov 16, 2023 ⢠5
smol llama Collection đ§"raw" pretrained smol_llama checkpoints - WIP đ§ ⢠4 items ⢠Updated Apr 29, 2024 ⢠6
Indic language fine-tunes Collection Halted State: Attempting to create acceptable quality fine-tunes of different models ⢠1 item ⢠Updated Nov 23, 2023 ⢠1
PIC (Partner-in-Crime) project Collection Empathetic, small, really useful personalised models. ⢠3 items ⢠Updated Dec 10, 2023 ⢠2
Cramp(ed) Models Collection Smaller models trained locally on my 2xA6000 Lambda Vector ⢠3 items ⢠Updated Oct 10, 2023 ⢠1