PZ PRO

philipp-zettl

AI & ML interests

NLP/CV/Multimodal learning

Recent Activity

updated a model about 19 hours ago
philipp-zettl/T5-small-tinyqa
published a model about 19 hours ago
philipp-zettl/T5-small-tinyqa
liked a Space about 24 hours ago
elismasilva/mixture-of-diffusers-sdxl-tiling
View all activity

Organizations

Blog-explorers, easybits

philipp-zettl's activity

New activity in philipp-zettl/chessPT 1 day ago

Training Data Size

#3 opened 2 days ago by nh185285
reacted to schuler's post with 🔥 2 days ago

📢 New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes, we brought a method from computer vision into the transformer arena.

🔑 Key Findings:
• 77% parameter reduction.
• Maintained model capabilities.
• Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm
  • 1 reply
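
The core trick translates to a few lines of PyTorch. Below is a minimal sketch, not the paper's implementation (see the linked repo for that): the GroupedPointwiseFFN name, the 512/2048 dimensions, and groups=4 are all illustrative choices. A grouped pointwise (kernel_size=1) convolution is a per-token linear map whose channels are split into independent groups, so with groups=g the weight count shrinks by roughly a factor of g versus nn.Linear.

```python
import torch
import torch.nn as nn

class GroupedPointwiseFFN(nn.Module):
    """Hypothetical drop-in for a dense projection, built from a grouped
    pointwise (kernel_size=1) convolution. Illustrative sketch only."""

    def __init__(self, in_features: int, out_features: int, groups: int = 4):
        super().__init__()
        # groups=g partitions the channels: each output channel connects to
        # only in_features // g inputs, shrinking the weight tensor g-fold.
        self.proj = nn.Conv1d(in_features, out_features,
                              kernel_size=1, groups=groups)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x is (batch, seq_len, features); Conv1d wants (batch, channels, seq_len)
        return self.proj(x.transpose(1, 2)).transpose(1, 2)

dense = nn.Linear(512, 2048)
grouped = GroupedPointwiseFFN(512, 2048, groups=4)
print(sum(p.numel() for p in dense.parameters()))    # 1,050,624
print(sum(p.numel() for p in grouped.parameters()))  # 264,192, ~75% fewer
```

Because the groups are independent, no information crosses between them; grouped designs usually add some cross-group mixing (e.g. channel interleaving) between layers, which this sketch omits.
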
New activity in philipp-zettl/chessPT 4 days ago

Any results?

#2 opened 6 days ago by AlvaroMros
reacted to hexgrad's post with 🔥 12 days ago
upvoted an article 16 days ago

FineWeb2-C: Help Build Better Language Models in Your Language

By davanstrien and 5 others
reacted to mitkox's post with 🚀 16 days ago

llama.cpp is 26.8% faster than ollama.
I upgraded both and, using the same settings, ran the same DeepSeek R1 Distill 1.5B model on the same hardware. It's an apples-to-apples comparison.

Total duration:
llama.cpp 6.85 sec <- 26.8% faster
ollama 8.69 sec

Breakdown by phase:
Model loading
llama.cpp 241 ms <- 2x faster
ollama 553 ms

Prompt processing
llama.cpp 416.04 tokens/s with an eval time of 45.67 ms <- 10x faster
ollama 42.17 tokens/s with an eval time of 498 ms

Token generation
llama.cpp 137.79 tokens/s with an eval time of 6.62 sec <- 13% faster
ollama 122.07 tokens/s with an eval time 7.64 sec

llama.cpp is LLM inference in C/C++; ollama adds abstraction layers and marketing.
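
Numbers like these are easy to sanity-check locally. Here is a rough wall-clock harness in Python, with loud assumptions: the GGUF path and ollama tag are placeholders for whatever you have installed, the ollama model must already be pulled, and depending on your llama.cpp version you may need an extra flag (e.g. -no-cnv) to keep llama-cli from entering interactive chat. It measures total duration only; for the per-phase breakdown above, use ollama run --verbose and the timing summary llama-cli prints on exit.

```python
# Crude end-to-end timing of one prompt through both runtimes.
# A sketch, not the original benchmark harness; assumes `llama-cli`
# (from llama.cpp) and `ollama` are on PATH.
import subprocess
import time

PROMPT = "Summarize the rules of chess in three sentences."

def timed(cmd):
    """Run a command to completion and return wall-clock seconds."""
    start = time.perf_counter()
    subprocess.run(cmd, check=True, capture_output=True)
    return time.perf_counter() - start

llama_s = timed([
    "llama-cli",
    "-m", "DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M.gguf",  # placeholder path
    "-p", PROMPT,
    "-n", "128",  # cap generation length
])
ollama_s = timed(["ollama", "run", "deepseek-r1:1.5b", PROMPT])

print(f"llama.cpp {llama_s:.2f} s vs ollama {ollama_s:.2f} s "
      f"-> ollama {(ollama_s / llama_s - 1) * 100:.1f}% slower")
```

Wall-clock totals conflate loading, prompt processing, and generation, and the two runs may emit different token counts, so treat this as a smoke test rather than a benchmark.
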

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.