ChunTe Lee

Chunte

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

Hugging Face's profile picture Social Post Explorers's profile picture Dev Mode Explorers's profile picture

Chunte's activity

upvoted an article about 1 hour ago
view article
Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

By not-lain โ€ข
โ€ข 19
reacted to singhsidhukuldeep's post with โž• 2 days ago
view post
Post
2938
Exciting breakthrough in Retrieval-Augmented Generation (RAG): Introducing MiniRAG - a revolutionary approach that makes RAG systems accessible for edge devices and resource-constrained environments.

Key innovations that set MiniRAG apart:

Semantic-aware Heterogeneous Graph Indexing
- Combines text chunks and named entities in a unified structure
- Reduces reliance on complex semantic understanding
- Creates rich semantic networks for precise information retrieval

Lightweight Topology-Enhanced Retrieval
- Leverages graph structures for efficient knowledge discovery
- Uses pattern matching and localized text processing
- Implements query-guided reasoning path discovery

Impressive Performance Metrics
- Achieves comparable results to LLM-based methods while using Small Language Models (SLMs)
- Requires only 25% of storage space compared to existing solutions
- Maintains robust performance with accuracy reduction ranging from just 0.8% to 20%

The researchers from Hong Kong University have also contributed a comprehensive benchmark dataset specifically designed for evaluating lightweight RAG systems under realistic on-device scenarios.

This breakthrough opens new possibilities for:
- Edge device AI applications
- Privacy-sensitive implementations
- Real-time processing systems
- Resource-constrained environments

The full implementation and datasets are available on GitHub: HKUDS/MiniRAG
  • 1 reply
ยท
reacted to sayakpaul's post with ๐Ÿš€๐Ÿค— 2 days ago
view post
Post
1784
We have authored a post to go over the state of video generation in the Diffusers ecosystem ๐Ÿงจ

We cover the models supported, the knobs of optims our users can fire, fine-tuning, and more ๐Ÿ”ฅ

5-6GBs for HunyuanVideo, sky is the limit ๐ŸŒŒ ๐Ÿค—
https://huggingface.co/blog/video_gen
upvoted 5 articles 2 days ago
view article
Article

Run ComfyUI workflows for free on Spaces

โ€ข 47
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

โ€ข 460
view article
Article

SmolVLM Grows Smaller โ€“ Introducing the 250M & 500M Models!

โ€ข 95
view article
Article

State of open video generation models in Diffusers

โ€ข 23
view article
Article

Welcome to Inference Providers on the Hub ๐Ÿ”ฅ

โ€ข 171
updated a model 6 days ago
reacted to singhsidhukuldeep's post with ๐Ÿ˜”๐Ÿค๐Ÿ‘๐Ÿค๐Ÿง ๐Ÿ˜Ž๐Ÿค—โค๏ธ 8 days ago
view post
Post
2938
Exciting breakthrough in Retrieval-Augmented Generation (RAG): Introducing MiniRAG - a revolutionary approach that makes RAG systems accessible for edge devices and resource-constrained environments.

Key innovations that set MiniRAG apart:

Semantic-aware Heterogeneous Graph Indexing
- Combines text chunks and named entities in a unified structure
- Reduces reliance on complex semantic understanding
- Creates rich semantic networks for precise information retrieval

Lightweight Topology-Enhanced Retrieval
- Leverages graph structures for efficient knowledge discovery
- Uses pattern matching and localized text processing
- Implements query-guided reasoning path discovery

Impressive Performance Metrics
- Achieves comparable results to LLM-based methods while using Small Language Models (SLMs)
- Requires only 25% of storage space compared to existing solutions
- Maintains robust performance with accuracy reduction ranging from just 0.8% to 20%

The researchers from Hong Kong University have also contributed a comprehensive benchmark dataset specifically designed for evaluating lightweight RAG systems under realistic on-device scenarios.

This breakthrough opens new possibilities for:
- Edge device AI applications
- Privacy-sensitive implementations
- Real-time processing systems
- Resource-constrained environments

The full implementation and datasets are available on GitHub: HKUDS/MiniRAG
  • 1 reply
ยท