AI & ML interests

A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs.

Recent Activity

ai-blueprint's activity

davidberenstein1957 
posted an update 1 day ago
davidberenstein1957 
posted an update 2 days ago
davidberenstein1957 
posted an update 3 days ago
davidberenstein1957 
posted an update 8 days ago
view post
Post
1533
tldr; Parquet is awesome, DuckDB too!

Datasets on the Hugging Face Hub rely on parquet files. We can interact with these files using DuckDB as a fast in-memory database system. One of DuckDB’s features is vector similarity search which can be used with or without an index.

blog:
https://huggingface.co/learn/cookbook/vector_search_with_hub_as_backend
davidberenstein1957 
posted an update 11 days ago
davidberenstein1957 
posted an update 17 days ago
davidberenstein1957 
posted an update 21 days ago