wb

whitebill

AI & ML interests

None yet

Recent Activity

updated a collection 16 days ago

updated a collection 24 days ago

reacted to anakin87's post with 👍 26 days ago

Haystack can now see 👀

The latest release of the Haystack OSS LLM framework adds a long-requested feature: image support!

📓 Notebooks below

This isn't just about passing images to an LLM. We built several features to enable practical multimodal use cases.

What's new?
🧠 Support for multiple LLM providers: OpenAI, Amazon Bedrock, Google Gemini, Mistral, NVIDIA, OpenRouter, Ollama and more (support for Hugging Face API coming 🔜)
🎛️ Prompt template language to handle structured inputs, including images
📄 PDF and image converters
🔍 Image embedders using CLIP-like models
🧾 LLM-based extractor to pull text from images
🧩 Components to build multimodal RAG pipelines and Agents

I had the chance of leading this effort with @sjrhuschlee (great collab).

📓 Below you can find two notebooks to explore the new features:
• Introduction to Multimodal Text Generation: https://haystack.deepset.ai/cookbook/multimodal_intro
• Creating Vision+Text RAG Pipelines: https://haystack.deepset.ai/tutorials/46_multimodal_rag

(🖼️ image by @bilgeyucel)

Organizations

whitebill 's models

None public yet