sentence_transformers pypdf chromadb langchain langchain-openai langchain_community langchain_chroma arxiv pymupdf openai yt_dlp reportlab gradio bs4