streamlit transformers sentence-transformers faiss PyMuPDF python-docx beautifulsoup4 requests langdetect