jiwer evaluate transformers pytesseract opencv-contrib-python numpy torch librosa torchaudio huggingface_hub==0.22.2 sentence-transformers