itdainb
/

PhoRanker

@@ -26,12 +26,19 @@ model-index:
       name: MRR@10
       verified: false
 widget:
-- source_sentence: "UIT là gì ?"
-  sentences:
-    - "UIT là Trường Đại_học Công_nghệ Thông_tin ( ĐH CNTT ) , Đại_học Quốc_gia Thành_phố Hồ_Chí_Minh ( ĐHQG - HCM )"
-    - "Mô_hình rerank — còn được gọi là cross - encoder — là một loại mô_hình mà , khi được cung_cấp một cặp truy vấn và tài_liệu , sẽ đưa ra một điểm tương_đồng ."
-    - "Việt_Nam , quốc_hiệu là Cộng_hòa xã_hội chủ_nghĩa Việt_Nam , là một quốc_gia xã_hội chủ_nghĩa nằm ở cực Đông của bán_đảo Đông_Dương thuộc khu_vực Đông_Nam_Á"
-pipeline_tag: sentence-similarity
 ---
 ## Installation
@@ -77,19 +84,16 @@ scores = model.predict(tokenized_pairs)
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
-import torch
 model = AutoModelForSequenceClassification.from_pretrained('itdainb/vietnamese-cross-encoder')
 tokenizer = AutoTokenizer.from_pretrained('itdainb/vietnamese-cross-encoder')
-activation_fct = torch.nn.Identity()
 features = tokenizer(tokenized_pairs, padding=True, truncation="longest_first", return_tensors="pt", max_length=256)
 model.eval()
 with torch.no_grad():
     model_predictions = model(**features, return_dict=True)
-    logits = activation_fct(model_predictions.logits)
     scores = [score[0] for score in logits]
     print(scores)

       name: MRR@10
       verified: false
 widget:
+- text: "UIT là gì ?. Trường Đại_học Công_nghệ Thông_tin có tên tiếng Anh là University of Information_Technology ( viết tắt là UIT ) là thành_viên của Đại_học Quốc_Gia TP. HCM."
+  output:
+    - label: "Top 1"
+      score: 4.0033
+- text: "UIT là gì ?. Trường Đại_học Kinh_tế – Luật ( tiếng Anh : University of Economics and Law – UEL ) là trường đại_học đào_tạo và nghiên_cứu khối ngành kinh_tế , kinh_doanh và luật hàng_đầu Việt_Nam ."
+  output:
+    - label: "Top 3"
+      score: -1.1160
+- text: "UIT là gì ?. Quĩ_uỷ_thác đầu_tư ( tiếng Anh : Unit Investment_Trusts ; viết tắt : UIT ) là một công_ty đầu_tư mua hoặc nắm giữ một danh_mục đầu_tư cố_định"
+  output:
+    - label: "Top 2"
+      score: 2.5138
+pipeline_tag: text-classification
 ---
 ## Installation
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
 model = AutoModelForSequenceClassification.from_pretrained('itdainb/vietnamese-cross-encoder')
 tokenizer = AutoTokenizer.from_pretrained('itdainb/vietnamese-cross-encoder')
 features = tokenizer(tokenized_pairs, padding=True, truncation="longest_first", return_tensors="pt", max_length=256)
 model.eval()
 with torch.no_grad():
     model_predictions = model(**features, return_dict=True)
+    logits = model_predictions.logits
     scores = [score[0] for score in logits]
     print(scores)