zeroMN
/

auto

Question Answering

Inference Endpoints

Model card Files Files and versions Community

zeroMN commited on Jan 1

Commit

1572e58

·

verified ·

1 Parent(s): 658ed05

Update README.md

Files changed (1) hide show

README.md +70 -33

README.md CHANGED Viewed

@@ -1,40 +1,77 @@
----
-language:
-- en
-- zh
-license: apache-2.0
-library_name: pytorch
-tags:
-- multimodal
-- vqa
-- text
-- audio
 datasets:
-- synthetic-dataset
 metrics:
-- accuracy
-- bleu
-- wer
-model-index:
-- name: AutoModel
-  results:
-  - task:
-      type: vqa
-      name: Visual Question Answering
-    dataset:
-      type: synthetic-dataset
-      name: Synthetic Multimodal Dataset
-      split: test
-    metrics:
-    - type: accuracy
-      value: 85
----
-# Model Card for AutoModel
-AutoModel 是一个多模态模型，支持图像、文本和语音输入...
----
 ### **3. 提供可下载文件**
 确保以下文件已上传到仓库，便于用户下载和运行：

+## 模型卡
+---------------------------------------------------------------------
+metadata:
+  language: multilingual # AutoModel 是一个支持多语言处理的多模态模型
+  license:
+    - apache-2.0
+    - MIT # Apache 2.0 和 MIT 是开源许可
+  library_name: pytorch  # 该模型基于 PyTorch 构建
+  tags:
+    - multimodal  # 该模型是多模态模型
+    - image  # 处理图像任务
+    - text  # 处理文本任务
+    - audio  # 处理语音任务
+    - vqa  # 支持视觉问答任务
+    - automatspeerecognition  # 支持自动语音识别任务
+    - retrieval  # 支持信息检索任务
 datasets:
+  - synthetdataset  # 训练和验证使用了合成的多模态数据集
 metrics:
+  - accuracy  # 视觉问答任务的准确率
+  - bleu  # 生成式任务（如字幕生成）的 BLEU 指标
+  - wer  # 语音识别任务的 WER（Word Error Rate）
+base_model: None  # 该模型为独立设计，没有基于预训练模型
+widget:
+  - text: "A cat playing with a ball"
+    example_title: "Cat"
+  - text: "A dog jumping over a fence"
+    example_title: "Dog"
+model_index:
+  - name: AutoModel
+    results:
+      - task:
+          type: vqa  # 支持视觉问答任务
+          name: Visual Question Answering
+        dataset:
+          type: synthetdataset
+          name: Synthetic Multimodal Dataset
+          config: default
+          split: test
+          revision: main
+        metrics:
+          - type: accuracy
+            value: 85.0
+            name: VQA Accuracy
+      - task:
+          type: automatspeerecognition
+          name: Automatic Speech Recognition
+        dataset:
+          type: synthetdataset
+          name: Synthetic Multimodal Dataset
+          config: default
+          split: test
+          revision: main
+        metrics:
+          - type: wer
+            value: 15.3
+            name: Test WER
+      - task:
+          type: captioning
+          name: Image Captioning
+        dataset:
+          type: synthetdataset
+          name: Synthetic Multimodal Dataset
+          config: default
+          split: test
+          revision: main
+        metrics:
+          - type: bleu
+            value: 27.5
+            name: BL4
+-----------------------------------------------------------
 ### **3. 提供可下载文件**
 确保以下文件已上传到仓库，便于用户下载和运行：