YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Model Card for h10505jd-a63140nd-ED-Opt-B

This is a sequence relation classification model that was trained to detect whether a given piece of evidence is relevant to a given claim.

Model Details

Model Description

This model addresses the Evidence Detection (ED) shared task: given a claim and a piece of evidence, determine if the evidence is relevant to that claim (binary classification). This model has a Bert preprocessor and encoder, that has not been fine-tuned, that feed into a multi layered BLSTM model with self-attention mechanism that was fine-tuned on 21K pairs of texts. The input sequences are concatenated to form a larger input sequence, with each sequence preceded by "CLAIM:" and "EVIDENCE:" respectively.

Developed by: James Deslandes and Nikolaos Douranos
Language(s): English
Model type: Supervised
Model architecture: BLSTM

Model Resources

Preprocessor: "https://kaggle.com/models/tensorflow/bert/TensorFlow2/en-uncased-preprocess/3"
Encoder Model: https://www.kaggle.com/models/tensorflow/bert/TensorFlow2/en-uncased-l-12-h-768-a-12/4
Repo: https://huggingface.co/Jed612/encoder-BLSTM

Training Details

Training Data

This model was trained on 21K claim-evidence pairs.

Training Procedure

Training Hyperparameters

  - batch_size: 32
  - epochs: 4
  - learning_rate: 1e-4

Speeds, Sizes, Times

  - overall training time: 16 minutes
  - duration per training epoch: 4 minutes
  - model size: 500MB

Evaluation

Testing Data & Metrics

Testing Data

A seperate validation dataset of 6K claim-evidence pairs.

Metrics

  - ROC AUC
  - Specificity
  - Precision
  - Recall
  - F1-score
  - Accuracy
  - average accuracy over 4 models

Results

The model obtained an ROC AUC of 0.91, a specificity of 92.8%, a precision of 78.1% a recall of 66.6%, an F1-score of 71.9% and an accuracy of 85.6%. Four different models with this structure were trained and their accuracies averaged to 85.4%. The error bars show twice the standard deviation, either side of the mean.

Training and Validation Accuracy and Loss Mean:

Technical Specifications

Hardware

  - RAM: at least 4 GB
  - Storage: at least 50 GB,
  - GPU: T4

Software

  - Tensorflow
  - Tensorflow_hub
  - Keras 2

Bias, Risks, and Limitations

Any inputs (concatenation of two sequences) longer than 512 subwords will be truncated by the model.

Additional Information

The hyperparameters were determined by experimentation with different values.

Downloads last month: 5

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support