---
library_name: transformers
language:
- de
license: apache-2.0
tags:
- text-classification
- political-communication
- social-science
- synthetic-data
model_name: gbert-CTA-w-synth
base_model: deepset/gbert-large
pipeline_tag: text-classification
widget:
- text: "Heute Abend live dabei sein! Unser Wahlkampf-Event beginnt um 19 Uhr. Sei dabei!"
- text: "Mehr Informationen auf unserer Website."
- text: "Erfahre mehr über unser Wahlprogramm!"
results:
      - task:
          type: text-classification
          name: Call to Action Detection
        dataset:
          name: German Instagram Political Content 2021
          type: custom
        metrics:
          - name: Macro F1 Score
            type: f1
            value: 0.93
          - name: Binary F1 Score
            type: f1
            value: 0.89
          - name: Precision
            type: precision
            value: 0.98
          - name: Recall
            type: recall
            value: 0.81

---

# gbert-CTA-w-synth

gbert-CTA-w-synth is a fine-tuned version of the [German BERT model (GBERT)](https://huggingface.co/deepset/gbert-large) designed to detect Calls to Action (CTAs) in political Instagram content. It was developed to analyze political mobilization strategies during the 2021 German Federal Election, focusing on Instagram stories and posts.

This model is trained on real-world and synthetic data to mitigate class imbalances and improve performance. It specializes in detecting explicit and implicit CTAs in multimodal content, including captions, Optical Character Recognition (OCR) text from images, and video transcriptions.

## Model Description

- **Base Model**: `deepset/gbert-large`
- **Fine-tuned on**: German Instagram content, including captions, OCR text, and transcriptions
- **Synthetic Data**: [Augmented with synthetic training data generated using OpenAI’s GPT-4 to address class imbalance.](https://huggingface.co/datasets/chaichy/CTA-synthetic-dataset/)
- **Tasks**: Binary classification of CTA presence or absence in Instagram posts and stories.

For video transcriptions, we used [bofenghuang/whisper-large-v2-cv11-german](https://huggingface.co/bofenghuang/whisper-large-v2-cv11-german), a fine-tuned version of OpenAI's Whisper model adapted for the German language.

## Performance

The model was evaluated against human-annotated ground truth labels to ensure classification quality. We performed an evaluation using five-fold cross-validation to validate the model’s generalizability. The model was benchmarked with the following metrics:

- **Macro F1 score**: 0.93  
- **Binary F1 score**: 0.89  
- **Precision**: 0.98  
- **Recall**: 0.81  

The evaluation was based on a dataset containing 1,388 documents annotated by nine contributors. Disagreements were resolved using majority decisions.

## Usage

This model is intended for computational social science and political communication research, specifically for studying how political actors mobilize audiences on social media. It is effective for detecting Calls to Action in German-language social media content.

### How to Use

You can use this model with the `transformers` library in Python:

```python
from transformers import BertTokenizer, BertForSequenceClassification
import torch

# Load model and tokenizer
tokenizer = BertTokenizer.from_pretrained('chaichy/gbert-CTA-w-synth')
model = BertForSequenceClassification.from_pretrained('chaichy/gbert-CTA-w-synth')

# Tokenize input
inputs = tokenizer("Input text here", return_tensors="pt")

# Get classification results
outputs = model(**inputs)
logits = outputs.logits
predicted_class = torch.argmax(logits, dim=1)

# 0 for absence, 1 for presence of CTA
print(f"Predicted class: {predicted_class.item()}")
```

### Data

The model was trained on Instagram content collected during the 2021 German Federal Election campaign. This included:

- **Captions**: Text accompanying images or videos in posts.
- **OCR text**: Optical Character Recognition (OCR) extracted text from images.
- **Transcriptions**: Text extracted from video audio, using [bofenghuang/whisper-large-v2-cv11-german](https://huggingface.co/bofenghuang/whisper-large-v2-cv11-german).

The dataset contains both explicit and implicit CTAs, which are binary labeled (True/False). We generated [synthetic training data based on the original human-annotated dataset](https://huggingface.co/datasets/chaichy/CTA-synthetic-dataset) to handle class imbalance. The synthetic dataset was created using OpenAI’s GPT-4o, which mimicked real-world CTAs by generating new examples in a consistent political communication style.

## Ethical Considerations

The training data was collected from publicly available Instagram posts and stories shared by verified political accounts during the 2021 German Federal Election. No personal or sensitive data was included.

## Citation

If you use this model, please cite the following:

```
@misc{achmanndenkler2024detectingcallsactionmultimodal,
      title={Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram}, 
      author={Michael Achmann-Denkler and Jakob Fehle and Mario Haim and Christian Wolff},
      year={2024},
      eprint={2409.02690},
      archivePrefix={arXiv},
      primaryClass={cs.SI},
      url={https://arxiv.org/abs/2409.02690}, 
}
```