MARTINI_enrich_BERTopic_SETexasProudBoys

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_SETexasProudBoys")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 19
  • Number of training documents: 2381
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 america - antifa - biden - fuck - everyone 20 -1_america_antifa_biden_fuck
0 fucking - link - banned - everyone - poyb 1604 0_fucking_link_banned_everyone
1 texas - illegals - mexico - mcallen - border 110 1_texas_illegals_mexico_mcallen
2 transgenderism - sodomites - lgbtqia - brainwashing - controversial 82 2_transgenderism_sodomites_lgbtqia_brainwashing
3 whites - genocide - multiracialism - jonestown - shootings 66 3_whites_genocide_multiracialism_jonestown
4 jury - mistrial - prosecutors - rioters - kenosha 64 4_jury_mistrial_prosecutors_rioters
5 hyperinflation - fed - yellen - btc - banking 57 5_hyperinflation_fed_yellen_btc
6 vaccinated - pfizered - injecting - hahaha - soyjack 56 6_vaccinated_pfizered_injecting_hahaha
7 antifa - riot - protesters - olympia - shootout 37 7_antifa_riot_protesters_olympia
8 zionists - mossad - israeli - wolfowitz - traitor 36 8_zionists_mossad_israeli_wolfowitz
9 proudboys - fellas - fascist - fucking - brother 33 9_proudboys_fellas_fascist_fucking
10 guns - militia - civilrightstaskforce - repealed - dick 33 10_guns_militia_civilrightstaskforce_repealed
11 belarus - putin - crimean - sviatlana - sanctions 31 11_belarus_putin_crimean_sviatlana
12 fbi - terrorist - insurrectionists - mcveigh - suppressed 30 12_fbi_terrorist_insurrectionists_mcveigh
13 afghanistan - talibans - mujahideen - overthrew - globohomo 29 13_afghanistan_talibans_mujahideen_overthrew
14 vaccine - miscarriages - injected - mrna - shots 28 14_vaccine_miscarriages_injected_mrna
15 divine - lohengrin - gloria - eightfold - silence 25 15_divine_lohengrin_gloria_eightfold
16 airplanes - deadliest - fema - tornado - detonated 20 16_airplanes_deadliest_fema_tornado
17 biden - pelosi - pollster - lugenpresse - crackhead 20 17_biden_pelosi_pollster_lugenpresse

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
5
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.