transformers_issues_topics

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("aouichaty/transformers_issues_topics")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 30
  • Number of training documents: 9000
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 bert - tensorflow - pytorch - pretrained - trainer 12 -1_bert_tensorflow_pytorch_pretrained
0 tokenizer - tokenizers - tokenization - tokenize - token 2216 0_tokenizer_tokenizers_tokenization_tokenize
1 gpt2 - gpt2doubleheadsmodel - gpt - gpt2lmheadmodel - distilgpt2 1767 1_gpt2_gpt2doubleheadsmodel_gpt_gpt2lmheadmodel
2 trainer - trainertrain - trainers - training - tf 739 2_trainer_trainertrain_trainers_training
3 modelcard - modelcards - card - model - models 692 3_modelcard_modelcards_card_model
4 gluepy - glue - glueconvertexamplestofeatures - attributeerror - importerror 533 4_gluepy_glue_glueconvertexamplestofeatures_attributeerror
5 summarization - summaries - summary - text - nlp 427 5_summarization_summaries_summary_text
6 s2s - exampless2s - seq2seq - seq2seqtrainer - runseq2seq 345 6_s2s_exampless2s_seq2seq_seq2seqtrainer
7 readmemd - readmetxt - readme - modelcard - file 317 7_readmemd_readmetxt_readme_modelcard
8 typos - typo - fix - correction - fixed 251 8_typos_typo_fix_correction
9 transformerscli - transformers - transformer - transformerxl - importerror 238 9_transformerscli_transformers_transformer_transformerxl
10 deprecate - deprecation - deprecated - warnings - warning 190 10_deprecate_deprecation_deprecated_warnings
11 ner - pipeline - pipelines - nerpipeline - fillmaskpipeline 155 11_ner_pipeline_pipelines_nerpipeline
12 flaxbertformaskedlm - flax - flaxelectraformaskedlm - flaxjax - distilbertmodel 149 12_flaxbertformaskedlm_flax_flaxelectraformaskedlm_flaxjax
13 longformer - longformers - longform - longformerformultiplechoice - longformertokenizerfast 147 13_longformer_longformers_longform_longformerformultiplechoice
14 tests - testing - test - testgeneratefp16 - slow 138 14_tests_testing_test_testgeneratefp16
15 onnx - onnxonnxruntime - onnxexport - 04onnxexport - 04onnxexportipynb 125 15_onnx_onnxonnxruntime_onnxexport_04onnxexport
16 generationbeamsearchpy - generatebeamsearch - generatebeamsearchoutputs - beamsearch - nonbeamsearch 95 16_generationbeamsearchpy_generatebeamsearch_generatebeamsearchoutputs_beamsearch
17 benchmark - benchmarks - accuracy - precision - comparison 74 17_benchmark_benchmarks_accuracy_precision
18 wav2vec2 - wav2vec - wav2vec20 - wav2vec2forctc - wav2vec2xlrswav2vec2 69 18_wav2vec2_wav2vec_wav2vec20_wav2vec2forctc
19 huggingfacemaster - huggingfacetokenizers297 - huggingface - huggingfaces - huggingfacetransformers 52 19_huggingfacemaster_huggingfacetokenizers297_huggingface_huggingfaces
20 prophetnet - prophetnetmodel - prophetnetforcausallm - prophetnetngramattention - prophetnetforconditionalgeneration 52 20_prophetnet_prophetnetmodel_prophetnetforcausallm_prophetnetngramattention
21 notebook - notebooks - blenderbot3b - community - blenderbot 43 21_notebook_notebooks_blenderbot3b_community
22 cachedir - cache - cachedpath - caching - cached 43 22_cachedir_cache_cachedpath_caching
23 closed - add - bort - added - deleted 32 23_closed_add_bort_added
24 electra - electrapretrainedmodel - electraformaskedlm - electraformultiplechoice - electrafortokenclassification 28 24_electra_electrapretrainedmodel_electraformaskedlm_electraformultiplechoice
25 layoutlm - layout - layoutlmtokenizer - layoutlmbaseuncased - tf 25 25_layoutlm_layout_layoutlmtokenizer_layoutlmbaseuncased
26 isort - blackisortflake8 - github - repo - version 16 26_isort_blackisortflake8_github_repo
27 pplm - pr - deprecated - variable - ppl 15 27_pplm_pr_deprecated_variable
28 colab - cola - crashes - crash - tcmalloc 15 28_colab_cola_crashes_crash

Training hyperparameters

  • calculate_probabilities: False
  • language: english
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: 30
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: True

Framework versions

  • Numpy: 1.23.5
  • HDBSCAN: 0.8.33
  • UMAP: 0.5.3
  • Pandas: 1.5.3
  • Scikit-Learn: 1.2.2
  • Sentence-transformers: 2.2.2
  • Transformers: 4.33.1
  • Numba: 0.56.4
  • Plotly: 5.15.0
  • Python: 3.10.12
Downloads last month
4
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.