Theta-Llama Tokenizers

#7
by tryumanshow - opened

Link: https://huggingface.co/NousResearch/Hermes-2-Theta-Llama-3-8B/commit/885173e97ab8572b444f7db1290d5d0386e26816

  },
    {
      "id": 128005,
      "content": "<|reserved_special_token_3|>",
      "single_word": false,
      "lstrip": false,
      "rstrip": false,

    "end_of_word_suffix": null,
    "fuse_unk": false,
    "byte_fallback": false,
    "ignore_merges": true,
    "vocab": {
      "!": 0,
      "\"": 1,

Can you tell me why <tool_response> is changed to <|reserved_special_token_3|>?
I am just practicing the model merging.
After the tokenizer.json file is changed to the current version, it always meets the tokenization error.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment