最終更新:2025-02-14 (金) 10:15:01 (33d)  

transformers.LlamaTokenizer
Top / transformers.LlamaTokenizer

https://huggingface.co/docs/transformers/model_doc/llama#transformers.LlamaTokenizer

メモ

  • The LLaMA tokenizer is a BPE model based on SentencePiece.
  • One quirk of sentencepiece is that when decoding a sequence, if the first token is the start of the word (e.g. “Banana”), the tokenizer does not prepend the prefix space to the string.

関連

  • transformers.LlamaTokenizerFast?