package
0.2.2
Repository: https://github.com/sugarme/tokenizer.git
Documentation: pkg.go.dev

# Functions

DefaultBpeDecoder create a new BpeDecoder with default suffix (`</w>`).
No description provided by the author
DefaultBpeDecoder create a new BpeDecoder with default suffix (`</w>`).
NewBpeDecoder creates a new BpeDecoder.
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
NewBpeDecoder creates a new BpeDecoder.

# Structs

Allows decoding Original BPE by joining all the tokens and then replacing the suffix used to identify end-of-words by whitespaces.
No description provided by the author
No description provided by the author
No description provided by the author
Fuse constructs Fuse decoder It's simply fuses all tokens into one big string.
No description provided by the author
No description provided by the author
WordPieceDecoder takes care of decoding a list of wordpiece tokens back into a readable string.