pkg.gl

Categorygithub.com/sugarme/tokenizerdecoder

package

0.2.2

Repository: https://github.com/sugarme/tokenizer.git

Documentation: pkg.go.dev

# Functions

DefaultBpeDecoder

DefaultBpeDecoder create a new BpeDecoder with default suffix (`</w>`).

DefaultCTC

No description provided by the author

DefaultWordpieceDecoder

DefaultBpeDecoder create a new BpeDecoder with default suffix (`</w>`).

NewBpeDecoder

NewBpeDecoder creates a new BpeDecoder.

NewByteFallback

No description provided by the author

NewCTC

No description provided by the author

NewFuse

No description provided by the author

NewSequence

No description provided by the author

NewStrip

No description provided by the author

NewWordPieceDecoder

NewBpeDecoder creates a new BpeDecoder.

# Structs

BpeDecoder

Allows decoding Original BPE by joining all the tokens and then replacing the suffix used to identify end-of-words by whitespaces.

ByteFallback

No description provided by the author

CTC

No description provided by the author

DecoderBase

No description provided by the author

Fuse

Fuse constructs Fuse decoder It's simply fuses all tokens into one big string.

Sequence

No description provided by the author

Strip

No description provided by the author

WordPieceDecoder

WordPieceDecoder takes care of decoding a list of wordpiece tokens back into a readable string.