Package: github.com/gomlx/gemma
Version: v0.0.0-20241101110232-83792b257ba9
Repository: https://github.com/gomlx/gemma.git
Documentation: https://pkg.go.dev/github.com/gomlx/gemma

# Functions

ApplyRotaryPositionEncoding (aka. RoPE) applies rotary position encoding to its input (see the sketch after this list).
Attention builds an attention layer, optionally using cache to store a limited amount of context.
AttentionTypeString retrieves an enum value from the enum constants string name.
AttentionTypeStrings returns a slice of all String values of the enum.
AttentionTypeValues returns all values of the enum.
Block implements one transformer block for the Gemma model.
DecodeTokens uses the same table as EmbedTokens to convert embeddings back to tokens -- or to token logits.
EmbedTokens embeds tokens using the weights in Config.
GatedFeedForward implements the gated feed-forward layer for Gemma, with one intermediary layer of size hiddenDim.
GemmaTypeString retrieves an enum value from the enum constants string name.
GemmaTypeStrings returns a slice of all String values of the enum.
GemmaTypeValues returns all values of the enum.
GemmaWithCache creates a forward path on a Gemma model for one decoding step, using the weights in Config to initialize the variables.
HuggingFaceGatedFeedForward implements the HuggingFace version of the gated feed-forward layer for Gemma, with transposed weights and one intermediary layer of size hiddenDim.
KernelEinsum multiplies the input by a kernel of the given shape, using the given graph.EinSum equation.
Must panics if the error is not nil.
Must1 panics in case of error; otherwise it returns the one return value (both helpers are sketched after this list).
NewConfigFromContext creates a transformers model configuration based on the structure of the variables in the given context -- the scope has to be set directly to the model variables.
QueryPreAttentionNormalisationTypeString retrieves an enum value from the enum constants string name.
QueryPreAttentionNormalisationTypeStrings returns a slice of all String values of the enum.
QueryPreAttentionNormalisationTypeValues returns all values of the enum.
RMSNorm normalizes by the root-mean-square, x = x / √(mean(x², axis=-1) + ε), and applies a learned scale (see the sketch after this list).
SoftCap squashes values using Tanh, so they won't go beyond ±cap (see the sketch after this list).
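
ApplyRotaryPositionEncoding presumably follows the standard RoPE formulation, where each pair of coordinates in a head vector is rotated by an angle proportional to the token position, decaying geometrically with the pair index. Below is a minimal, self-contained sketch of that standard formulation (interleaved pairing assumed), illustrative only and not the package's actual implementation:

```go
package main

import (
	"fmt"
	"math"
)

// ropeRotate applies the standard rotary position encoding (RoPE) to a
// single head vector x at sequence position pos: each pair
// (x[2i], x[2i+1]) is rotated by the angle pos / maxWaveLength^(2i/dim).
func ropeRotate(x []float64, pos int, maxWaveLength float64) []float64 {
	dim := len(x)
	out := make([]float64, dim)
	for i := 0; i+1 < dim; i += 2 {
		theta := float64(pos) / math.Pow(maxWaveLength, float64(i)/float64(dim))
		sin, cos := math.Sincos(theta)
		out[i] = x[i]*cos - x[i+1]*sin
		out[i+1] = x[i]*sin + x[i+1]*cos
	}
	return out
}

func main() {
	// The rotation angle grows with position, so the same vector
	// encodes differently at different positions.
	x := []float64{1, 0, 1, 0}
	fmt.Println(ropeRotate(x, 0, 10000)) // identity at position 0
	fmt.Println(ropeRotate(x, 3, 10000))
}
```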
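
Must and Must1 describe the common Go panic-on-error idiom; here is a minimal sketch consistent with the one-line descriptions above (the package's actual definitions may differ in detail):

```go
package main

import (
	"fmt"
	"strconv"
)

// Must panics if err is not nil.
func Must(err error) {
	if err != nil {
		panic(err)
	}
}

// Must1 panics in case of error; otherwise it returns the one value,
// letting a (T, error) call be used inline.
func Must1[T any](value T, err error) T {
	Must(err)
	return value
}

func main() {
	n := Must1(strconv.Atoi("42")) // panics if the string doesn't parse
	fmt.Println(n + 1)             // 43
}
```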
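
The RMSNorm formula above, written out over a plain slice. The (1 + scale) convention for the learned scale is the usual Gemma-style choice and is an assumption here:

```go
package main

import (
	"fmt"
	"math"
)

// rmsNorm normalizes x by its root-mean-square along the last axis,
// x / sqrt(mean(x^2) + epsilon), then applies the learned scale.
func rmsNorm(x, scale []float64, epsilon float64) []float64 {
	meanSq := 0.0
	for _, v := range x {
		meanSq += v * v
	}
	meanSq /= float64(len(x))
	inv := 1 / math.Sqrt(meanSq+epsilon)
	out := make([]float64, len(x))
	for i, v := range x {
		out[i] = v * inv * (1 + scale[i])
	}
	return out
}

func main() {
	x := []float64{1, 2, 3, 4}
	scale := make([]float64, len(x)) // zero scale leaves plain RMS normalization
	fmt.Println(rmsNorm(x, scale, 1e-6))
}
```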
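
SoftCap is the soft-capping trick used by Gemma-style models: cap * tanh(x / cap) is approximately the identity for |x| much smaller than cap and saturates at ±cap. A runnable sketch:

```go
package main

import (
	"fmt"
	"math"
)

// softCap squashes x smoothly into (-capValue, +capValue); small inputs
// pass through almost unchanged, large ones saturate at the cap.
func softCap(x, capValue float64) float64 {
	return capValue * math.Tanh(x/capValue)
}

func main() {
	for _, x := range []float64{1, 10, 100} {
		fmt.Printf("softCap(%g, 30) = %.2f\n", x, softCap(x, 30))
	}
}
```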

# Constants

QueryNormTypeByEmbedDimDivNumHeads selects scaling the query by `embed_dim // num_heads`.
QueryNormTypeByOneOverSqrtEmbedDimDivNumHeads selects scaling the query by `1/sqrt(embed_dim // num_heads)`.
QueryNormTypeByOneOverSqrtHeadDim selects scaling the query by `1/sqrt(head_dim)` (a worked comparison follows this list).
RoPEDefaultMaxWaveLength is a default value to use for rotary positional encoding.
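
As a worked comparison of the three query scaling options: with hypothetical dimensions where head_dim equals embed_dim // num_heads, the last two options coincide:

```go
package main

import (
	"fmt"
	"math"
)

func main() {
	// Hypothetical dimensions, chosen only to compare the scalings.
	embedDim, numHeads, headDim := 2048, 8, 256

	byEmbedDimDivNumHeads := float64(embedDim / numHeads)
	byOneOverSqrtEmbedDimDivNumHeads := 1 / math.Sqrt(float64(embedDim/numHeads))
	byOneOverSqrtHeadDim := 1 / math.Sqrt(float64(headDim))

	fmt.Println(byEmbedDimDivNumHeads)            // 256
	fmt.Println(byOneOverSqrtEmbedDimDivNumHeads) // 0.0625
	fmt.Println(byOneOverSqrtHeadDim)             // 0.0625, same: head_dim == embed_dim/num_heads
}
```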

# Structs

Cache is a state cache of a (batch of) sequence being encoded/decoded.
Config holds the configuration of the Gemma transformer model.

# Type aliases

AttentionType is an enum; see AttentionTypeString, AttentionTypeStrings, and AttentionTypeValues under Functions.
GemmaType is an enum; see GemmaTypeString, GemmaTypeStrings, and GemmaTypeValues under Functions.
QueryPreAttentionNormalisationType defines how to normalize the query before attention (usage sketch below).
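
The *String, *Strings, and *Values helpers listed under Functions match the signatures generated by the enumer tool, e.g. QueryPreAttentionNormalisationTypeString(s string) (QueryPreAttentionNormalisationType, error). A hypothetical usage sketch -- the transformers import path is inferred from NewConfigFromContext's description, and the constant name passed in is an assumption:

```go
package main

import (
	"fmt"
	"log"

	"github.com/gomlx/gemma/transformers" // assumed import path
)

func main() {
	// All registered string names of the enum.
	fmt.Println(transformers.QueryPreAttentionNormalisationTypeStrings())

	// Round-trip a name back to its enum value; the constant name
	// below is assumed, use one of the names printed above.
	v, err := transformers.QueryPreAttentionNormalisationTypeString("QueryNormTypeByOneOverSqrtHeadDim")
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(v)
}
```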