pkg.gl

Categorygithub.com/CloudNativeAI/modctlpkgmodelfilecommand

package

0.0.0-20250210100259-db1763c83ee8

Repository: https://github.com/cloudnativeai/modctl.git

Documentation: pkg.go.dev

# Constants

ARCH

ARCH is the command to set the architecture of the model, such as transformer, cnn, rnn, etc.

CONFIG

CONFIG is the command to set the configuration of the model, which is used for the model to be served, such as the config.json, generation_config.json, etc.

FAMILY

FAMILY is the command to set the family of the model, such as llama3, gpt2, qwen2, etc.

FORMAT

FORMAT is the command to set the format of the model, such as onnx, tensorflow, pytorch, etc.

MODEL

MODEL is the command to set the model file path.

NAME

NAME is the command to set the model name, such as llama3-8b-instruct, gpt2-xl, qwen2-vl-72b-instruct, etc.

PARAMSIZE

PARAMSIZE is the command to set the parameter size of the model.

PRECISION

PRECISION is the command to set the precision of the model, such as bf16, fp16, int8, etc.

QUANTIZATION

QUANTIZATION is the command to set the quantization of the model, such as awq, gptq, etc.

# Variables

Commands

Commands is a list of all the commands that can be used in a modelfile.