package
1.13.0
Repository: https://github.com/llmariner/inference-manager.git
Documentation: pkg.go.dev

# Functions

ModelDir returns the directory where models are stored.
NewManager creates a new runtime manager.
NewOllamaClient creates a new Ollama runtime client.a.
NewPreloader creates a new Preloader.
NewTritonClient creates a new Triton runtime client.
NewUpdater creates a new Updater.
NewVLLMClient creates a new VLLM runtime client.
PreferredModelFormat returns the preferred model format.

# Variables

ErrRequestCanceled is returned when the request is canceled.

# Structs

Manager manages runtimes.
Preloader preloads models.
Updater updates runtimes at startup.

# Interfaces

Client is the interface for managing runtimes.
ClientFactory is the interface for creating a new Client given a model ID.