Categorygoogle.golang.org/genai

modulepackage

0.0.1

Repository: https://github.com/googleapis/go-genai.git

Documentation: pkg.go.dev

# README

GitHub go.mod Go version

✨ NEW ✨

Google Gemini Multimodal Live support

Introducing support for the Gemini Multimodal Live feature. Here's an example Multimodal Live server showing realtime conversation and video streaming: code

Google Gen AI Go SDK

The Google Gen AI Go SDK enables developers to use Google's state-of-the-art generative AI models (like Gemini) to build AI-powered features and applications. This SDK supports use cases like:

Generate text from text-only input
Generate text from text-and-images input (multimodal)
...

For example, with just a few lines of code, you can access Gemini's multimodal capabilities to generate text from text-and-image input.

parts := []*genai.Part{
  {Text: "What's this image about?"},
  {InlineData: &genai.Blob{Data: imageBytes, MIMEType: "image/jpeg"}},
}
result, err := client.Models.GenerateContent(ctx, "gemini-2.0-flash-exp", []*genai.Content{{Parts: parts}}, nil)

Installation and usage

Add the SDK to your module with go get google.golang.org/genai.

Create Clients

Imports

import "google.golang.org/genai"

Gemini API Client:

client, err := genai.NewClient(ctx, &genai.ClientConfig{
	APIKey:   apiKey,
	Backend:  genai.BackendGoogleAI,
})

Vertex AI Client:

client, err := genai.NewClient(ctx, &genai.ClientConfig{
	Project:  project,
	Location: location,
	Backend:  genai.BackendVertexAI,
})

License

The contents of this repository are licensed under the Apache License, version 2.0.

# Functions

NewClient

NewClient creates a new GenAI client.

Ptr

Ptr returns a pointer to its argument.

Text

Text returns a slice of Content with a single Part with the given text.

# Constants

BackendGoogleAI

BackendGoogleAI is the Google AI backend.

BackendUnspecified

BackendUnspecified causes the backend determined automatically.

BackendVertexAI

BackendVertexAI is the Vertex AI backend.

BlockedReasonBlocklist

Candidates blocked due to the terms which are included from the terminology blocklist.

BlockedReasonOther

Candidates blocked due to other reason.

BlockedReasonProhibitedContent

Candidates blocked due to prohibited content.

BlockedReasonSafety

Candidates blocked due to safety.

BlockedReasonUnspecified

Unspecified blocked reason.

ControlReferenceTypeControlTypeCanny

No description provided by the author

ControlReferenceTypeControlTypeDefault

No description provided by the author

ControlReferenceTypeControlTypeFaceMesh

No description provided by the author

ControlReferenceTypeControlTypeScribble

No description provided by the author

DynamicRetrievalConfigModeDynamic

Run retrieval only when system decides it is necessary.

DynamicRetrievalConfigModeUnspecified

Always trigger retrieval.

FinishReasonBlocklist

Token generation stopped because the content contains forbidden terms.

FinishReasonMalformedFunctionCall

The function call generated by the model is invalid.

FinishReasonMaxTokens

Token generation reached the configured maximum output tokens.

FinishReasonOther

All other reasons that stopped the token generation.

FinishReasonProhibitedContent

Token generation stopped for potentially containing prohibited content.

FinishReasonRecitation

The token generation stopped because of potential recitation.

FinishReasonSafety

Token generation stopped because the content potentially contains safety violations.

FinishReasonSPII

Token generation stopped because the content potentially contains Sensitive Personally Identifiable Information (SPII).

FinishReasonStop

Token generation reached a natural stopping point or a configured stop sequence.

FinishReasonUnspecified

The finish reason is unspecified.

FunctionCallingConfigModeAny

Model is constrained to always predicting function calls only.

FunctionCallingConfigModeAuto

Default model behavior, model decides to predict either function calls or natural language response.

FunctionCallingConfigModeNone

Model will not predict any function calls.

FunctionCallingConfigModeUnspecified

The function calling config mode is unspecified.

HarmBlockMethodProbability

The harm block method uses the probability score.

HarmBlockMethodSeverity

The harm block method uses both probability and severity scores.

HarmBlockMethodUnspecified

The harm block method is unspecified.

HarmBlockThresholdBlockLowAndAbove

Block low threshold and above (i.e.

HarmBlockThresholdBlockMediumAndAbove

Block medium threshold and above.

HarmBlockThresholdBlockNone

Block none.

HarmBlockThresholdBlockOnlyHigh

Block only high threshold (i.e.

HarmBlockThresholdOff

Turn off the safety filter.

HarmBlockThresholdUnspecified

Unspecified harm block threshold.

HarmCategoryCivicIntegrity

The harm category is civic integrity.

HarmCategoryDangerousContent

The harm category is dangerous content.

HarmCategoryHarassment

The harm category is harassment.

HarmCategoryHateSpeech

The harm category is hate speech.

HarmCategorySexuallyExplicit

The harm category is sexually explicit content.

HarmCategoryUnspecified

The harm category is unspecified.

HarmProbabilityHigh

High level of harm.

HarmProbabilityLow

Low level of harm.

HarmProbabilityMedium

Medium level of harm.

HarmProbabilityNegligible

Negligible level of harm.

HarmProbabilityUnspecified

Harm probability unspecified.

HarmSeverityHigh

High level of harm severity.

HarmSeverityLow

Low level of harm severity.

HarmSeverityMedium

Medium level of harm severity.

HarmSeverityNegligible

Negligible level of harm severity.

HarmSeverityUnspecified

Harm severity unspecified.

LanguagePython

Python >= 3.10, with numpy and simpy available.

LanguageUnspecified

Unspecified language.

MaskReferenceModeMaskModeBackground

No description provided by the author

MaskReferenceModeMaskModeDefault

No description provided by the author

MaskReferenceModeMaskModeForeground

No description provided by the author

MaskReferenceModeMaskModeSemantic

No description provided by the author

MaskReferenceModeMaskModeUserProvided

No description provided by the author

MediaResolutionHigh

Media resolution set to high (zoomed reframing with 256 tokens).

MediaResolutionLow

Media resolution set to low (64 tokens).

MediaResolutionMedium

Media resolution set to medium (256 tokens).

MediaResolutionUnspecified

Media resolution has not been set.

ModeDynamic

Run retrieval only when system decides it is necessary.

ModeUnspecified

Always trigger retrieval.

OutcomeDeadlineExceeded

Code execution ran for too long, and was cancelled.

OutcomeFailed

Code execution finished but with a failure.

OutcomeOK

Code execution completed successfully.

OutcomeUnspecified

Unspecified status.

SubjectReferenceTypeSubjectTypeAnimal

No description provided by the author

SubjectReferenceTypeSubjectTypeDefault

No description provided by the author

SubjectReferenceTypeSubjectTypePerson

No description provided by the author

SubjectReferenceTypeSubjectTypeProduct

No description provided by the author

TypeArray

No description provided by the author

TypeBoolean

No description provided by the author

TypeInteger

No description provided by the author

TypeNumber

No description provided by the author

TypeObject

No description provided by the author

TypeString

No description provided by the author

TypeUnspecified

No description provided by the author

# Structs

Blob

Content blob.

Candidate

Class containing a response candidate generated from the model.

Citation

Source attributions for content.

CitationMetadata

Class for citation information when the model quotes another source.

Client

Client is the GenAI client.

ClientConfig

ClientConfig is the configuration for the GenAI client.

ClientError

ClientError is an error that occurs when the GenAI API receives an invalid request from a client.

CodeExecutionResult

Result of executing the [ExecutableCode].

Content

Contains the multi-part content of a message.

ControlReferenceConfig

Configuration for a Control reference image.

ControlReferenceImage

Class that represents a Control reference image.

DynamicRetrievalConfig

Describes the options to customize dynamic retrieval.

ExecutableCode

Code generated by the model that is meant to be executed, and the result returned to the model.

FileData

URI based data.

FunctionCall

A function call.

FunctionCallingConfig

Function calling config.

FunctionDeclaration

Defines a function that the model can generate JSON inputs for.

FunctionResponse

A function response.

GenerateContentConfig

Class for configuring optional model parameters.

GenerateContentParameters

Class for configuring the content of the request to the model.

GenerateContentResponse

Response message for PredictionService.GenerateContent.

GenerateContentResponsePromptFeedback

Content filter results for a prompt sent in the request.

GenerateContentResponseUsageMetadata

Usage metadata about response(s).

GenerationConfig

Generation config.

GenerationConfigRoutingConfig

The configuration for routing the request to a specific model.

GenerationConfigRoutingConfigAutoRoutingMode

When automated routing is specified, the routing will be determined by the pretrained routing model and customer provided model routing preference.

GenerationConfigRoutingConfigManualRoutingMode

When manual routing is set, the specified model will be used directly.

GoogleSearch

Tool to support Google Search in Model.

GoogleSearchRetrieval

Tool to retrieve public web data for grounding, powered by Google.

GroundingChunk

Grounding chunk.

GroundingChunkRetrievedContext

Chunk from context retrieved by the retrieval tools.

GroundingChunkWeb

Chunk from the web.

GroundingMetadata

Metadata returned to client when grounding is enabled.

GroundingSupport

Grounding support.

Image

Class that represents an image.

Live

Live struct encapsulates the configuration for realtime interaction with the Generative Language API.

LiveClientContent

Incremental update of the current conversation delivered from the client.

LiveClientMessage

Messages sent by the client in the API call.

LiveClientRealtimeInput

User input that is sent in real time.

LiveClientSetup

Message contains configuration that will apply for the duration of the streaming session.

LiveClientToolResponse

Client generated response to a `ToolCall` received from the server.

LiveConnectConfig

Config class for the session.

LiveServerContent

Incremental server update generated by the model in response to client messages.

LiveServerMessage

Response message for API call.

LiveServerSetupComplete

Sent in response to a `LiveGenerateContentSetup` message from the client.

LiveServerToolCall

Request for the client to execute the `function_calls` and return the responses with the matching `id`s.

LiveServerToolCallCancellation

Notification for the client that a previously issued `ToolCallMessage` with the specified `id`s should have been not executed and should be cancelled.

LogprobsResult

Logprobs Result.

LogprobsResultCandidate

Candidate for the logprobs token and score.

LogprobsResultTopCandidates

Candidates with top log probabilities at each decoding step.

MaskReferenceConfig

Configuration for a Mask reference image.

MaskReferenceImage

Class that represents a Mask reference image.

Models

No description provided by the author

Part

A datatype containing media content.

PrebuiltVoiceConfig

The configuration for the prebuilt speaker to use.

RawReferenceImage

Class that represents a Raw reference image.

Retrieval

Defines a retrieval tool that model can call to access external knowledge.

RetrievalMetadata

Metadata related to retrieval in the grounding flow.

SafetyRating

Safety rating corresponding to the generated content.

SafetySetting

Safety settings.

Schema

Schema that defines the format of input and output data.

SearchEntryPoint

Google search entry point.

Segment

Segment of the content.

ServerError

ServerError is an error that occurs when the GenAI API encounters an unexpected server problem.

Session

Session struct represents a realtime connection to the API.

SpeechConfig

The speech generation configuration.

StyleReferenceConfig

Configuration for a Style reference image.

StyleReferenceImage

Class that represents a Style reference image.

SubjectReferenceConfig

Configuration for a Subject reference image.

SubjectReferenceImage

Class that represents a Subject reference image.

Tool

Tool details of a tool that the model may use to generate a response.

ToolCodeExecution

Tool that executes code generated by the model, and automatically returns the result to the model.

ToolConfig

Tool config.

UploadFileConfig

Used to override the default configuration.

UpscaleImageConfig

Configuration for upscaling an image.

UpscaleImageParameters

User-facing config UpscaleImageParameters.

VertexAISearch

Retrieve from Vertex AI Search datastore for grounding.

VertexRAGStore

Retrieve from Vertex RAG Store for grounding.

VertexRAGStoreRAGResource

The definition of the RAG resource.

VideoMetadata

Metadata describes the input video content.

VoiceConfig

The configuration for the voice to use.

# Type aliases

Backend

Backend is the GenAI backend to use for the client.

BlockedReason

Blocked reason.

ControlReferenceType

Enum representing the control type of a control reference image.

DynamicRetrievalConfigMode

Config class for the dynamic retrieval config mode.

FinishReason

The reason why the model stopped generating tokens.

FunctionCallingConfigMode

Config class for the function calling config mode.

HarmBlockMethod

Specify if the threshold is used for probability or severity score.

HarmBlockThreshold

The harm block threshold.

HarmCategory

Harm category.

HarmProbability

Harm probability levels in the content.

HarmSeverity

Harm severity levels in the content.

Language

Programming language of the `code`.

MaskReferenceMode

Enum representing the mask mode of a mask reference image.

MediaResolution

The media resolution to use.

Mode

The mode of the predictor to be used in dynamic retrieval.

Outcome

Outcome of the code execution.

SubjectReferenceType

Enum representing the subject type of a subject reference image.

Type

A basic data type.