pkg.gl

title: "Instill Model" lang: "en-US" draft: false description: "Learn about how to set up a VDP Instill Model component https://github.com/instill-ai/instill-core"

The Instill Model component is an AI component that allows users to connect the AI models served on the Instill Model Platform. It can carry out the following tasks:

Classification
Instance Segmentation
Keypoint
Detection
OCR
Semantic Segmentation
Text Generation
Text Generation Chat
Text to Image
Visual Question Answering
Chat

Release Stage

Alpha

Configuration

The component definition and tasks are defined in the definition.json and tasks.json files respectively.

Supported Tasks

Classification

Classify images into predefined categories.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_CLASSIFICATION`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Image (required)	`image-base64`	string	Image base64

Output	ID	Type	Description
Category	`category`	string	The predicted category of the input.
Score	`score`	number	The confidence score of the predicted category of the input.

Instance Segmentation

Detect, localize and delineate multiple objects in images.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_INSTANCE_SEGMENTATION`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Image (required)	`image-base64`	string	Image base64

Output	ID	Type	Description
Objects	`objects`	array[object]	A list of detected instance bounding boxes.

Output Objects in Instance Segmentation

Objects

Field	Field ID	Type	Note
Bounding Box	`bounding-box`	object	The detected bounding box in (left, top, width, height) format.
Category	`category`	string	The predicted category of the bounding box.
RLE	`rle`	string	Run Length Encoding (RLE) of instance mask within the bounding box.
Score	`score`	number	The confidence score of the predicted instance object.

Bounding Box

Field	Field ID	Type	Note
Height	`height`	number	Bounding box height value
Left	`left`	number	Bounding box left x-axis value
Top	`top`	number	Bounding box top y-axis value
Width	`width`	number	Bounding box width value

Keypoint

Detect and localize multiple keypoints of objects in images.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_KEYPOINT`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Image (required)	`image-base64`	string	Image base64

Output	ID	Type	Description
Objects	`objects`	array[object]	A list of keypoint objects, a keypoint object includes all the pre-defined keypoints of a detected object.

Output Objects in Keypoint

Objects

Field	Field ID	Type	Note
Bounding Box	`bounding-box`	object	The detected bounding box in (left, top, width, height) format.
Keypoints	`keypoints`	array	A keypoint group is composed of a list of pre-defined keypoints of a detected object.
Score	`score`	number	The confidence score of the predicted object.

Keypoints

Field	Field ID	Type	Note
Visibility Score	`v`	number	visibility score of the keypoint.
X Coordinate	`x`	number	x coordinate of the keypoint.
Y Coordinate	`y`	number	y coordinate of the keypoint.

Bounding Box

Field	Field ID	Type	Note
Height	`height`	number	Bounding box height value
Left	`left`	number	Bounding box left x-axis value
Top	`top`	number	Bounding box top y-axis value
Width	`width`	number	Bounding box width value

Detection

Detect and localize multiple objects in images.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_DETECTION`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Image (required)	`image-base64`	string	Image base64

Output	ID	Type	Description
Objects	`objects`	array[object]	A list of detected objects.

Output Objects in Detection

Objects

Field	Field ID	Type	Note
Bounding box	`bounding-box`	object	The detected bounding box in (left, top, width, height) format.
Category	`category`	string	The predicted category of the bounding box.
Score	`score`	number	The confidence score of the predicted category of the bounding box.

Bounding Box

Field	Field ID	Type	Note
Height	`height`	number	Bounding box height value
Left	`left`	number	Bounding box left x-axis value
Top	`top`	number	Bounding box top y-axis value
Width	`width`	number	Bounding box width value

OCR

Detect and recognize text in images.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_OCR`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Image (required)	`image-base64`	string	Image base64

Output	ID	Type	Description
Objects	`objects`	array[object]	A list of detected bounding boxes.

Output Objects in OCR

Objects

Field	Field ID	Type	Note
Bounding Box	`bounding-box`	object	The detected bounding box in (left, top, width, height) format.
Score	`score`	number	The confidence score of the predicted object.
Text	`text`	string	Text string recognised per bounding box.

Bounding Box

Field	Field ID	Type	Note
Height	`height`	number	Bounding box height value
Left	`left`	number	Bounding box left x-axis value
Top	`top`	number	Bounding box top y-axis value
Width	`width`	number	Bounding box width value

Semantic Segmentation

Classify image pixels into predefined categories.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_SEMANTIC_SEGMENTATION`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Image (required)	`image-base64`	string	Image base64

Output	ID	Type	Description
Stuffs	`stuffs`	array[object]	A list of RLE binary masks.

Output Objects in Semantic Segmentation

Stuffs

Field	Field ID	Type	Note
Category	`category`	string	Category text string corresponding to each stuff mask.
RLE	`rle`	string	Run Length Encoding (RLE) of each stuff mask within the image.

Text Generation

Generate texts from input text prompts.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_TEXT_GENERATION`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Prompt (required)	`prompt`	string	The prompt text
System Message	`system-message`	string	The system message helps set the behavior of the assistant. For example, you can modify the personality of the assistant or provide specific instructions about how it should behave throughout the conversation. By default, the model’s behavior is using a generic message as "You are a helpful assistant."
Seed	`seed`	integer	The seed
Temperature	`temperature`	number	The temperature for sampling
Max New Tokens	`max-new-tokens`	integer	The maximum number of tokens for model to generate

Output	ID	Type	Description
Text	`text`	string	Text

Text Generation Chat

Generate texts from input text prompts and chat history.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_TEXT_GENERATION_CHAT`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Prompt (required)	`prompt`	string	The prompt text
System Message	`system-message`	string	The system message helps set the behavior of the assistant. For example, you can modify the personality of the assistant or provide specific instructions about how it should behave throughout the conversation. By default, the model’s behavior is using a generic message as "You are a helpful assistant."
Prompt Images	`prompt-images`	array[string]	The prompt images
Chat history	`chat-history`	array[object]	Incorporate external chat history, specifically previous messages within the conversation. Please note that System Message will be ignored and will not have any effect when this field is populated. Each message should adhere to the format: : {"role": "The message role, i.e. 'system', 'user' or 'assistant'", "content": "message content"}.
Seed	`seed`	integer	The seed
Temperature	`temperature`	number	The temperature for sampling
Max New Tokens	`max-new-tokens`	integer	The maximum number of tokens for model to generate

Input Objects in Text Generation Chat

Chat History

Incorporate external chat history, specifically previous messages within the conversation. Please note that System Message will be ignored and will not have any effect when this field is populated. Each message should adhere to the format: : {"role": "The message role, i.e. 'system', 'user' or 'assistant'", "content": "message content"}.

Field	Field ID	Type	Note
Content	`content`	array	The message content
Role	`role`	string	The message role, i.e. 'system', 'user' or 'assistant'

Content

The message content

Field	Field ID	Type	Note
Image URL	`image-url`	object	The image URL
Text	`text`	string	The text content.
Type	`type`	string	The type of the content part. Enum values `text` `image-url`

Image URL

The image URL

Field	Field ID	Type	Note
URL	`url`	string	Either a URL of the image or the base64 encoded image data.

Output	ID	Type	Description
Text	`text`	string	Text

Text to Image

Generate images from input text prompts.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_TEXT_TO_IMAGE`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Prompt (required)	`prompt`	string	The prompt text
Samples	`samples`	integer	The number of generated samples, default is 1
Seed	`seed`	integer	The seed, default is 0
Aspect ratio	`negative-prompt`	string	Keywords of what you do not wish to see in the output image.
Aspect ratio	`aspect-ratio`	string	Controls the aspect ratio of the generated image. Defaults to 1:1.

Output	ID	Type	Description
Images	`images`	array[string]	Images

Visual Question Answering

Answer questions based on a prompt and an image.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_VISUAL_QUESTION_ANSWERING`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Prompt (required)	`prompt`	string	The prompt text
System Message	`system-message`	string	The system message helps set the behavior of the assistant. For example, you can modify the personality of the assistant or provide specific instructions about how it should behave throughout the conversation. By default, the model’s behavior is using a generic message as "You are a helpful assistant."
Prompt Images	`prompt-images`	array[string]	The prompt images
Chat history	`chat-history`	array[object]	Incorporate external chat history, specifically previous messages within the conversation. Please note that System Message will be ignored and will not have any effect when this field is populated. Each message should adhere to the format: : {"role": "The message role, i.e. 'system', 'user' or 'assistant'", "content": "message content"}.
Seed	`seed`	integer	The seed
Temperature	`temperature`	number	The temperature for sampling
Max New Tokens	`max-new-tokens`	integer	The maximum number of tokens for model to generate

Input Objects in Visual Question Answering

Chat History

Field	Field ID	Type	Note
Content	`content`	array	The message content
Role	`role`	string	The message role, i.e. 'system', 'user' or 'assistant'

Content

The message content

Field	Field ID	Type	Note
Image URL	`image-url`	object	The image URL
Text	`text`	string	The text content.
Type	`type`	string	The type of the content part. Enum values `text` `image-url`

Image URL

The image URL

Field	Field ID	Type	Note
URL	`url`	string	Either a URL of the image or the base64 encoded image data.

Output	ID	Type	Description
Text	`text`	string	Text

Chat

Generate texts from input text prompts and chat history.

Input	ID	Type	Description
Task ID (required)	`task`	string	`TASK_CHAT`
Model Name (required)	`model-name`	string	The Instill Model model to be used.
Prompt (required)	`prompt`	string	The prompt text
System Message	`system-message`	string	The system message helps set the behavior of the assistant. For example, you can modify the personality of the assistant or provide specific instructions about how it should behave throughout the conversation. By default, the model’s behavior is using a generic message as "You are a helpful assistant."
Prompt Images	`prompt-images`	array[string]	The prompt images
Chat history	`chat-history`	array[object]	Incorporate external chat history, specifically previous messages within the conversation. Please note that System Message will be ignored and will not have any effect when this field is populated. Each message should adhere to the format: : {"role": "The message role, i.e. 'system', 'user' or 'assistant'", "content": "message content"}.
Seed	`seed`	integer	The seed
Temperature	`temperature`	number	The temperature for sampling
Max New Tokens	`max-new-tokens`	integer	The maximum number of tokens for model to generate

Input Objects in Chat

Chat History

Field	Field ID	Type	Note
Content	`content`	array	The message content
Role	`role`	string	The message role, i.e. 'system', 'user' or 'assistant'

Content

The message content

Field	Field ID	Type	Note
Image URL	`image-url`	object	The image URL
Text	`text`	string	The text content.
Type	`type`	string	The type of the content part. Enum values `text` `image-url`

Image URL

The image URL

Field	Field ID	Type	Note
URL	`url`	string	Either a URL of the image or the base64 encoded image data.

Output	ID	Type	Description
Text	`text`	string	Text

# README

title: "Instill Model" lang: "en-US" draft: false description: "Learn about how to set up a VDP Instill Model component https://github.com/instill-ai/instill-core"

Release Stage

Configuration

Supported Tasks

Classification

Instance Segmentation

Objects

Bounding Box

Keypoint

Objects

Keypoints

Bounding Box

Detection

Objects

Bounding Box

OCR

Objects

Bounding Box

Semantic Segmentation

Stuffs

Text Generation

Text Generation Chat

Chat History

Content

Image URL

Text to Image

Visual Question Answering

Chat History

Content

Image URL

Chat

Chat History

Content

Image URL

# Functions

# Structs