# README
Amazon Timestream Output Plugin
This plugin writes metrics to the Amazon Timestream service.
⭐ Telegraf v1.16.0 🏷️ cloud, datastore 💻 all
Authentication
This plugin uses a credential chain for Authentication with Timestream API endpoint. In the following order the plugin will attempt to authenticate.
- Web identity provider credentials via STS if
role_arn
andweb_identity_token_file
are specified - [Assumed credentials via STS] if
role_arn
attribute is specified (source credentials are evaluated from subsequent rules). Theendpoint_url
attribute is used only for Timestream service. When fetching credentials, STS global endpoint will be used. - Web identity provider credentials via STS if
role_arn
andweb_identity_token_file
are specified - Assumed credentials via STS if
role_arn
attribute is specified (source credentials are evaluated from subsequent rules). Theendpoint_url
attribute is used only for Timestream service. When fetching credentials, STS global endpoint will be used. - Explicit credentials from
access_key
,secret_key
, andtoken
attributes - Shared profile from
profile
attribute - Environment Variables
- Shared Credentials
- EC2 Instance Profile
Global configuration options
In addition to the plugin-specific configuration settings, plugins support additional global and plugin configuration settings. These settings are used to modify metrics, tags, and field or create aliases and configure ordering, etc. See the CONFIGURATION.md for more details.
Configuration
# Configuration for sending metrics to Amazon Timestream.
[[outputs.timestream]]
## Amazon Region
region = "us-east-1"
## Amazon Credentials
## Credentials are loaded in the following order:
## 1) Web identity provider credentials via STS if role_arn and
## web_identity_token_file are specified
## 2) Assumed credentials via STS if role_arn is specified
## 3) explicit credentials from 'access_key' and 'secret_key'
## 4) shared profile from 'profile'
## 5) environment variables
## 6) shared credentials file
## 7) EC2 Instance Profile
#access_key = ""
#secret_key = ""
#token = ""
#role_arn = ""
#web_identity_token_file = ""
#role_session_name = ""
#profile = ""
#shared_credential_file = ""
## Endpoint to make request against, the correct endpoint is automatically
## determined and this option should only be set if you wish to override the
## default.
## ex: endpoint_url = "http://localhost:8000"
# endpoint_url = ""
## Timestream database where the metrics will be inserted.
## The database must exist prior to starting Telegraf.
database_name = "yourDatabaseNameHere"
## Specifies if the plugin should describe the Timestream database upon
## starting to validate if it has access, necessary permissions, connection,
## etc., as a safety check. If the describe operation fails, the plugin will
## not start and therefore the Telegraf agent will not start.
describe_database_on_start = false
## Specifies how the data is organized in Timestream.
## Valid values are: single-table, multi-table.
## When mapping_mode is set to single-table, all of the data is stored in a
## single table. When mapping_mode is set to multi-table, the data is
## organized and stored in multiple tables. The default is multi-table.
mapping_mode = "multi-table"
## Specifies if the plugin should create the table, if it doesn't exist.
create_table_if_not_exists = true
## Specifies the Timestream table magnetic store retention period in days.
## Check Timestream documentation for more details.
## NOTE: This property is valid when create_table_if_not_exists = true.
create_table_magnetic_store_retention_period_in_days = 365
## Specifies the Timestream table memory store retention period in hours.
## Check Timestream documentation for more details.
## NOTE: This property is valid when create_table_if_not_exists = true.
create_table_memory_store_retention_period_in_hours = 24
## Specifies how the data is written into Timestream.
## Valid values are: true, false
## When use_multi_measure_records is set to true, all of the tags and fields
## are stored as a single row in a Timestream table.
## When use_multi_measure_record is set to false, Timestream stores each field
## in a separate table row, thereby storing the tags multiple times (once for
## each field). The recommended setting is true. The default is false.
use_multi_measure_records = "false"
## Specifies the measure_name to use when sending multi-measure records.
## NOTE: This property is valid when use_multi_measure_records=true and
## mapping_mode=multi-table
measure_name_for_multi_measure_records = "telegraf_measure"
## Specifies the name of the table to write data into
## NOTE: This property is valid when mapping_mode=single-table.
# single_table_name = ""
## Specifies the name of dimension when all of the data is being stored in a
## single table and the measurement name is transformed into the dimension
## value (see Mapping data from Influx to Timestream for details)
## NOTE: This property is valid when mapping_mode=single-table.
# single_table_dimension_name_for_telegraf_measurement_name = "namespace"
## Only valid and optional if create_table_if_not_exists = true
## Specifies the Timestream table tags.
## Check Timestream documentation for more details
# create_table_tags = { "foo" = "bar", "environment" = "dev"}
## Specify the maximum number of parallel go routines to ingest/write data
## If not specified, defaulted to 1 go routines
max_write_go_routines = 25
Unsigned Integers
Timestream does DOES NOT support unsigned int64 values. Values using uint64, which are less than the maximum signed int64 are returned as expected. Any larger value is caped at the maximum int64 value.
Batching
Timestream WriteInputRequest.CommonAttributes are used to efficiently write data to Timestream.
Multithreading
Single thread is used to write the data to Timestream, following general plugin design pattern.
Errors
In case of an attempt to write an unsupported by Timestream Telegraf Field type, the field is dropped and error is emitted to the logs.
In case of receiving ThrottlingException or InternalServerException from Timestream, the errors are returned to Telegraf, in which case Telegraf will keep the metrics in buffer and retry writing those metrics on the next flush.
In case of receiving ResourceNotFoundException:
- If
create_table_if_not_exists
configuration is set totrue
, the plugin will try to create appropriate table and write the records again, if the table creation was successful. - If
create_table_if_not_exists
configuration is set tofalse
, the records are dropped, and an error is emitted to the logs.
In case of receiving any other AWS error from Timestream, the records are dropped, and an error is emitted to the logs, as retrying such requests isn't likely to succeed.
Logging
Turn on debug flag in the Telegraf to turn on detailed logging (including records being written to Timestream).
Testing
Execute unit tests with:
go test -v ./plugins/outputs/timestream/...
Mapping data from Influx to Timestream
When writing data from Influx to Timestream, data is written by default as follows:
- The timestamp is written as the time field.
- Tags are written as dimensions.
- Fields are written as measures.
- Measurements are written as table names.
For example, consider the following data in line protocol format:
weather,location=us-midwest,season=summer temperature=82,humidity=71 1465839830100400200
airquality,location=us-west no2=5,pm25=16 1465839830100400200
where:
weather
and airquality
are the measurement names,
location
and season
are tags,
temperature
, humidity
, no2
, pm25
are fields.
When you choose to create a separate table for each measurement and store multiple fields in a single table row, the data will be written into Timestream as:
-
The plugin will create 2 tables, namely, weather and airquality (mapping_mode=multi-table).
-
The tables may contain multiple fields in a single table row (use_multi_measure_records=true).
-
The table weather will contain the following columns and data:
time location season measure_name temperature humidity 2016-06-13 17:43:50 us-midwest summer <measure_name_for_multi_measure_records>
82 71 -
The table airquality will contain the following columns and data:
time location measure_name no2 pm25 2016-06-13 17:43:50 us-west <measure_name_for_multi_measure_records>
5 16
NOTE:
<measure_name_for_multi_measure_records>
represents the actual
value of that property.
You can also choose to create a separate table per measurement and store each field in a separate row per table. In that case:
-
The plugin will create 2 tables, namely, weather and airquality (mapping_mode=multi-table).
-
Each table row will contain a single field only (use_multi_measure_records=false).
-
The table weather will contain the following columns and data:
time location season measure_name measure_value::bigint 2016-06-13 17:43:50 us-midwest summer temperature 82 2016-06-13 17:43:50 us-midwest summer humidity 71 -
The table airquality will contain the following columns and data:
time location measure_name measure_value::bigint 2016-06-13 17:43:50 us-west no2 5 2016-06-13 17:43:50 us-west pm25 16
You can also choose to store all the measurements in a single table and store all fields in a single table row. In that case:
-
This plugin will create a table with name <single_table_name> (mapping_mode=single-table).
-
The table may contain multiple fields in a single table row (use_multi_measure_records=true).
-
The table will contain the following column and data:
time location season <single_table_dimension_name_for_telegraf_measurement_name>
measure_name temperature humidity no2 pm25 2016-06-13 17:43:50 us-midwest summer weather <measure_name_for_multi_measure_records>
82 71 null null 2016-06-13 17:43:50 us-west null airquality <measure_name_for_multi_measure_records>
null null 5 16
NOTE:
<single_table_name>
represents the actual value of that property.
<single_table_dimension_name_for_telegraf_measurement_name>
represents
the actual value of that property.
<measure_name_for_multi_measure_records>
represents the actual value of
that property.
Furthermore, you can choose to store all the measurements in a single table and store each field in a separate table row. In that case:
-
Timestream will create a table with name <single_table_name> (mapping_mode=single-table).
-
Each table row will contain a single field only (use_multi_measure_records=false).
-
The table will contain the following column and data:
time location season namespace measure_name measure_value::bigint 2016-06-13 17:43:50 us-midwest summer weather temperature 82 2016-06-13 17:43:50 us-midwest summer weather humidity 71 2016-06-13 17:43:50 us-west NULL airquality no2 5 2016-06-13 17:43:50 us-west NULL airquality pm25 16
NOTE:
<single_table_name>
represents the actual value of that property.
<single_table_dimension_name_for_telegraf_measurement_name>
represents the
actual value of that property.
<measure_name_for_multi_measure_records>
represents the actual value of
that property.