redshift_load>: Redshift load operations
The redshift_load> operator runs a COPY statement to load data from external storage into Redshift.
_export:
  redshift:
    host: my-redshift.1234abcd.us-east-1.redshift.amazonaws.com
    # port: 5439
    database: production_db
    user: app_user
    ssl: true
    # strict_transaction: false

+load_from_dynamodb_simple:
  redshift_load>:
  schema: myschema
  table: transactions
  from: dynamodb://transaction-table
  readratio: 123

+load_from_s3_with_many_options:
  redshift_load>:
  schema: myschema
  table: access_logs
  from: s3://my-app-bucket/access_logs/today
  manifest: true
  encrypted: true
  region: us-east-1
  csv: "'"
  delimiter: "$"
  # json: s3://my-app-bucket/access_logs/jsonpathfile
  # avro: auto
  # fixedwidth: host:15,code:3,method:15
  gzip: true
  # bzip2: true
  # lzop: true
  acceptanydate: true
  acceptinvchars: "&"
  blanksasnull: true
  dateformat: yyyy-MM-dd
  emptyasnull: true
  encoding: UTF8
  escape: false
  explicit_ids: true
  fillrecord: true
  ignoreblanklines: true
  ignoreheader: 2
  null_as: nULl
  removequotes: false
  roundec: true
  timeformat: YYYY-MM-DD HH:MI:SS
  trimblanks: true
  truncatecolumns: true
  comprows: 12
  compupdate: ON
  maxerror: 34
  # noload: true
  statupdate: false
  role_session_name: federated_user
  session_duration: 1800
  # temp_credentials: false
Secrets
If you don't know how to set secrets, refer to Managing Workflow Secret.
aws.redshift.password: NAME
Optional user password to use when connecting to the Redshift database.
aws.redshift_load.access_key_id, aws.redshift.access_key_id, aws.access_key_id
The AWS Access Key ID to use when accessing the data source. By default, this value is used to get temporary security credentials. See the temp_credentials option for details.
aws.redshift_load.secret_access_key, aws.redshift.secret_access_key, aws.secret_access_key
The AWS Secret Access Key to use when accessing the data source. By default, this value is used to get temporary security credentials. See the temp_credentials option for details.
aws.redshift_load.role_arn, aws.redshift.role_arn, aws.role_arn
Optional Amazon Resource Names (ARNs) used to copy data to Redshift. The role needs the AssumeRole role to use this option. Requires temp_credentials to be true. If this option isn't specified, this operator tries to use a federated user.
Options
database: NAME
Database name.
Examples:
database: my_db
host: NAME
Hostname or IP address of the database.
Examples:
host: db.foobar.com
port: NUMBER
Port number to connect to the database. Default: 5439.
Examples:
port: 2345
user: NAME
User to connect to the database.
Examples:
user: app_user
ssl: BOOLEAN
Enable SSL to connect to the database. Default: false.
Examples:
ssl: true
schema: NAME
Default schema name. Default: public.
Examples:
schema: my_schema
strict_transaction: BOOLEAN
Whether this operator uses a strict transaction to prevent unexpected duplicate records. Default: true. The operator creates and uses a status table in the database to make the operation idempotent. If creating a table isn't allowed, set this option to false.
Examples:
strict_transaction: false
status_table_schema: NAME
Schema name of the status table. Default: same as the value of the schema option.
Examples:
status_table_schema: writable_schema
status_table: NAME
Table name prefix of the status table. Default: __digdag_status.
Examples:
status_table: customized_status_table
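The three status-table options above can be combined when the connecting user cannot create tables in the target schema but can do so elsewhere. The following is a minimal sketch under that assumption; the task name and the idea that app_user has CREATE privileges on writable_schema are hypothetical.

```yaml
+load_with_status_table:
  redshift_load>:
  schema: myschema
  table: access_logs
  from: s3://my-app-bucket/access_logs/today
  # Keep the default idempotency guard enabled (shown explicitly).
  strict_transaction: true
  # Assumption: the connecting user may create tables in this schema.
  status_table_schema: writable_schema
  # Prefix used for the status table name.
  status_table: customized_status_table
```

If no schema is writable at all, setting strict_transaction to false avoids the status table, at the cost of the duplicate-record protection described above.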
table: NAME
Name of the table in the Redshift database to load data into.
Examples:
table: access_logs
from: URI
Parameter mapped to the FROM parameter of Redshift's COPY statement.
Examples:
from: s3://my-app-bucket/access_logs/today
manifest: BOOLEAN
Parameter mapped to the MANIFEST parameter of Redshift's COPY statement.
Examples:
manifest: true
encrypted: BOOLEAN
Parameter mapped to the ENCRYPTED parameter of Redshift's COPY statement.
Examples:
encrypted: true
readratio: NUMBER
Parameter mapped to the READRATIO parameter of Redshift's COPY statement.
Examples:
readratio: 150
region: NAME
Parameter mapped to the REGION parameter of Redshift's COPY statement.
Examples:
region: us-east-1
csv: CHARACTER
Parameter mapped to the CSV parameter of Redshift's COPY statement. To use the default quote character of the CSV parameter, set an empty string, like csv: ''.
Examples:
csv: "'"
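As an illustration of the empty-string form described above, here is a minimal sketch of a CSV load that relies on the default quote character. The task name and the single header row are assumptions made for the example, not part of the original documentation.

```yaml
+load_csv_default_quote:
  redshift_load>:
  schema: myschema
  table: access_logs
  from: s3://my-app-bucket/access_logs/today
  # Empty string: enable CSV with its default quote character.
  csv: ''
  # Assumption: each file starts with one header row to skip.
  ignoreheader: 1
```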
delimiter: CHARACTER
Parameter mapped to the DELIMITER parameter of Redshift's COPY statement.
Examples:
delimiter: "$"
json: URI
Parameter mapped to the JSON parameter of Redshift's COPY statement.
Examples:
json: auto
json: s3://my-app-bucket/access_logs/jsonpathfile
avro: URI
Parameter mapped to the AVRO parameter of Redshift's COPY statement.
Examples:
avro: auto
avro: s3://my-app-bucket/access_logs/jsonpathfile
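The json and avro options take either the keyword auto or the URI of a JSONPaths file. A minimal sketch of a JSON load follows; the task name is hypothetical, and the example assumes the source files are gzip-compressed JSON. Because COPY accepts only one data format per statement, csv, fixedwidth, and avro are left out here.

```yaml
+load_json_auto:
  redshift_load>:
  schema: myschema
  table: access_logs
  from: s3://my-app-bucket/access_logs/today
  # Match JSON field names to column names automatically.
  json: auto
  # Alternative: point at a JSONPaths file instead of 'auto'.
  # json: s3://my-app-bucket/access_logs/jsonpathfile
  # Assumption: the source files are gzip-compressed.
  gzip: true
```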
fixedwidth: CSV
Parameter mapped to the FIXEDWIDTH parameter of Redshift's COPY statement.
Examples:
fixedwidth: host:15,code:3,method:15
gzip: BOOLEAN
Parameter mapped to the GZIP parameter of Redshift's COPY statement.
Examples:
gzip: true
bzip2: BOOLEAN
Parameter mapped to the BZIP2 parameter of Redshift's COPY statement.
Examples:
bzip2: true
lzop: BOOLEAN
Parameter mapped to the LZOP parameter of Redshift's COPY statement.
Examples:
lzop: true
acceptanydate: BOOLEAN
Parameter mapped to the ACCEPTANYDATE parameter of Redshift's COPY statement.
Examples:
acceptanydate: true
acceptinvchars: CHARACTER
Parameter mapped to the ACCEPTINVCHARS parameter of Redshift's COPY statement.
Examples:
acceptinvchars: "&"
blanksasnull: BOOLEAN
Parameter mapped to the BLANKSASNULL parameter of Redshift's COPY statement.
Examples:
blanksasnull: true
dateformat: STRING
Parameter mapped to the DATEFORMAT parameter of Redshift's COPY statement.
Examples:
dateformat: yyyy-MM-dd
emptyasnull: BOOLEAN
Parameter mapped to the EMPTYASNULL parameter of Redshift's COPY statement.
Examples:
emptyasnull: true
encoding: TYPE
Parameter mapped to the ENCODING parameter of Redshift's COPY statement.
Examples:
encoding: UTF8
escape: BOOLEAN
Parameter mapped to the ESCAPE parameter of Redshift's COPY statement.
Examples:
escape: false
explicit_ids: BOOLEAN
Parameter mapped to the EXPLICIT_IDS parameter of Redshift's COPY statement.
Examples:
explicit_ids: true
fillrecord: BOOLEAN
Parameter mapped to the FILLRECORD parameter of Redshift's COPY statement.
Examples:
fillrecord: true
ignoreblanklines: BOOLEAN
Parameter mapped to the IGNOREBLANKLINES parameter of Redshift's COPY statement.
Examples:
ignoreblanklines: true
ignoreheader: NUMBER
Parameter mapped to the IGNOREHEADER parameter of Redshift's COPY statement.
Examples:
ignoreheader: 2
null_as: STRING
Parameter mapped to the NULL AS parameter of Redshift's COPY statement.
Examples:
null_as: nULl
removequotes: BOOLEAN
Parameter mapped to the REMOVEQUOTES parameter of Redshift's COPY statement.
Examples:
removequotes: false
roundec: BOOLEAN
Parameter mapped to the ROUNDEC parameter of Redshift's COPY statement.
Examples:
roundec: true
timeformat: STRING
Parameter mapped to the TIMEFORMAT parameter of Redshift's COPY statement.
Examples:
timeformat: YYYY-MM-DD HH:MI:SS
trimblanks: BOOLEAN
Parameter mapped to the TRIMBLANKS parameter of Redshift's COPY statement.
Examples:
trimblanks: true
truncatecolumns: BOOLEAN
Parameter mapped to the TRUNCATECOLUMNS parameter of Redshift's COPY statement.
Examples:
truncatecolumns: true
comprows: NUMBER
Parameter mapped to the COMPROWS parameter of Redshift's COPY statement.
Examples:
comprows: 12
compupdate: TYPE
Parameter mapped to the COMPUPDATE parameter of Redshift's COPY statement.
Examples:
compupdate: ON
maxerror: NUMBER
Parameter mapped to the MAXERROR parameter of Redshift's COPY statement.
Examples:
maxerror: 34
noload: BOOLEAN
Parameter mapped to the NOLOAD parameter of Redshift's COPY statement.
Examples:
noload: true
statupdate: TYPE
Parameter mapped to the STATUPDATE parameter of Redshift's COPY statement.
Examples:
statupdate: off
temp_credentials: BOOLEAN
Whether this operator uses temporary security credentials. Default: true. This operator tries to use temporary security credentials as follows:
- If role_arn is specified, it calls the AssumeRole action.
- If not, it calls the GetFederationToken action.
See the AWS Security Token Service documentation for details about AssumeRole and GetFederationToken.
So, by default, either the AssumeRole or the GetFederationToken action is called to obtain temporary security credentials for secure operation. If this option is disabled, the operator uses the credentials set in the secrets as-is instead of temporary security credentials.
Examples:
temp_credentials: false
session_duration: INTEGER
Session duration of temporary security credentials. Default: 3 hours. This option isn't used when temp_credentials is disabled.
Examples:
session_duration: 1800
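To show how temp_credentials and session_duration fit together, here is a minimal sketch of a task that keeps temporary credentials enabled. The task name is hypothetical, and the example assumes the access key secrets are already set. Whether AssumeRole or GetFederationToken is called depends on whether a role_arn secret is present, as described above. The value 1800 is taken from the example above and presumably means seconds, which this page does not state explicitly.

```yaml
+load_with_temporary_credentials:
  redshift_load>:
  schema: myschema
  table: access_logs
  from: s3://my-app-bucket/access_logs/today
  # Default value, shown explicitly: obtain temporary credentials first.
  # With a role_arn secret set, AssumeRole is called; otherwise GetFederationToken.
  temp_credentials: true
  # Shorter session than the 3-hour default (assumed to be in seconds).
  session_duration: 1800
```

Setting temp_credentials to false instead would pass the stored access key pair through unchanged, and session_duration would then be ignored.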