athena
This component is responsible for provisioning an Amazon Athena workgroup, databases, and related resources.
Usage
Stack Level: Regional
Here are some example snippets for how to use this component:
stacks/catalog/athena/defaults.yaml
file (base component for all Athena deployments with default settings):
components:
terraform:
athena/defaults:
metadata:
type: abstract
settings:
spacelift:
workspace_enabled: true
vars:
enabled: true
tags:
Team: sre
Service: athena
create_s3_bucket: true
create_kms_key: true
athena_kms_key_deletion_window: 7
bytes_scanned_cutoff_per_query: null
enforce_workgroup_configuration: true
publish_cloudwatch_metrics_enabled: true
encryption_option: "SSE_KMS"
s3_output_path: ""
workgroup_state: "ENABLED"
database: []
import:
- catalog/athena/defaults
components:
terraform:
athena/example:
metadata:
component: athena
inherits:
- athena/defaults
vars:
enabled: true
name: athena-example
workgroup_description: "My Example Athena Workgroup"
database:
- example_db_1
- example_db_2
CloudTrail Integration
Using Athena with CloudTrail logs is a powerful way to enhance your analysis of AWS service activity. This component supports creating a CloudTrail table for each account and setting up queries to read CloudTrail logs from a centralized location.
To set up the CloudTrail Integration, first create the create
and alter
queries in Athena with this component. When
var.cloudtrail_database
is defined, this component will create these queries.
import:
- catalog/athena/defaults
components:
terraform:
athena/audit:
metadata:
component: athena
inherits:
- athena/defaults
vars:
enabled: true
name: athena-audit
workgroup_description: "Athena Workgroup for Auditing"
cloudtrail_database: audit
databases:
audit:
comment: "Auditor database for Athena"
properties: {}
named_queries:
platform_dev:
database: audit
description: "example query against CloudTrail logs"
query: |
SELECT
useridentity.arn,
eventname,
sourceipaddress,
eventtime
FROM %s.platform_dev_cloudtrail_logs
LIMIT 100;
Once those are created, run the create
and then the alter
queries in the AWS Console to create and then fill the
tables in Athena.
Athena runs queries with the permissions of the user executing the query. In order to be able to query CloudTrail logs,
the audit
account must have access to the KMS key used to encrypt CloudTrails logs. Set var.audit_access_enabled
to
true
in the cloudtrail
component
Variables
Required Variables
databases
(map(any)
) requiredMap of Athena databases and related configuration.
region
(string
) requiredAWS Region
Optional Variables
athena_kms_key
(string
) optionalUse an existing KMS key for Athena if
create_kms_key
isfalse
.Default value:
null
athena_kms_key_deletion_window
(number
) optionalKMS key deletion window (in days).
Default value:
7
athena_s3_bucket_id
(string
) optionalUse an existing S3 bucket for Athena query results if
create_s3_bucket
isfalse
.Default value:
null
bytes_scanned_cutoff_per_query
(number
) optionalInteger for the upper data usage limit (cutoff) for the amount of bytes a single query in a workgroup is allowed to scan. Must be at least 10485760.
Default value:
null
cloudtrail_bucket_component_name
(string
) optionalThe name of the CloudTrail bucket component
Default value:
"cloudtrail-bucket"
cloudtrail_database
(string
) optionalThe name of the Athena Database to use for CloudTrail logs. If set, an Athena table will be created for the CloudTrail trail.
Default value:
""
create_kms_key
(bool
) optionalEnable the creation of a KMS key used by Athena workgroup.
Default value:
true
create_s3_bucket
(bool
) optionalEnable the creation of an S3 bucket to use for Athena query results
Default value:
true
data_catalogs
(map(any)
) optionalMap of Athena data catalogs and parameters
Default value:
{ }
enforce_workgroup_configuration
(bool
) optionalBoolean whether the settings for the workgroup override client-side settings.
Default value:
true
named_queries
(map(map(string))
) optionalMap of Athena named queries and parameters
Default value:
{ }
publish_cloudwatch_metrics_enabled
(bool
) optionalBoolean whether Amazon CloudWatch metrics are enabled for the workgroup.
Default value:
true
s3_output_path
(string
) optionalThe S3 bucket path used to store query results.
Default value:
""
workgroup_description
(string
) optionalDescription of the Athena workgroup.
Default value:
""
workgroup_encryption_option
(string
) optionalIndicates whether Amazon S3 server-side encryption with Amazon S3-managed keys (SSE_S3), server-side encryption with KMS-managed keys (SSE_KMS), or client-side encryption with KMS-managed keys (CSE_KMS) is used.
Default value:
"SSE_KMS"
workgroup_force_destroy
(bool
) optionalThe option to delete the workgroup and its contents even if the workgroup contains any named queries.
Default value:
false
workgroup_state
(string
) optionalState of the workgroup. Valid values are
DISABLED
orENABLED
.Default value:
"ENABLED"
Context Variables
The following variables are defined in the context.tf
file of this module and part of the terraform-null-label pattern.
context.tf
file of this module and part of the terraform-null-label pattern.Outputs
data_catalogs
List of newly created Athena data catalogs.
databases
List of newly created Athena databases.
kms_key_arn
ARN of KMS key used by Athena.
named_queries
List of newly created Athena named queries.
s3_bucket_id
ID of S3 bucket used for Athena query results.
workgroup_id
ID of newly created Athena workgroup.
Dependencies
Requirements
terraform
, version:>= 1.0.0
aws
, version:>= 4.0
Providers
aws
, version:>= 4.0
Modules
Name | Version | Source | Description |
---|---|---|---|
account_map | 1.8.0 | cloudposse/stack-config/yaml//modules/remote-state | n/a |
athena | 0.2.1 | cloudposse/athena/aws | n/a |
cloudtrail_bucket | 1.8.0 | cloudposse/stack-config/yaml//modules/remote-state | n/a |
iam_roles | latest | ../account-map/modules/iam-roles | n/a |
this | 0.25.0 | cloudposse/label/null | n/a |
Resources
The following resources are used by this module:
aws_athena_named_query.cloudtrail_query_alter_tables
(resource)aws_athena_named_query.cloudtrail_query_create_tables
(resource)
Data Sources
The following data sources are used by this module:
References
- cloudposse/terraform-aws-components - Cloud Posse's upstream component
- Querying AWS CloudTrail logs with AWS Athena