Skip to main content

mwaa

This component provisions Amazon managed workflows for Apache Airflow.

The s3 bucket dag_bucket stores DAGs to be executed by MWAA.

Access Modes

Public

Allows the Airflow UI to be access over the public internet to users granted access by an IAM policy.

Private

Limits access to users within the VPC to users granted access by an IAM policy.

  • MWAA creates a VPC interface endpoint for the Airflow webserver and an interface endpoint for the pgsql metadatabase.
    • the endpoints are created in the AZs mapped to your private subnets
  • MWAA binds an IP address from your private subnet to the interface endpoint

Managing access to VPC endpoings on MWAA

MWAA creates a VPC endpoint in each of the private subnets.

Usage

Stack Level: Regional

Here's an example snippet for how to use this component.

components:
terraform:
mwaa:
vars:
enabled: true
name: app
dag_processing_logs_enabled: true
dag_processing_logs_level: INFO
environment_class: mw1.small
airflow_version: 2.0.2

Variables

Required Variables

region (string) required

AWS Region

Optional Variables

airflow_configuration_options (map(string)) optional

The Airflow override options


Default value: { }

airflow_version (string) optional

Airflow version of the MWAA environment, will be set by default to the latest version that MWAA supports.


Default value: ""

allow_ingress_from_vpc_stages (list(string)) optional

List of stages to pull VPC ingress cidr and add to security group


Default value:

[
"auto",
"corp"
]
allowed_cidr_blocks (list(string)) optional

List of CIDR blocks to be allowed to connect to the MWAA cluster


Default value: [ ]

allowed_security_groups (list(string)) optional

A list of IDs of Security Groups to allow access to the security group created by this module.


Default value: [ ]

allowed_web_access_role_arns (list(string)) optional

List of role ARNs to allow airflow web access


Default value: [ ]

allowed_web_access_role_names (list(string)) optional

List of role names to allow airflow web access


Default value: [ ]

create_iam_role (bool) optional

Enabling or disabling the creation of a default IAM Role for AWS MWAA


Default value: true

create_s3_bucket (bool) optional

Enabling or disabling the creation of an S3 bucket for AWS MWAA


Default value: true

dag_processing_logs_enabled (bool) optional

Enabling or disabling the collection of logs for processing DAGs


Default value: false

dag_processing_logs_level (string) optional

DAG processing logging level. Valid values: CRITICAL, ERROR, WARNING, INFO, DEBUG


Default value: "INFO"

dag_s3_path (string) optional

Path to dags in s3


Default value: "dags"

environment_class (string) optional

Environment class for the cluster. Possible options are mw1.small, mw1.medium, mw1.large.


Default value: "mw1.small"

execution_role_arn (string) optional

If create_iam_role is false then set this to the target MWAA execution role


Default value: ""

max_workers (number) optional

The maximum number of workers that can be automatically scaled up. Value need to be between 1 and 25.


Default value: 10

min_workers (number) optional

The minimum number of workers that you want to run in your environment.


Default value: 1

plugins_s3_object_version (string) optional

The plugins.zip file version you want to use.


Default value: null

plugins_s3_path (string) optional

The relative path to the plugins.zip file on your Amazon S3 storage bucket. For example, plugins.zip. If a relative path is provided in the request, then plugins_s3_object_version is required


Default value: null

requirements_s3_object_version (string) optional

The requirements.txt file version you


Default value: null

requirements_s3_path (string) optional

The relative path to the requirements.txt file on your Amazon S3 storage bucket. For example, requirements.txt. If a relative path is provided in the request, then requirements_s3_object_version is required


Default value: null

scheduler_logs_enabled (bool) optional

Enabling or disabling the collection of logs for the schedulers


Default value: false

scheduler_logs_level (string) optional

Schedulers logging level. Valid values: CRITICAL, ERROR, WARNING, INFO, DEBUG


Default value: "INFO"

source_bucket_arn (string) optional

Set this to the Amazon Resource Name (ARN) of your Amazon S3 storage bucket.


Default value: null

task_logs_enabled (bool) optional

Enabling or disabling the collection of logs for DAG tasks


Default value: false

task_logs_level (string) optional

DAG tasks logging level. Valid values: CRITICAL, ERROR, WARNING, INFO, DEBUG


Default value: "INFO"

webserver_access_mode (string) optional

Specifies whether the webserver is accessible over the internet, PUBLIC_ONLY or PRIVATE_ONLY


Default value: "PRIVATE_ONLY"

webserver_logs_enabled (bool) optional

Enabling or disabling the collection of logs for the webservers


Default value: false

webserver_logs_level (string) optional

Webserver logging level. Valid values: CRITICAL, ERROR, WARNING, INFO, DEBUG


Default value: "INFO"

weekly_maintenance_window_start (string) optional

Specifies the start date for the weekly maintenance window.


Default value: null

worker_logs_enabled (bool) optional

Enabling or disabling the collection of logs for the workers


Default value: false

worker_logs_level (string) optional

Workers logging level. Valid values: CRITICAL, ERROR, WARNING, INFO, DEBUG


Default value: "INFO"

Context Variables

The following variables are defined in the context.tf file of this module and part of the terraform-null-label pattern.

additional_tag_map (map(string)) optional

Additional key-value pairs to add to each map in tags_as_list_of_maps. Not added to tags or id.
This is for some rare cases where resources want additional configuration of tags
and therefore take a list of maps with tag key, value, and additional configuration.


Required: No

Default value: { }

attributes (list(string)) optional

ID element. Additional attributes (e.g. workers or cluster) to add to id,
in the order they appear in the list. New attributes are appended to the
end of the list. The elements of the list are joined by the delimiter
and treated as a single ID element.


Required: No

Default value: [ ]

context (any) optional

Single object for setting entire context at once.
See description of individual variables for details.
Leave string and numeric variables as null to use default value.
Individual variable settings (non-null) override settings in context object,
except for attributes, tags, and additional_tag_map, which are merged.


Required: No

Default value:

{
"additional_tag_map": {},
"attributes": [],
"delimiter": null,
"descriptor_formats": {},
"enabled": true,
"environment": null,
"id_length_limit": null,
"label_key_case": null,
"label_order": [],
"label_value_case": null,
"labels_as_tags": [
"unset"
],
"name": null,
"namespace": null,
"regex_replace_chars": null,
"stage": null,
"tags": {},
"tenant": null
}
delimiter (string) optional

Delimiter to be used between ID elements.
Defaults to - (hyphen). Set to "" to use no delimiter at all.


Required: No

Default value: null

descriptor_formats (any) optional

Describe additional descriptors to be output in the descriptors output map.
Map of maps. Keys are names of descriptors. Values are maps of the form
\{<br/> format = string<br/> labels = list(string)<br/> \}
(Type is any so the map values can later be enhanced to provide additional options.)
format is a Terraform format string to be passed to the format() function.
labels is a list of labels, in order, to pass to format() function.
Label values will be normalized before being passed to format() so they will be
identical to how they appear in id.
Default is {} (descriptors output will be empty).


Required: No

Default value: { }

enabled (bool) optional

Set to false to prevent the module from creating any resources
Required: No

Default value: null

environment (string) optional

ID element. Usually used for region e.g. 'uw2', 'us-west-2', OR role 'prod', 'staging', 'dev', 'UAT'
Required: No

Default value: null

id_length_limit (number) optional

Limit id to this many characters (minimum 6).
Set to 0 for unlimited length.
Set to null for keep the existing setting, which defaults to 0.
Does not affect id_full.


Required: No

Default value: null

label_key_case (string) optional

Controls the letter case of the tags keys (label names) for tags generated by this module.
Does not affect keys of tags passed in via the tags input.
Possible values: lower, title, upper.
Default value: title.


Required: No

Default value: null

label_order (list(string)) optional

The order in which the labels (ID elements) appear in the id.
Defaults to ["namespace", "environment", "stage", "name", "attributes"].
You can omit any of the 6 labels ("tenant" is the 6th), but at least one must be present.


Required: No

Default value: null

label_value_case (string) optional

Controls the letter case of ID elements (labels) as included in id,
set as tag values, and output by this module individually.
Does not affect values of tags passed in via the tags input.
Possible values: lower, title, upper and none (no transformation).
Set this to title and set delimiter to "" to yield Pascal Case IDs.
Default value: lower.


Required: No

Default value: null

labels_as_tags (set(string)) optional

Set of labels (ID elements) to include as tags in the tags output.
Default is to include all labels.
Tags with empty values will not be included in the tags output.
Set to [] to suppress all generated tags.
Notes:
The value of the name tag, if included, will be the id, not the name.
Unlike other null-label inputs, the initial setting of labels_as_tags cannot be
changed in later chained modules. Attempts to change it will be silently ignored.


Required: No

Default value:

[
"default"
]
name (string) optional

ID element. Usually the component or solution name, e.g. 'app' or 'jenkins'.
This is the only ID element not also included as a tag.
The "name" tag is set to the full id string. There is no tag with the value of the name input.


Required: No

Default value: null

namespace (string) optional

ID element. Usually an abbreviation of your organization name, e.g. 'eg' or 'cp', to help ensure generated IDs are globally unique
Required: No

Default value: null

regex_replace_chars (string) optional

Terraform regular expression (regex) string.
Characters matching the regex will be removed from the ID elements.
If not set, "/[^a-zA-Z0-9-]/" is used to remove all characters other than hyphens, letters and digits.


Required: No

Default value: null

stage (string) optional

ID element. Usually used to indicate role, e.g. 'prod', 'staging', 'source', 'build', 'test', 'deploy', 'release'
Required: No

Default value: null

tags (map(string)) optional

Additional tags (e.g. {'BusinessUnit': 'XYZ'}).
Neither the tag keys nor the tag values will be modified by this module.


Required: No

Default value: { }

tenant (string) optional

ID element (Rarely used, not included by default). A customer identifier, indicating who this instance of a resource is for
Required: No

Default value: null

Outputs

arn

ARN of MWAA environment.

created_at

The Created At date of the Amazon MWAA Environment

execution_role_arn

IAM Role ARN for Amazon MWAA Execution Role

logging_configuration

The Logging Configuration of the MWAA Environment

s3_bucket_arn

ID of S3 bucket.

security_group_id

ID of the MWAA Security Group(s)

service_role_arn

The Service Role ARN of the Amazon MWAA Environment

status

The status of the Amazon MWAA Environment

tags_all

A map of tags assigned to the resource, including those inherited from the provider for the Amazon MWAA Environment

webserver_url

The webserver URL of the Amazon MWAA Environment

Dependencies

Requirements

  • terraform, version: >= 1.0.0
  • aws, version: >= 4.0

Providers

  • aws, version: >= 4.0

Modules

NameVersionSourceDescription
iam_policy0.4.0cloudposse/iam-policy/awsn/a
iam_roleslatest../account-map/modules/iam-rolesn/a
mwaa_environment0.4.8cloudposse/mwaa/awsn/a
this0.25.0cloudposse/label/nulln/a
vpc1.5.0cloudposse/stack-config/yaml//modules/remote-staten/a
vpc_ingress1.5.0cloudposse/stack-config/yaml//modules/remote-staten/a

Resources

The following resources are used by this module:

Data Sources

The following data sources are used by this module:

References