Data Processing Connection Profiles

The following sections provide information about the parameters of connection profiles for Data Processing platforms and services.

AWS EMR Connection Profile Parameters

The following table describes Control-M for AWS EMR connection profile parameters.

Parameter

Description

Region

Determines the AWS region.

EXAMPLE: us-east-1

EMR Access Key

Defines the token for the connection to AWS.

EMR Service Key

Defines an additional security token for AWS.

Google Dataflow Connection Profile Parameters

The following table describes Google Dataflow connection profile parameters.

Parameter

Description

Identity Type

Defines one of the following types of authentication to perform using GCP Access Control.

  • Service Account – authenticates using an application ID (service account) and client secret

  • Managed Identity – does not require credentials; available on GCP VMs only

Dataflow URL

Defines the Google Cloud Platform (GCP) authentication endpoint.

Required only for Service Account authentication.

https://dataflow.googleapis.com

Service Account Key

Defines a service account that is associated with public/private RSA key pairs.

Required only for Service Account authentication.

Google Dataproc Connection Profile Parameters

The following table describes Google Dataproc connection profile parameters.

Parameter

Description

Identity Type

Defines one of the following types of authentication to perform using GCP Access Control.

  • Service Account – authenticates using an application ID (service account) and client secret

  • Managed Identity – does not require credentials; available on GCP VMs only

Dataproc URL

Defines the Google Cloud Platform (GCP) authentication endpoint.

Required only for Service Account authentication.

For example, https://dataproc.googleapis.com

Service Account Key

Defines a service account that is associated with an RSA key pair.

Required only for Service Account authentication.

Connection Timeout

Defines a timeout value, in seconds, for the trigger call to the Google Cloud Platform.

Default: 20 seconds