Create a connection in Anaplan Data Orchestrator to import data from Amazon S3. Then use the connection to extract data and create a source dataset.

You will need your S3 credentials to connect your S3 data with Data Orchestrator. View the S3 documentation for more information about your credentials.

Your CSV files must be uploaded to S3 before you can create a connection.

To create a connection:

  1. Select Data Orchestrator from the top-left navigation menu.
  2. Select Connections on the left-side panel.
  3. Select Create connection.
  4. On the Create connections page, select S3 and then select Next.
    If you can't find the connector, enter a search term in the Find... field.
  5. On the Connection details page, enter these details and select Next:
    • Name: Create a name for your connection. The name can contain alphanumeric characters and underscores.
    • Description: Enter a description about your connection.
  6. On the Connection Credentials page, enter your S3 credentials and select Next:
    • AWS Key: The access key ID (for example, AKIAIOSFODNN7EXAMPLE). 
    • AWS Secret Key: The secret access key (for example, wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY).
    • Bucket: The S3 bucket where your data is stored (for example, S3-EXAMPLE-BUCKET).
  7. After the connection test is complete, select Done.
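Before you enter the credentials in step 6, it can help to sanity-check their shape. The sketch below is an illustration only, based on the common long-term key format (access key IDs are typically 20 uppercase alphanumeric characters starting with AKIA, and secret keys are 40 base64-style characters); it is an assumption, not an official AWS guarantee, and a well-formed key is not necessarily a valid one.

```python
import re

# Assumed formats for long-term AWS credentials (not an official guarantee):
# access key ID: 20 uppercase alphanumeric characters (often starting AKIA)
# secret key:    40 base64-style characters
ACCESS_KEY_RE = re.compile(r"^[A-Z0-9]{20}$")
SECRET_KEY_RE = re.compile(r"^[A-Za-z0-9/+=]{40}$")

def looks_like_credentials(access_key: str, secret_key: str) -> bool:
    """Return True if both values match the assumed credential shapes."""
    return bool(ACCESS_KEY_RE.match(access_key) and SECRET_KEY_RE.match(secret_key))

# The AWS documentation's example key pair passes the check:
print(looks_like_credentials("AKIAIOSFODNN7EXAMPLE",
                             "wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY"))
```

A check like this catches copy-paste truncation before the connection test fails, but the connection test in step 7 remains the real validation.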

You can extract data from the Amazon S3 connection to add source data to Data Orchestrator. The data extract creates a source dataset.

To extract data:

  1. Select Data Orchestrator from the top-left navigation menu.
  2. Select Source data from the left-side panel.
  3. Select Add data > From connection.
  4. On the Dataset details page, enter these details and select Next:
    • Connection
    • Dataset name
    • Description
    • Path Name (see additional information below)
    • Column Separator
    • Text Delimiter
    • Header Row
    • First Data Row
  5. On the Choose an upload type page, complete these steps and select Next:
    1. Select the Load type:
      • Full replace: Completely replaces the current loaded data with the new data.
      • Append: Adds the new data to the end of the current table.
      • Incremental: Updates the previously loaded data with only the rows that have changed.
    2. Select the columns to import.

 Notes: 

  • The _ab fields are added by Data Orchestrator and aren't user data. Data Orchestrator uses them as cursor keys to identify what's changed; the values aren't used for a Full replace.
  • If you selected Incremental as the load type (partial replace): 
    • Select a Primary key checkbox. You can select more than one checkbox.
    • The Cursor Field is preselected.
  • If you selected Append as the load type:
    • The Cursor Field is preselected, and is based on the last update date of the file.

  6. Select Create in the confirmation dialog.
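The three load types can be sketched conceptually as operations on lists of rows. This is a simplification for illustration only, not Data Orchestrator's actual implementation; the function names and the `_ab_updated_at` cursor field are hypothetical.

```python
def full_replace(current, new):
    # Full replace: the new data completely replaces the loaded data.
    return list(new)

def append(current, new):
    # Append: the new rows are added to the end of the current table.
    return current + new

def incremental(current, new, key="id", cursor="_ab_updated_at"):
    # Incremental: upsert by primary key, keeping whichever row has the
    # newer cursor value. "_ab_updated_at" is a hypothetical cursor field
    # standing in for the _ab fields Data Orchestrator adds.
    merged = {row[key]: row for row in current}
    for row in new:
        existing = merged.get(row[key])
        if existing is None or row[cursor] >= existing[cursor]:
            merged[row[key]] = row
    return list(merged.values())

current = [{"id": 1, "qty": 5, "_ab_updated_at": 1},
           {"id": 2, "qty": 3, "_ab_updated_at": 1}]
new = [{"id": 2, "qty": 9, "_ab_updated_at": 2},
       {"id": 3, "qty": 1, "_ab_updated_at": 2}]

print(len(full_replace(current, new)))   # 2 rows: only the new data
print(len(append(current, new)))         # 4 rows: old data plus new data
print(len(incremental(current, new)))    # 3 rows: id 2 updated, id 3 added
```

The sketch shows why Incremental needs a primary key (to match new rows against loaded rows) and a cursor field (to decide which version wins), while Full replace and Append need neither.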

When you extract data from S3 connections, you are asked to enter the Path name. If the bucket includes files with the same file name pattern, you can enter a wildcard pattern such as *.CSV to upload all the matching files.

For example, your S3 bucket is called SALES_DATA, and you have files called SALES_wk01.CSV, SALES_wk02.CSV, and SALES_wk03.CSV. If you enter SALES_wk*.CSV for the Path name, all three files are uploaded to Data Orchestrator. 
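Glob-style matching of the kind the Path name wildcard performs can be illustrated with Python's fnmatch module. This is an illustration only; Data Orchestrator's exact matching rules may differ, and the object keys below are hypothetical.

```python
from fnmatch import fnmatchcase

# Hypothetical object keys in the SALES_DATA bucket.
keys = ["SALES_wk01.CSV", "SALES_wk02.CSV", "SALES_wk03.CSV", "inventory.CSV"]

# A wildcard Path name: * matches any run of characters within the name.
# fnmatchcase keeps the comparison case-sensitive, matching S3's
# case-sensitive object keys.
pattern = "SALES_wk*.CSV"
matched = [k for k in keys if fnmatchcase(k, pattern)]
print(matched)  # ['SALES_wk01.CSV', 'SALES_wk02.CSV', 'SALES_wk03.CSV']
```

Note that S3 object keys are case-sensitive, so a pattern like sales_wk*.csv would not match these files.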

If you add more files to your bucket later with the same file name pattern, you can sync the data to upload the new files.

For more information, see this community article: Setup your ADO Demo using an AWS S3 Connection