Ingest File Server Binary File And Deflate
Copy and Deflate binary file from file server path to Azure Data Lake Storage Gen2.
Category: Ingest to Lakehouse | Tags: Ingestion
Ingest file '<<filename>>' from File Server into Data Lake location 'raw/<<DataLakeSystemFolder>>/<<DataLakeDatasetFolder>>'
To use this activity within the API, use an ActivityCode of FILESERVER-BINARY-DEFLATE-ADLS.
Available Connections
SourceConnection:
TargetConnection:
Example JSON
An example of what the Task Config would look like for a task using this activity. Some of these variables would be set at the group level to avoid duplication between tasks.
{
"SourceConnection": "MY-SOURCE-CONN",
"Filename": "",
"CompressionType": "bzip2",
"CompressionLevel": "Optimal",
"DataLakeSystemFolder": "my_folder",
"DataLakeDatasetFolder": "data",
"TargetConnection": "MY-TARGET-CONN"
}
Variable Reference
The following variables are supported:
CompressionLevel(Required) - What compression level to uncompress the file via.CompressionType(Required) - Compression type to useDataLakeDatasetFolder(Required) - Name of the folder in the Data Lake containing the dataset.DataLakeSystemFolder(Required) - Name of the parent (System) folder in the Data Lake containing the dataset.DeleteFileFromSourceAfterCopying(Optional) - Should the source file be deleted once it has been successfully copied to its destination?DIUsToUseForCopyActivity(Optional) - Specifies the powerfulness of the copy executor. Value can be between 2 and 256. When left at default, the Data Factory dynamically applies the optimal DIU setting based on the source-sink pair and data pattern.ExtractControlVariableName(Optional) - For incremental loads only, the name to assign the Extract Control variable in State Config for the ExtractControl value derived from the Extract Control Query above.ExtractControlVariableSeedValue(Optional) - The initial value to set for the Extract Control variable in State Config - this will have no impact beyond the original seeding of the Extract Control variable in State Config.FailIfFileNotExists(Optional) - Should the Task fail if the file isn't found. If set to true, the Task will retry until the file arrives (or the Task reaches the maximum retry threshold).Filename(Required) - Filename to ingest. Can be wildcarded.IsFederated(Optional) - Makes task available to other Insight Factories within this organisation.LastModifiedHours(Optional) - Only files with a last modified time greater than the current UTC time minus Last Modified Hours will be returned. If empty, no time based filter will be applied.Links(Optional) - NULLMaximumNumberOfAttemptsAllowed(Optional) - The total number of times the running of this Task can be attempted.MinutesToWaitBeforeNextAttempt(Optional) - If a Task run fails, the number of minutes to wait before re-attempting the Task.OverrideFilePathWithDynamicValue(Optional) - Call into 'Override File Path' Pipeline to override values of either or both of RelativeFilePath and Filename.RelativeFilePath(Optional) - Relative path from the File Server Root folder.RetainHistory(Optional) - Should the raw files be saved to the History Container to preserve them?Show more details
**Retain History? ** By default, this flag is set to the value assigned in the Configuration item SaveRawFilesToHistory (signalled by the double triangle brackets around the Configuration item name e.g. <<SaveRawFilesToHistory>>). This default behaviour can be overridden here.
SourceConnection(Required) - Source connection to use.TargetConnection(Optional) - Target connection to use.