# Ingesting Data from Files

Learn how to ingest data from file sources such as CSV, Excel, and JSON files on SFTP servers or in cloud storage.
## Overview
File-based ingestion is essential for working with data exports, spreadsheets, and file drops. In this guide, you'll learn how to:
- Create connections to file sources (SFTP, ADLS)
- Configure file ingestion tasks
- Handle column mapping and schema inference
- Work with different file formats
## Prerequisites
- An existing Production Line (see Production Lines)
- Access credentials for your file source (SFTP server, Azure Data Lake, etc.)
- Understanding of your source file format and structure
## Step-by-Step Guide

### 1. Create a file source connection
- Navigate to Build > Connections
- Click New Connection
- Select your file source type:
  - SFTP for secure file transfer servers
  - Azure Data Lake Storage Gen2 for ADLS
  - Azure Blob Storage for blob containers
- Enter your connection details and credentials
- Test and save the connection
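Connection details vary by source type. As an illustration only (field names here are hypothetical, not this platform's exact property names), an SFTP connection typically comes down to a handful of settings:

```json
{
  "type": "sftp",
  "host": "sftp.example.com",
  "port": 22,
  "username": "ingest_user",
  "auth": { "method": "ssh_key", "privateKeyRef": "secret://sftp-key" },
  "rootPath": "/exports/daily"
}
```

Prefer key-based authentication and store credentials in your platform's secret store rather than inline.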
### 2. Create a file ingestion task
- Open your production line and navigate to the Graph view
- Add a new task using one of these methods:
  - Click the + button in the graph side menu
  - Right-click an existing node and select Add Task from the context menu
- Enter a unique Code and Name for your task
- Select the appropriate ingestion activity from the Activity dropdown:
  - "Ingest Delimited File to Lakehouse" for CSV files
  - "Ingest Excel Worksheet to Lakehouse" for Excel files
  - "Ingest JSON File to Lakehouse" for JSON files
- Configure the task properties
### 3. Configure file settings
Depending on your file format, you may need to configure:
**For Delimited Files (CSV):**
- Column delimiter (comma, tab, pipe, etc.)
- Quote character
- Header row settings
- Encoding
**For Excel Files:**
- Worksheet name or index
- Header row settings
- Data range
**For JSON Files:**
- JSON path expression
- Array handling
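These settings map directly onto what any parser needs. As an illustration in plain Python (not this platform's API), here is how the delimiter, quote character, header row, and a JSON array path come together when reading files:

```python
import csv
import io
import json

# Delimited file: pipe delimiter, quoted fields, first row is the header.
raw = 'id|name|city\n1|"Doe, Jane"|Berlin\n2|"Roe, Rich"|Lisbon\n'
reader = csv.reader(io.StringIO(raw), delimiter="|", quotechar='"')
header = next(reader)            # header row setting: consume one row, use as column names
rows = [dict(zip(header, r)) for r in reader]
print(rows[0])                   # {'id': '1', 'name': 'Doe, Jane', 'city': 'Berlin'}

# JSON file: records nested under a path, array handling = one row per element.
doc = json.loads('{"data": {"orders": [{"id": 1}, {"id": 2}]}}')
records = doc["data"]["orders"]  # equivalent of a JSON path like $.data.orders
print(len(records))              # 2
```

Getting the encoding wrong typically surfaces as garbled characters rather than a hard error, so it is worth checking a sample of ingested text values explicitly.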
### 4. Set up column mapping
- Review the inferred schema
- Adjust column names if needed
- Set appropriate data types
- Add any computed columns
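Conceptually, column mapping is a rename, a type cast, and optionally a derived value per row. A minimal sketch in plain Python (the mapping spec format below is hypothetical, not this platform's):

```python
from datetime import date

# Hypothetical mapping spec: source column -> (destination name, cast function)
mapping = {
    "id":       ("order_id", int),
    "amt":      ("amount", float),
    "ord_date": ("order_date", date.fromisoformat),
}

def map_row(src: dict) -> dict:
    """Apply renames and casts, then add computed columns."""
    dst = {dest: cast(src[col]) for col, (dest, cast) in mapping.items()}
    # Computed column: derived from already-mapped values.
    dst["amount_cents"] = round(dst["amount"] * 100)
    return dst

row = map_row({"id": "7", "amt": "19.99", "ord_date": "2024-05-01"})
print(row["order_id"], row["amount_cents"])  # 7 1999
```

Casting during mapping is also where bad source values first surface, so this is a natural place to decide whether malformed rows should fail the run or be quarantined.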
### 5. Run and verify
- Save your task configuration
- Run the ingestion task
- Verify the data in your Lakehouse
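Verification usually boils down to a couple of sanity checks: the row count matches the source, and key columns are populated. An illustrative helper (plain Python, not a platform feature):

```python
def verify_ingestion(rows, expected_count, key_column):
    """Basic post-ingestion sanity checks: row count and non-empty key column."""
    assert len(rows) == expected_count, (
        f"expected {expected_count} rows, got {len(rows)}"
    )
    missing = [i for i, r in enumerate(rows) if not r.get(key_column)]
    assert not missing, f"rows with empty {key_column}: {missing}"
    return True

# Example: rows as ingested dictionaries
ingested = [{"id": "1", "name": "a"}, {"id": "2", "name": "b"}]
print(verify_ingestion(ingested, expected_count=2, key_column="id"))  # True
```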
## Key Concepts
| Term | Definition |
|---|---|
| Schema Inference | Automatic detection of column names and data types from file structure |
| Column Mapping | The process of defining how source columns map to destination columns |
| Delimited File | A text file where columns are separated by a specific character |
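Schema inference can be illustrated in a few lines: try progressively narrower types against every sample value and keep the narrowest one that fits all of them. This is a simplified sketch (real engines also sample booleans, dates, and nullability):

```python
def infer_type(values):
    """Return the narrowest of int/float/string that fits all sample values."""
    for cast, name in ((int, "int"), (float, "float")):
        try:
            for v in values:
                cast(v)
            return name
        except ValueError:
            continue
    return "string"

sample = {
    "id": ["1", "2", "3"],
    "price": ["9.5", "12", "0.99"],
    "city": ["Berlin", "Lisbon"],
}
schema = {col: infer_type(vals) for col, vals in sample.items()}
print(schema)  # {'id': 'int', 'price': 'float', 'city': 'string'}
```

Because inference works from a sample, always review the inferred schema (step 4 above): a column that happens to hold only digits in the sample may still be a string identifier, such as a zip code with leading zeros.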