File Format: Allows you to configure the file format and source schema. Select the required file format from the dropdown.
Info |
---|
Once the File Format is selected, all the related configurations will appear. |
Parquet
Expand | ||
---|---|---|
| ||
Delimited
Expand | ||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||||||||||||
Header Row: Allows to specify that the file's first row data must be considered a header row. Data available in the header row separated by delimiter are column names. Delimiter: Delimiter is a character separator that separates the values stored in rows. Allows to define the predefined and custom delimiter for the data. Select the required delimiter from the dropdown.
Custom Delimiter: Allows to define a new custom delimiter for the data. Selection of Custom from the Delimiter dropdown is mandatory to define the custom delimiter.
The use of escape character minimises the need of switching quotation marks for enclosing the strings, which contains special punctuation marks. It also avoids the delimiter collision.
A text qualifier is a character used at the beginning and end of a field value.
|
JSON
Expand | ||||||
---|---|---|---|---|---|---|
| ||||||
Supported Structure
This is a widely used format for data ingestion. Not Supported Structure
|
Upload: Click the Upload button to upload the data file. Select the required file needed to upload and click Upload.
Note |
---|
The selected file size must be less than 2MB. We recommend that the user must avoid uploading the Gzip file unless it has been decrypted. |
Info |
---|
The View Batch Schema button gets enabled after uploading the selected file. |
View Batch Schema: Allows you to view and modify the selection of source schema.
Source Data Format
All the available parent column names in the uploaded data file will appear here.
Filter: It allows you to search and filter the specific columns here. Scroll down manually to see the entire list.
Column list: As required, you can select or deselect the required columns of the left-hand side drawer.
By default, all the columns available in the source data format are selected.
Selection of atleast one column is mandatory to save the schema in the batch listener configuration.
In case of nested data structures, only the parent column is displayed on the left side.
JSON: JSON schema will update dynamically when columns/keys are selected or deselected from the left-hand side drawer. It displays the standardized schema structure that needs to be mapped in the project schema.
Users can only view the standardized schema structure.
After changes, copy the created entire schema and paste it into the defined project schema at the right-hand side drawer.
Click Save Changes to save the configuration.