Batch Listener

The Batch Listener allows data ingestion from batch sources into a graph. Also, the Batch Listener captures input required to process data from batch sources at a defined frequency.

Creating a Batch Listener

Creating a new Batch Connection is mandatory to create a Batch Listener Adaptor.

To add a Batch Listener Adaptor on any graph:

  1. Click the Graph Options dropdown in the top left side of the window.

  2. Select Add Listener from the dropdown. The Listener Editor window will appear.

Listener Type: Select the Batch Listener Type from the Listener Type dropdown.

Connection: Choose the required batch Connection from the Connection drop-down.

Batch Connections created under the Batch Connection configuration will appear here.

The (Edit) button is next to the connection dropdown.

It allows users to reconfigure, edit or create a new connection.

The Edit button is available for all types of listeners.


Batch Listener Options

Data Format

In data processing, there are different types of file formats to store your data sets. The most popular data formats are Parquet, Delimited, and JSON.

Parquet is an open-source file format that handles flat columnar storage data formats.

Delimited files in which values in the row and columns are separated by configured delimiters (Comma, Tab, Customized delimiter, Pipe).

JSON (JavaScript Object Notation) is an open data format and is widely used by APIs (Application Programming Interface — How systems communicate with each other) and several databases (like MongoDB)


Parsing Inputs

To parse the inputs from the defined files creating the detailed configuration (Compression, Encryption, and File path) is mandatory.

Compression allows you to define the decompression method, while the Encryption method decrypts the data using an algorithm and private key to parse the information.

A Directory refers to the location of the stored files and Path Pattern (Regex) filters the exact defined file.


Path Pattern (Regex)

Regular expressions are mainly helpful for defining the file name available in the directory, which contains a series of characters that define a pattern of text to be matched to make a filter more specific.

By using regular expressions single or multiple files can be filtered out.


Interval

The Interval configuration defines the batch processing interval, and the same will be repeated at the number of times mentioned in the frequency.


Output

The Output configuration allows you to create, modify and define the source schema. The source schema defines the structure and properties of the data source, which can be read from and written to from any node in the graph 

 

Privacy Policy
© 2022 CSG International, Inc.