Batch: Google Cloud Storage (GCS)

The Banyan data lake is built on Google Cloud Storage (GCS). Each Financial Services Provider brought onto the network will be provisioned its own GCS bucket(s), which only that provider will be allowed to access. The data within the bucket(s) will be encrypted at rest.

Each bucket will contain three folders: Input, Historical, and Error. Once data is loaded into the Input folder, Banyan kicks off an automated ETL process and the data is moved into the Historical folder. The data is not transformed in any way by this move; its presence in the Historical folder simply indicates that it has been processed. If data anomalies are found, the affected records will be written to files and placed in the Error folder for examination. Financial Services Providers will have read/write access to the Input folder, read-only access to the Error folder, and no access to the Historical folder.
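As an illustration of loading data into the Input folder, here is a minimal sketch using the google-cloud-storage Python client. The "Input/" prefix, the helper names upload_to_input and make_bucket, and the use of a JSON key file for the service account are assumptions made for this example, not confirmed details of the Banyan setup; use the bucket name, credentials, and folder layout you were provisioned with.

```python
from pathlib import Path

# Assumed folder prefix; the exact casing and layout may differ in your bucket.
INPUT_PREFIX = "Input/"


def upload_to_input(bucket, local_path, prefix=INPUT_PREFIX):
    """Upload one local file under the Input prefix and return its object name.

    `bucket` is expected to behave like google.cloud.storage.Bucket.
    """
    object_name = prefix + Path(local_path).name
    blob = bucket.blob(object_name)
    blob.upload_from_filename(str(local_path))
    return object_name


def make_bucket(bucket_name, key_file):
    """Build a Bucket handle from a service account key file (hypothetical helper)."""
    # Imported lazily so upload_to_input can be exercised without the library installed.
    from google.cloud import storage

    client = storage.Client.from_service_account_json(key_file)
    return client.bucket(bucket_name)
```

A call such as upload_to_input(make_bucket("your-bucket-name", "service-account.json"), "transactions.csv") would write the file to Input/transactions.csv, assuming the service account has write access to that folder. Both arguments to make_bucket are placeholders.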

📘

In addition to the Input workstream, Financial Services Providers can also opt to receive their Enriched transactions via the same Google Cloud Storage data lake. A separate Output folder will be accessible using the same service account provisioned for writing data to the Input folder. See here for more information about how to retrieve and interpret the data in your Output folder.
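To check what is currently available in a folder such as Output (or Error), something like the following sketch could be used with the google-cloud-storage Python client. The "Output/" prefix and the helper name list_folder are assumptions for illustration; the actual folder layout and file naming are described in the Output folder documentation.

```python
def list_folder(client, bucket_name, prefix="Output/"):
    """Return the names of objects under a folder prefix.

    `client` is expected to behave like google.cloud.storage.Client.
    Zero-byte "folder" placeholder objects (names ending in "/") are skipped.
    """
    return [
        blob.name
        for blob in client.list_blobs(bucket_name, prefix=prefix)
        if not blob.name.endswith("/")
    ]
```

With a real client, created for example via storage.Client.from_service_account_json with the same service account key used for the Input folder, each returned name can then be downloaded with bucket.blob(name).download_to_filename(local_path).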

Folder Structure and File Naming Formats

