Pipelines reference - Preparers

This section provides reference documentation for Pipelines Preparers. It includes information on the functions and views available in the aidb extension related to Preparers. See the Usage and Examples sections for more details.

Views

aidb.preparers

ColumnTypeDescription
idINTEGER
nameTEXTName of the preparer.
operationaidb.DataPreparationOperationThe kind of processing step that will be performed.
destination_schemaTEXTSchema of the destination table where the output data will be stored.
destination_tableTEXTName of the destination table where the output data will be stored.
destination_key_columnTEXTColumn of the destination table that references the key in source data.
destination_data_columnTEXTColumn of the destination table where the output data will be stored.
optionsJSONBConfiguration options for the data preparation operation. Utilizes the same API as the data preparation primitives.
source_typeTEXTType of source data the preparer is working with. Can be either 'Table' or 'Volume'.
source_schemaTEXTSchema of the table with the source data the preparer will process. Only applicable for preparers of Table source type.
source_tableTEXTName of the table with the source data the preparer will process. Only applicable for preparers of Table source type.
source_data_columnTEXTColumn in the source table with the source data the preparer will process. Only applicable for preparers of Table source type.
source_key_columnTEXTName of the key column in the source table for reference with the output processed data. Only applicable for preparers of Table source type.
source_volume_nameTEXTName of the volume to use as a data source. Only applicable for preparers of Volume source type.

Types

aidb.DataPreparationOperation

The aidb.DataPreparationOperation type is an enum that represents the different types of pre-processing steps that can be performed.

  • ChunkText
  • SummarizeText
  • ParseHtml
  • ParsePdf

Functions

aidb.create_preparer_for_table

Creates a preparer with a source data table.

Parameters

ParameterTypeDefaultDescription
nameTEXTRequiredName of the preparer
operationaidb.DataPreparationOperationRequiredType of data preparation operation
source_tableTEXTRequiredName of the source data table
source_data_columnTEXTRequiredColumn in the source table containing the raw data
destination_tableTEXTRequiredName of the destination table
destination_data_columnTEXTRequiredColumn in the destination table for processed data
source_key_columnTEXT'id'Column to use as key to reference the rows
destination_key_columnTEXT'id'Key column in the destination table that references the source_key_column
optionsJSONB'{}'::JSONBConfiguration options for the data preparation operation. Utilizes the same API as the data preparation primitives.

aidb.create_preparer_for_volume

Creates a preparer for a given PGFS volume.

Parameters

ParameterTypeDefaultDescription
nameTEXTRequiredName of the preparer
operationaidb.DataPreparationOperationRequiredType of data preparation operation
source_volume_nameTEXTRequiredName of the source volume containing the raw data
destination_tableTEXTRequiredName of the destination table
destination_data_columnTEXTRequiredColumn in the destination table for processed data
destination_key_columnTEXT'id'Key column in the destination table that uniquely identifies the processed data
optionsJSONB'{}'::JSONBConfiguration options for the data preparation operation. Utilizes the same API as the data preparation primitives.

aidb.bulk_data_preparation

Executes the configured data preparation operation on all data from the specified preparer’s source.

Parameters

ParameterTypeDescription
preparer_nameTEXTName of the preparer.

aidb.delete_preparer

Deletes the preparer's configuration.

Parameters

ParameterTypeDescription
preparer_nameTEXTName of preparer to delete.
Note

This does not delete the destination table or any data in it.


Could this page be better? Report a problem or suggest an addition!