The Data Recipe app gives you a visual way to cleanse and prepare data for a model or another task in a Pipeline. It also lets you visualise data with EDA (exploratory data analysis) statistics and graphs.
Creating a Data Recipe
To create a data recipe, click ‘Create New’ and provide a name and an optional description. You can also view the list of recipes created along with their status, and preview a recipe’s DAG and its meta info.
Deleting a Data Recipe
The Delete option in the data recipe options menu deletes the data recipe from the Project.
Building a Data Recipe
- To build a data recipe, drag Blocks from the left pane onto the canvas and connect them. The following types of Blocks are available:
  - Data Connectors:
    - Source Connectors - Blocks to pull data from a source
    - Target Connectors - Blocks to push data into a target
  - Row Blocks
  - Table Blocks
  - Aggregate Blocks
- Each recipe block can be configured with the required parameters in the right pane, which opens when you click on the block.
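Conceptually, a recipe is a chain of connected blocks in which each block consumes the output of the previous one. The Python sketch below illustrates only that idea. It is not the platform's API: every function name in it (`source_block`, `row_block`, `aggregate_block`, `target_block`, `run_recipe`) is hypothetical, and it assumes that a row block works on individual rows while an aggregate block summarises groups of rows.

```python
from collections import defaultdict

# Hypothetical illustration only: none of these names come from the platform.

def source_block():
    """Source connector: pulls rows from a source (here, an in-memory list)."""
    return [
        {"city": "Pune", "sales": 120},
        {"city": "Pune", "sales": None},   # incomplete row
        {"city": "Delhi", "sales": 80},
        {"city": "Delhi", "sales": 40},
    ]

def row_block(rows):
    """Row block (assumed behaviour): drops rows whose 'sales' is missing."""
    return [r for r in rows if r["sales"] is not None]

def aggregate_block(rows):
    """Aggregate block (assumed behaviour): sums 'sales' per city."""
    totals = defaultdict(int)
    for r in rows:
        totals[r["city"]] += r["sales"]
    return [{"city": c, "total_sales": t} for c, t in sorted(totals.items())]

def target_block(rows, sink):
    """Target connector: pushes rows into a target (here, a plain list)."""
    sink.extend(rows)
    return rows

def run_recipe():
    """Run the blocks in the order in which they are connected."""
    sink = []
    rows = source_block()
    rows = row_block(rows)
    rows = aggregate_block(rows)
    target_block(rows, sink)
    return sink

result = run_recipe()
```

Here `result` holds one row per city with its total sales, which corresponds to what the recipe would hand to its target connector.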
Preview and Logs of your Data Recipe
Click the Test button in the recipe builder to run the recipe on the source file. The Preview section shows the data after the recipe has been applied, and the Logs section shows the corresponding run logs.
EDA on Source Data
The Platform provides EDA on your data based on your source connector. On the EDA screen, select a few columns and click Start to generate the EDA. The Platform analyses your data and presents a single view with graphical visualisations of the complete data, using a wide range of graphs such as Histogram, Boxplot, Summary, Statistics, Dendrogram, Correlation matrix and Scatter plot matrix.
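To give a sense of what such an analysis computes, the sketch below derives summary statistics and a correlation matrix for two selected columns using only the Python standard library. It is an illustration under assumptions, not the platform's implementation: the sample data, the column names and the helper functions `summarize` and `pearson` are all hypothetical.

```python
import statistics

# Hypothetical sample data; the column names are illustrative only.
data = {
    "age":    [23, 35, 41, 29, 52],
    "income": [30, 52, 61, 44, 80],
}

def summarize(values):
    """Return basic summary statistics for one numeric column."""
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "max": max(values),
    }

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length columns."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

# One summary per selected column.
summary = {col: summarize(vals) for col, vals in data.items()}

# Pairwise correlations between all selected columns.
columns = list(data)
correlation_matrix = {
    a: {b: pearson(data[a], data[b]) for b in columns} for a in columns
}
```

Each diagonal entry of `correlation_matrix` is 1 up to floating-point rounding, because every column correlates perfectly with itself. The off-diagonal entries are the values a correlation matrix plot would colour.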
Creating a new Data Transformation Block
To create a custom data transformation block for the data recipe, click the “add” icon in the recipe builder. Configure the new block by providing its name, the technology used, category, intellectual property status and code, along with the following parameters: input, input validation, output, handlers and resource requirement.
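The fields above can be pictured as one structured definition. The following Python sketch models them as a dataclass so that you can see how they fit together. It is hypothetical throughout: the class `CustomBlock`, its field names, the sample values and the choice of which fields are treated as required are assumptions made for illustration, not the platform's actual schema.

```python
from dataclasses import dataclass, field

# Hypothetical structure: the fields mirror the form fields described above,
# but the platform's real schema and accepted values may differ.
@dataclass
class CustomBlock:
    name: str
    technology: str
    category: str
    ip_status: str
    code: str
    inputs: dict = field(default_factory=dict)
    input_validation: dict = field(default_factory=dict)
    outputs: dict = field(default_factory=dict)
    handlers: dict = field(default_factory=dict)
    resources: dict = field(default_factory=dict)

    def missing_fields(self):
        """Return the names of text fields (assumed required) left empty."""
        required = ("name", "technology", "category", "code")
        return [f for f in required if not getattr(self, f).strip()]

block = CustomBlock(
    name="drop_null_rows",
    technology="python",      # assumed value
    category="Row Blocks",    # assumed value
    ip_status="internal",     # assumed value
    code="def transform(rows):\n"
         "    return [r for r in rows if None not in r.values()]",
    inputs={"rows": "list of dict"},
    input_validation={"rows": "non-empty"},
    outputs={"rows": "list of dict"},
    handlers={"on_error": "fail"},
    resources={"cpu": 1, "memory_mb": 512},
)
```

Calling `block.missing_fields()` returns an empty list when all of the assumed-required fields are filled in, which is the kind of check a form could run before saving a block.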
Editing a custom Data Transformation Block
To edit a custom block available in the recipe builder, use the edit block option. You can modify both the structure and the configuration of the pipeline block.