Learning Paths
Advanced
Part 2. Removing entities and Grouping Data

Part 2. Removing entities and Grouping Data


Step by Step Guide: How to Regroup Entities

To reshape the dataset, we will be using Regroup entities, which allows you to redefine the entities. In this step, you can select a new column and define it as entities, or remove them. However, removing an entity is a delicate task, as it can potentially corrupt the data.

image

01. Fetch Alphacast Dataset

To create a pipeline by clicking on Create Pipeline at the top right corner. Select a name and a repository for your pipeline, then choose Fetch Alphacast Dataset. For this example, search Sales Data

02. Regroup Entities

In our example we decided to delete the entity SalesPerson so be need to solve the problem of duplicates. We must define how to deal with them, such as keeping the first one or combining and calculating the average or sum. You can remove the entity "SalesPerson" to calculate the total sales in each region or remove the region to calculate the sales of each person.

Adv - part 2 - regroupe.png