This article will walk readers through how to merge datasets within ClearQuery
ClearQuery allows users to merge up to three (3) datasets together, provided that the datasets share a common field, such as a primary/foreign key relationship or transaction ID. To merge datasets, all the datasets must already exist within CQ. Once all the data is in the system, the user can navigate to the "Dataset Management" page and locate the "Link Existing Datasets" button in the top right corner.
The user will be prompted to provide the following information:
- Dataset Name: The label for the new merged dataset.
- Owners: Users who will have access to edit and delete the merged dataset.
- Datasets: A dropdown menu containing all the datasets the user has access to. Select the datasets to be merged.
- Note: The first dataset selected will be labeled as the "Primary" dataset.
Once that information is filled in, CQ will create a linkage table where the user will tell the system how to link each dataset together. This is accomplished by selecting the common field for the dataset listed and linking it to a corresponding field in the other datasets, the common field must be one of the following field types: Categorical, Numeric, or Decimal. Once this information is completed, select the “Create” button. The system will now work to complete the merge.
Note: Merging datasets creates a new dataset. Therefore, the organization must have sufficient room in their subscription to accommodate the addition of another dataset as well as the size of the newly created dataset. If there is no additional room, a pre-existing dataset can be removed, or the admin can contact their Account Manager for more information.
If you have additional questions regarding this topic, please reach out to your Account Manager or contact ClearQuery Support (support@clearquery.io).