Within the platform, data is structured and stored in a hierarchical manner, which comprises of 3 main components - data sources, datasets, projects. If you are an Automation Cloud user, these three components will be stored within your cloud tenant(s). Access to each of these is controlled by strict permissions.
Data Sources
These are collections of raw unlabelled communications data of a similar type, e.g. all emails from a shared mailbox, or a collection of NPS survey responses (see here for more detail). Individual data sources can be associated with up to 10 different datasets.
Datasets
These are comprised of 1 - 20 data sources (of similar type with similar intended purposes) and the 'model' that you create when you train the platform to understand the data in those sources (see here for more detail).
Projects
A permissioned storage area within the platform. Each dataset and data source belongs to a specific project, which is designated when they are created (see here for more detail).
Tenants (Automation Cloud users only)
These allow you to model your organization structure, separating your business flows and information just like in real-life organizations. They are containers where you can organize your services and manage them for a group of users.
For example, you can create tenants for each of your departments and decide what services you want to enable for each, based on their needs. In each tenant, you can have one instance of each of the cloud services.
It is important to note that you cannot promote Communications Mining models between different UiPath Cloud tenants (e.g.: promoting from DEV to PROD).
If you can only deploy to PROD in a PROD environment, then enable Communications Mining in PROD. However, if you have flexibility with deploying to PROD from another environment, you can have your PROD automation(s) call the platform from the tenant it sits in (e.g.: QA/DEV).
Permissions
These are per-user and specific to each project that a user belongs to. They can provide access to sensitive data and, depending on the permission, allow users to perform a range of different actions in the platform (see here for more detail).
Overview
If you are an Automation Cloud user, your Communications Mining service will be enabled on a specific tenant. Tenants are where projects are stored.
Each dataset and data source is associated with a specific project, with users requiring permissions in those projects to be able to work with the data within them.
Datasets in one project can be made up of data sources from another project. Users will just require permissions in both projects to view and label the data.
The below illustration helps to illustrate the relationship between these components and permissions:
- In the example below with Tenant A, all of the data sources are associated with Project A1, whilst there are datasets associated with both Project A1 and Project A2.
- If a user wanted to access datasets in Project A1 (i.e. dataset 1, 2 or 3), they would require viewing permissions for Project A1 only.
- But if a user wanted to access datasets in Project A2 (i.e. dataset 4, 5 or 6), they would require viewing permissions for both Projects A1 and A2, because the data sources are all located in Project A1.
- To view project A1 or A2, the user would require access to Tenant A. To view project B1, the user would require access to Tenant B. The user permissions do not transfer cross-tenant.
- The concept of having multiple cloud tenants is only applicable for Automation Cloud users.
Example data structure within the platform