User permissions required: ‘View Sources’ AND ‘Review and label’
Please Note: Users will be able to see verbatims in Discover if they have ‘View Sources’ AND see labels if they have ‘View labels’ permissions, but they will require the ‘Review and label’ permission in order to actually apply labels in Discover.
Overview
The 'Search' functionality in Discover is used to search for key terms and phrases. You are able to search for exact search terms and if they exist it will show you these followed by partial matches. This function can be used to search for alternative terms and ways of expressing the same intent or concept for each label. This can be useful if you know a relevant common term or expression that has not appeared in any of the clusters so far and want to pin a couple of examples.
Search should not be used to apply a large number of examples per search term and per label - only a few of each.
Let’s look at an example. The cluster below is clearly about the location of the hotel, where a ‘Location’ label has been predicted. If we only used this term it could bias the model towards the phrases around the word ‘Location’ or similar, and we should use the Search feature to find alternative ways of expressing this:
Example cluster when searching for 'Location'
Possible alternative search terms for 'Location':
- Located
- Convenient
- Position
- Proximity
- Near
- Hotel position
- Location to transport
- Transport links
- Tourist attractions
- Close to transport
- Central
- Close to airport
- Near the airport
Searching for different terms
The example below shows how searching for alternative terms for ‘Location’ highlights verbatims that are related to the location of the hotel but expressed differently. By doing this, the model will be given different examples of ‘Location’.
Alternative search results
Applying labels to search results
Labelling search results in Discover
- Select ‘Search’ from the ‘Cluster’ drop-down menu in the Discover tab
- Enter your search term and hit enter or click the search icon
- Matching search terms will appear highlighted in orange. The platform will show full matches followed by partial matches
- Add all labels that should apply, not just your Search results (e.g. Property > Staff label in the cluster above)
- DO NOT do this for large numbers of verbatims for each label
You can use this process sparingly for each label that has variable ways of expressing the same topic. However, there are other methods covered in the Explore phase that also help provide different training examples, but do not have the potential to bias your model.
Previous: Training using 'Clusters'