This page was last edited on December 17, 2013, at 18:54.
This topic describes part of the functionality of Genesys Content Analyzer.
This tab, shown in the figure Indexing Tab, displays information on the cooccurrence patterns of words in uncategorized e-mails.
The tab displays, in tree form, a list of all the words that occur in the uncategorized e-mails (except Stop Words).
The index tree consists of folder icons. Each folder is labeled with a word, followed in square brackets by its number of occurrences (the number of e-mails that the word occurs in). These words are called head words.
Each head word folder expands to a list of the words (also folders) that cooccur with the head word, that is, words that occur together with the head word in one or more e-mails. Each cooccurring word is followed by square brackets containing two numbers: the number of e-mails this word occurs in, and a ratio. This ratio is the word's rate of occurrence with this head word divided by its rate of occurrence in the whole corpus. A ratio greater than 1 therefore means that the word occurs more often alongside the head word than it does in the corpus overall. The figure Indexing Tab Example provides an example.
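The ratio described above can be illustrated with a short Python sketch. This is not Content Analyzer's implementation. It assumes that "rate of occurrence with this head word" means the fraction of e-mails containing the head word that also contain the cooccurring word, and that "rate of occurrence in whole corpus" means the fraction of all e-mails that contain the cooccurring word. The names build_index and tokenize, and the stop-word list, are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical stop-word list; the product's actual Stop Words are not shown here.
STOP_WORDS = {"the", "a", "an", "and", "in", "of", "to", "is"}


def tokenize(text):
    """Return the set of distinct non-stop words in one e-mail (lowercased)."""
    words = (w.strip(".,;:!?\"'()").lower() for w in text.split())
    return {w for w in words if w and w not in STOP_WORDS}


def build_index(emails):
    """Build a two-level index: head word -> cooccurring word -> (count, ratio).

    count is the number of e-mails the cooccurring word occurs in.
    ratio is (rate of occurrence with the head word) divided by
    (rate of occurrence in the whole corpus), under the assumed definitions.
    """
    docs = [tokenize(e) for e in emails]
    total = len(docs)

    # Number of e-mails each word occurs in (the bracketed number on a head word).
    doc_freq = Counter(w for d in docs for w in d)

    # Number of e-mails in which each pair of words occurs together.
    pair_freq = defaultdict(Counter)
    for d in docs:
        for head in d:
            for other in d:
                if other != head:
                    pair_freq[head][other] += 1

    index = {}
    for head, others in pair_freq.items():
        index[head] = {}
        for word, both in others.items():
            rate_with_head = both / doc_freq[head]
            rate_in_corpus = doc_freq[word] / total
            index[head][word] = (doc_freq[word], rate_with_head / rate_in_corpus)
    return doc_freq, index
```

Calling build_index on a list of e-mail bodies returns the per-word e-mail counts and the nested index. Under these assumptions, a word that appears in every e-mail containing the head word, but in only half of the corpus, gets a ratio of 2.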
Among the information displayed in this example is the following:
This indicates that the words articles and newsstand are highly likely to occur together, which means e-mails that contain both words are good candidates for grouping together in a category. If you select newsstand, then click Select texts, the display switches to the Main tab, showing that all e-mails that contain magazines, articles, and newsstand have been put in the Candidate messages list.
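The effect of Select texts described above amounts to keeping the e-mails that contain every selected word. The following Python sketch shows that filtering step only; the function name select_texts and the simple whitespace tokenization are assumptions made for illustration, not the product's actual behavior.

```python
def select_texts(emails, selected_words):
    """Return the e-mails that contain every word in selected_words.

    Matching is case-insensitive and ignores surrounding punctuation;
    both choices are assumptions for this sketch.
    """
    wanted = {w.lower() for w in selected_words}
    candidates = []
    for email in emails:
        words = {w.strip(".,;:!?\"'()").lower() for w in email.split()}
        if wanted <= words:  # every selected word is present in this e-mail
            candidates.append(email)
    return candidates
```

For example, calling select_texts with the words magazines, articles, and newsstand returns only the e-mails in which all three words appear, which corresponds to the contents of the Candidate messages list in the example.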
At the bottom of the tab are the following:
The figure "Find Words = “mystery reading”" shows the result of entering "mystery reading" in the Find words box: the index tree shows only the head word mystery and the cooccurring word reading.