- Create single-word frequency histograms from Study 1 Data
- Chunk Study 1 Data into two-word pairs (bigrams) and create histograms to see which combinations occur frequently (such as “authorized person” and “unauthorized person”)
- Apply part-of-speech (POS) tagging and create word frequency histograms restricted to verbs
- Generate WordNet synsets for common verbs
- Finally, look for clusters in the data and try to determine how to predict user vocabulary
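
The first two steps above (single-word and two-word frequency counts) can be sketched roughly as follows. This is only a minimal illustration using the Python standard library: the `responses` list is hypothetical stand-in text rather than actual Study 1 Data, and the helper names `tokenize`, `unigram_counts`, and `bigram_counts` are assumptions introduced here. The POS-tagging and WordNet steps would need an NLP toolkit and are not covered.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def unigram_counts(responses):
    """Count how often each single word appears across all responses."""
    counts = Counter()
    for response in responses:
        counts.update(tokenize(response))
    return counts


def bigram_counts(responses):
    """Count adjacent two-word pairs, never pairing words across responses."""
    counts = Counter()
    for response in responses:
        tokens = tokenize(response)
        counts.update(zip(tokens, tokens[1:]))
    return counts


# Hypothetical stand-in for Study 1 Data (not real study responses).
responses = [
    "Only an authorized person may open the door",
    "An unauthorized person tried to open the door",
    "The authorized person locked the door",
]

words = unigram_counts(responses)
pairs = bigram_counts(responses)

# Print the most frequent items; these counts are the data behind a histogram.
for word, n in words.most_common(5):
    print(f"{word:15s} {n}")
for (first, second), n in pairs.most_common(5):
    print(f"{first} {second:15s} {n}")
```

The counts returned by `most_common` can be passed to any plotting tool to draw the histograms. Counting bigrams per response, rather than over one concatenated string, avoids creating spurious pairs that span two different participants' answers.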
