This is a view of the top 3000 keywords used in the Common Vulnerabilities and Exposures description/summary (from 1999 until Today) automatically generated from the full-text indexing functionality of cve-search. Move over your mouse to get the value. On the visualization the keyword is truncated to fit the screen but if you move your mouse on the node you'll see the full keywords with the number of occurrences.

The terms (words & verbs) have been lemmatized with the WordNet lemmatizer from NTLK and the English stopwords has been used to remove the undesired words.

The CVE terms could be considered as a kind of a text corpus as itself. Specific terms like most common version numbers or CVE most common wording like 'via' were kept in this visualization.

If you want to read the code used for the generation of the keywords list, cve-search search_fulltext section generating the JSON for this visualization.

JSON files containing the CVE keywords and their occurences can be downloaded for further analysis.

Tweet