Ranking and Extraction of Relevant Single Words in text
Jan 2008
The extraction of keywords is currently a very important technique used in several
applications, for instance, the characterization of document topics. In this case, by extracting
the right keywords on a query, one could easily know what documents should be read and
what documents should be put aside. However, while the automatic extraction of
multiword has been an active search field by the scientific community, the automatic
extraction of single words, or unigrams, has been basically ignored due to its intrinsic
difficulty. Meanwhile, it is easy to demonstrate that in a process of keyword extraction,
leaving unigrams out impoverishes, in a certain extent, the quality of the final result.