dc.contributor.author |
Paijmans, H
|
|
dc.date.accessioned |
2018-05-30T14:18:48Z |
|
dc.date.available |
2018-05-30T14:18:48Z |
|
dc.date.issued |
1995 |
|
dc.identifier.citation |
Paijmans H (1995) Word frequencies in scientific prose. South African Computer Journal, Number 14, 1995 |
en |
dc.identifier.issn |
2313-7835 |
|
dc.identifier.uri |
http://hdl.handle.net/10500/24177 |
|
dc.description.abstract |
An overview is given of popular term-weighting schemes as used in Information Retrieval. The question is raised whether the word frequency based t f.idf weights may be used as a yardstick to compare schemes that are not based on frequency-characteristics, such as the first mention theory or schemes based on word categories. It is demonstrated that word categories and frequency derived weights have a close correlation but that weighting according to the first mentioned theory shows no agreement with frequency-based weights. |
en |
dc.language.iso |
en |
en |
dc.publisher |
South African Computer Society (SAICSIT) |
en |
dc.subject |
Information retrieval |
en |
dc.subject |
Word frequencies |
en |
dc.subject |
SMART |
en |
dc.subject |
Full-text documents |
en |
dc.title |
Word frequencies in scientific prose |
en |
dc.type |
Article |
en |