Please use this identifier to cite or link to this item:
http://hdl.handle.net/11452/31516
Title: | Using conditional probabilities for automatic new topic identification |
Authors: | Uludağ Üniversitesi/Mühendislik Fakültesi/Endüstri Mühendisliği Bölümü. Özmutlu, Seda Özmutlu, Hüseyin C. Büyük, Buket AAH-4480-2021 ABH-5209-2020 6603660605 6603061328 23570445900 |
Keywords: | Computer science Information science & library science Information-seeking Web Users Life Behaviour Information services Query categories Search engines Topic identification Information retrieval Search engines Statistical analysis Online searching Probability distributions Problem solving Statistical methods User interfaces |
Issue Date: | 2007 |
Publisher: | Emerald Group Publishing |
Citation: | Özmutlu, S. vd. (2007). "Using conditional probabilities for automatic new topic identification". Online Information Review, 31(4), 491-515. |
Abstract: | Purpose - One of the most important dimensions of search engine user information seeking behaviour is content-based behaviour. One of the main elements in developing a personalised intelligent search engine is new topic identification. The purpose of this study is to perform automatic new topic identification in search engine transaction logs using conditional probabilities of new topic arrivals. Design/methodology/approach - Sample data logs from FAST (currently owned by Yahoo!) and Excite (currently owned by IAC Search & Media) are used in the study. Conditional probabilities of new topic arrivals and topic continuations given query category are used to estimate new topic arrivals. Findings - The findings of this study show that the conditional probability approach reduced overestimation of topic shifts, increasing some performance measures to their highest ever value compared to previous studies. A straightforward procedure such as the conditional probability approach can be as successful as, and for some measures more successful than, more complex methods applied in previous automatic new topic identification studies. Originality/value - A straightforward procedure that can enable fast automatic new topic identification, a problem not yet solved, and an important step towards personalised search engines. |
URI: | https://doi.org/10.1108/14684520710780449 https://www.emerald.com/insight/content/doi/10.1108/14684520710780449/full/html http://hdl.handle.net/11452/31516 |
ISSN: | 14684527 |
Appears in Collections: | Scopus Web of Science |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.