Bu öğeden alıntı yapmak, öğeye bağlanmak için bu tanımlayıcıyı kullanınız: http://hdl.handle.net/11452/25198
Başlık: Effective early termination techniques for text similarity join operator
Yazarlar: Ulusoy, Özgür
Yolum, Pınar
Güngör, T.
Gürgen, Fikret
Özturan, Can
Uludağ Üniversitesi/Mühendislik Fakültesi/Endüstri Mühendisliği Bölümü.
0000-0001-9201-6349
Özalp, Selma Ayşe
G-1584-2018
I-9828-2018
6603978393
Anahtar kelimeler: Computer science
Metadata
Bibliographic retrieval systems
Computation theory
Computer operating procedures
Data mining
Data reduction
Information retrieval
Integration
Query languages
Application domains
Data querying
Filter heuristics
Text similarity
Text processing
Yayın Tarihi: 2005
Yayıncı: Springer
Atıf: Özalp, S. A. ve Ulusoy, Ö. (2005). "Effective early termination techniques for text similarity join operator". ed. P. Yolum vd. Computer and Information Sciences (ISCIS 2005)- Lecture Notes in Computer Science, 3733, 791-801.
Özet: Text similarity join operator joins two relations if their join attributes are textually similar to each other, and it has a variety of application domains including integration and querying of data from heterogeneous resources; cleansing of data; and mining of data. Although, the text similarity join operator is widely used, its processing is expensive due to the huge number of similarity computations performed. In this paper, we incorporate some short cut evaluation techniques from the Information Retrieval domain, namely Harman, quit, continue, and maximal similarity filter heuristics, into the previously proposed text similarity join algorithms to reduce the amount of similarity computations needed during the join operation. We experimentally evaluate the original and the heuristic based similarity join algorithms using real data obtained from the DBLP Bibliography database, and observe performance improvements with continue and maximal similarity filter heuristics.
Açıklama: Bu çalışma, 26-28 Ekim 2005 tarihleri arasında İstanbul[Türkiye]'da düzenlenen 20. International Symposium on Computer and Information Sciences'da bildiri olarak sunulmuştur.
URI: https://doi.org/10.1007/11569596_81
https://link.springer.com/chapter/10.1007/11569596_81
http://hdl.handle.net/11452/25198
ISBN: 3-540-29414-7
ISSN: 0302-9743
1611-3349
Koleksiyonlarda Görünür:Scopus
Web of Science

Bu öğenin dosyaları:
Dosya Açıklama BoyutBiçim 
Özalp_Ulusoy_2005.pdf55.93 kBAdobe PDFKüçük resim
Göster/Aç


Bu öğe kapsamında lisanslı Creative Commons License Creative Commons