Please use this identifier to cite or link to this item:
http://hdl.handle.net/11452/25198
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ulusoy, Özgür | - |
dc.contributor.author | Yolum, Pınar | - |
dc.contributor.author | Güngör, T. | - |
dc.contributor.author | Gürgen, Fikret | - |
dc.contributor.author | Özturan, Can | - |
dc.date.accessioned | 2022-03-21T06:13:03Z | - |
dc.date.available | 2022-03-21T06:13:03Z | - |
dc.date.issued | 2005 | - |
dc.identifier.citation | Özalp, S. A. and Ulusoy, Ö. (2005). "Effective early termination techniques for text similarity join operator". ed. P. Yolum et al. Computer and Information Sciences (ISCIS 2005) - Lecture Notes in Computer Science, 3733, 791-801. | en_US |
dc.identifier.isbn | 3-540-29414-7 | - |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.issn | 1611-3349 | - |
dc.identifier.uri | https://doi.org/10.1007/11569596_81 | - |
dc.identifier.uri | https://link.springer.com/chapter/10.1007/11569596_81 | - |
dc.identifier.uri | http://hdl.handle.net/11452/25198 | - |
dc.description | This work was presented as a paper at the 20th International Symposium on Computer and Information Sciences, held in Istanbul, Turkey, on 26-28 October 2005. | tr_TR |
dc.description.abstract | The text similarity join operator joins two relations if their join attributes are textually similar to each other. It has a variety of application domains, including integration and querying of data from heterogeneous resources, data cleansing, and data mining. Although the text similarity join operator is widely used, its processing is expensive due to the huge number of similarity computations performed. In this paper, we incorporate some shortcut evaluation techniques from the Information Retrieval domain, namely the Harman, quit, continue, and maximal similarity filter heuristics, into the previously proposed text similarity join algorithms to reduce the number of similarity computations needed during the join operation. We experimentally evaluate the original and the heuristic-based similarity join algorithms using real data obtained from the DBLP Bibliography database, and observe performance improvements with the continue and maximal similarity filter heuristics. | en_US |
dc.description.sponsorship | Inst Elec & Elect Engineers, Turkey Sect | en_US |
dc.description.sponsorship | Boğaziçi Üniversitesi | tr_TR |
dc.language.iso | en | en_US |
dc.publisher | Springer | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | tr_TR |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Computer science | en_US |
dc.subject | Metadata | en_US |
dc.subject | Bibliographic retrieval systems | en_US |
dc.subject | Computation theory | en_US |
dc.subject | Computer operating procedures | en_US |
dc.subject | Data mining | en_US |
dc.subject | Data reduction | en_US |
dc.subject | Information retrieval | en_US |
dc.subject | Integration | en_US |
dc.subject | Query languages | en_US |
dc.subject | Application domains | en_US |
dc.subject | Data querying | en_US |
dc.subject | Filter heuristics | en_US |
dc.subject | Text similarity | en_US |
dc.subject | Text processing | en_US |
dc.title | Effective early termination techniques for text similarity join operator | en_US |
dc.type | Article | en_US |
dc.identifier.wos | 000234179600079 | tr_TR |
dc.identifier.scopus | 2-s2.0-33646503003 | tr_TR |
dc.relation.tubitak | 100U024 | tr_TR |
dc.relation.publicationcategory | Article - International Refereed Journal | tr_TR |
dc.contributor.department | Uludağ University/Faculty of Engineering/Department of Industrial Engineering. | tr_TR |
dc.contributor.orcid | 0000-0001-9201-6349 | tr_TR |
dc.identifier.startpage | 791 | tr_TR |
dc.identifier.endpage | 801 | tr_TR |
dc.identifier.volume | 3733 | tr_TR |
dc.relation.journal | Computer and Information Sciences (ISCIS 2005) - Lecture Notes in Computer Science | en_US |
dc.contributor.buuauthor | Özalp, Selma Ayşe | - |
dc.contributor.researcherid | G-1584-2018 | tr_TR |
dc.contributor.researcherid | I-9828-2018 | tr_TR |
dc.relation.collaboration | Domestic | tr_TR |
dc.subject.wos | Computer science, information systems | en_US |
dc.subject.wos | Computer science, theory & methods | en_US |
dc.indexed.wos | SCIE | en_US |
dc.indexed.wos | CPCIS | en_US |
dc.indexed.scopus | Scopus | en_US |
dc.wos.quartile | Q4 | en_US |
dc.contributor.scopusid | 6603978393 | tr_TR |
dc.subject.scopus | Inverted Index; Query Processing; Caching | en_US |
Appears in Collections: | Scopus; Web of Science |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Özalp_Ulusoy_2005.pdf | | 55.93 kB | Adobe PDF |
This item is licensed under a Creative Commons License