ME-Match: Tonal Grouping Based Approach in Cross-Script Name Matching
Kyaw Zar Zar Phyu, Khin Mar Lar Tun
2009 International Conference on Future Computer and Communication, published 2009-04-03
DOI: 10.1109/ICFCC.2009.24 (https://doi.org/10.1109/ICFCC.2009.24)
Citations: 0
Abstract
Matching proper names across different scripts could be immensely useful for news organizations, for author recognition in digital libraries, and for homeland security, yet such names cannot currently be matched automatically. We propose a new approach, ME-Match, for matching proper names across different scripts. The central idea is to match names via their phoneme strings. The main steps in ME-Match are: creating a bilingual pronouncing mapping; tokenizing the query name; transforming the query name into IPA form using a tonal grouping approach; searching, for each query IPA phoneme string, for the possible word variants in both scripts; combining these variants into full name strings; and finally searching for those names. Performance is measured by standard information-retrieval metrics: recall, precision, and F-measure.
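The steps listed in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the Latin/Cyrillic toy data, the phoneme keys, the greedy tokenizer, and all function names (`tokenize`, `to_phonemes`, `name_variants`, `match`, `precision_recall_f1`) are hypothetical. Tonal grouping is modeled only as a table that maps several spellings to one shared phoneme key; the abstract does not give the actual grouping rules.

```python
from itertools import product

# Hypothetical bilingual pronouncing mapping: phoneme key -> spellings per script.
# Toy data for illustration only, not linguistically verified.
PRONOUNCING_MAP = {
    "an": {"latin": ["an"], "cyrillic": ["ан"]},
    "na": {"latin": ["na", "nah"], "cyrillic": ["на"]},
}

# Reverse index: spelling -> shared phoneme key (a stand-in for tonal grouping).
SPELLING_TO_KEY = {
    spelling: key
    for key, scripts in PRONOUNCING_MAP.items()
    for spellings in scripts.values()
    for spelling in spellings
}


def tokenize(name):
    """Split a name into known spellings using greedy longest match."""
    name = name.lower()
    tokens, i = [], 0
    while i < len(name):
        for j in range(len(name), i, -1):
            if name[i:j] in SPELLING_TO_KEY:
                tokens.append(name[i:j])
                i = j
                break
        else:
            raise ValueError(f"cannot tokenize {name!r} at position {i}")
    return tokens


def to_phonemes(name):
    """Transform a name into its sequence of phoneme keys."""
    return tuple(SPELLING_TO_KEY[token] for token in tokenize(name))


def name_variants(phonemes):
    """Combine per-phoneme spellings into full name strings, one script at a time."""
    variants = set()
    for script in ("latin", "cyrillic"):
        choices = [PRONOUNCING_MAP[key][script] for key in phonemes]
        variants.update("".join(combo) for combo in product(*choices))
    return variants


def match(query, database):
    """Return the database names whose spelling is a variant of the query."""
    variants = name_variants(to_phonemes(query))
    return [name for name in database if name.lower() in variants]


def precision_recall_f1(retrieved, relevant):
    """Standard information-retrieval metrics over sets of names."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    found = match("anna", ["Анна", "Annah", "Ivan"])
    print(found)
    print(precision_recall_f1(found, ["Анна", "Annah", "Ana"]))
```

In this sketch the query is never compared to database entries character by character; both sides meet only through the shared phoneme keys, which is the idea the abstract describes. A real system would need a far larger mapping and a tokenizer that handles ambiguous splits.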