Japanese sentence compression using Simple English Wikipedia
Shunsuke Takeno, Kazuhide Yamamoto
2015 International Conference on Asian Language Processing (IALP), published 2015-10-01
DOI: https://doi.org/10.1109/IALP.2015.7451533
Citation count: 1
Abstract
We describe a cross-lingual approach to sentence compression for Japanese Wikipedia articles that uses their correspondence with Simple English Wikipedia articles. Taking advantage of the nature of the corpus, we can find the essential parts of an encyclopedic description without relying heavily on statistical information, which is noisy. We manually explored the correspondences between Japanese Wikipedia articles and Simple English Wikipedia articles, and then proposed a cross-lingual alignment method that uses a simple matching algorithm. We provide an analysis of these correspondences and preliminary results of sentence compression using Simple English Wikipedia.