2011 International Conference on Semantic Technology and Information Retrieval: Latest Papers

A review of retrospective news event detection
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995790
Qusai Ramadan, M. Mohd
Retrospective news event detection (RED) has been studied for many years in order to discover previously unidentified events. Work is ongoing to improve RED techniques, such as distance measures and clustering approaches, to overcome issues such as the high dimensionality of the data. This paper discusses the three major sequential stages in RED (data preprocessing, data representation and data organization) and reports the limitations of each stage. Finally, we present a suggested RED approach for the crime domain.
Citations: 8
A framework for semantic forum in e-learning education
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995768
Hazalina Hashim, S. Noah
Establishing an effective online forum is crucial to developing and supporting a collaborative knowledge-building community. However, there is some resistance to the online forum as an effective e-learning facility. This research therefore aims to support this learning facility by adapting semantic technologies. A conceptual framework is proposed for a holistic approach that focuses on the development of the knowledge content and on how users can use this knowledge through a semantic forum in e-learning education.
Citations: 0
Clustering patent document in the field of ICT (Information & Communication Technology)
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995789
A. Widodo, I. Budi
The current classification of patent data, which refers to the IPC (International Patent Classification) of the WIPO (World Intellectual Property Organization), is deemed not to reflect the classification of the field of ICT (Information & Communication Technology). ICT applications are usually included in sections G (Physics) and H (Electricity). This paper evaluates eight groupings of patents based on the IPC classes (G01, G06, G09, G11, H01, H03, H04, and H06) of patents registered at the Directorate General of Intellectual Property Rights in Indonesia from 1991 to 2000. The algorithms used for grouping are KMeans, KMeans++, and Hierarchical Clustering, as well as combinations of these three algorithms with SVD (Singular Value Decomposition). Purity and F-Measure are used for external validation, whereas Silhouette is used for internal validation. The experimental results show that SVD improves the clustering results. In addition, using the abstract does not necessarily improve clustering performance, and using phrases as index terms does not always yield better clusters than using words. Moreover, no cluster has a purity greater than 50%, which means that the existing IPC classification has not been able to accommodate the field of ICT appropriately.
Citations: 7
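The patent clustering abstract above uses Purity as an external validation measure. As a rough illustration only, purity can be computed as the fraction of documents that share the majority class of their cluster. The sketch below is not the authors' code: the function names and the toy cluster and IPC labels are made up for this example.

```python
from collections import Counter, defaultdict


def _count_labels(cluster_ids, class_labels):
    """Group gold class labels by cluster and count them."""
    if len(cluster_ids) != len(class_labels):
        raise ValueError("cluster_ids and class_labels must have the same length")
    if not cluster_ids:
        raise ValueError("purity is undefined for an empty clustering")
    members = defaultdict(Counter)
    for cluster, label in zip(cluster_ids, class_labels):
        members[cluster][label] += 1
    return members


def purity(cluster_ids, class_labels):
    """Overall purity: share of documents in the majority class of their cluster."""
    members = _count_labels(cluster_ids, class_labels)
    majority_total = sum(max(counts.values()) for counts in members.values())
    return majority_total / len(cluster_ids)


def per_cluster_purity(cluster_ids, class_labels):
    """Purity of each cluster taken separately."""
    members = _count_labels(cluster_ids, class_labels)
    return {
        cluster: max(counts.values()) / sum(counts.values())
        for cluster, counts in members.items()
    }


# Toy data (hypothetical): six patents, two clusters, IPC classes as gold labels.
clusters = [0, 0, 0, 1, 1, 1]
ipc_classes = ["G06", "G06", "H04", "H04", "H04", "G01"]

overall_purity = purity(clusters, ipc_classes)
per_cluster = per_cluster_purity(clusters, ipc_classes)
print(f"overall purity: {overall_purity:.3f}")
print(per_cluster)
```

A per-cluster view like `per_cluster_purity` is the kind of figure a statement such as "no cluster has a purity greater than 50%" would be read from, assuming the paper reports purity cluster by cluster.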
Knowledge Management Systems for emergency managers: Malaysian perspective
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995804
Magiswary Dorasamy, Murali Raman, S. Muthaiyah, M. Kaliannan
Recent disasters in Malaysia, which comprised major floods and landslides, proved that despite past experience and a strong disaster management mechanism, losses of life and property are still unavoidable. Our proposition is that disaster planning and response efforts need a more prudent solution in order to reduce these losses. This paper attempts to answer how emergency managers and responders can benefit from information and communication technology (ICT), in the form of a Knowledge Management System (KMS) implementation, to support their planning and response efforts and hence reduce the losses. The paper examines recent literature on KMS for disasters. The findings of this paper are twofold. First, it points out the role and possible uses of KMS in improving the effectiveness of planning and response efforts for emergency managers; second, it identifies the important factors to consider in developing an effective KMS for disasters. The knowledge gained from this study should help emergency managers learn from past disasters that have already been so costly to society.
Citations: 12
Producing complete modules in ontology partitioning
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995778
Maryam Jafari Sharif Abadi, K. Zamanifar
Ontology modularization is an important part of ontology engineering that reduces the complexity and size of an ontology: it either breaks an ontology down into all of its constituent parts (ontology partitioning) or extracts only a small part of the ontology (module extraction). The approach presented in this paper concentrates on ontology partitioning. To obtain complete segments, we first perform reasoning on the ontology and then pass the result of the reasoning to Pato, a partitioning tool. The generated modules are therefore complete and of appropriate size. Although reasoning on large ontologies is time consuming, it happens only once and brings many advantages that other methods cannot provide. To support this claim, we assess the presented method and earlier methods theoretically and practically, and show that the proposed method partitions the ontology properly and that the resulting modules can be used by different tools as self-contained ontologies.
Citations: 6
Word Sense Disambiguation by using domain knowledge
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995795
W. Lee, E. Mit
Over the decades, many studies have been carried out to suggest different approaches to the Word Sense Disambiguation (WSD) process. From time to time, different approaches have been suggested to define the sense of a polysemous word. This paper discusses a WSD approach that uses domain knowledge. In this approach, WordNet is used to define the domains of each word, and a process then determines the best domain to assign to that particular word. A method for calculating the weight of each domain for its corresponding word is discussed. The sense of the ambiguous word is identified according to the weight assigned to each domain.
Citations: 17
Phonetic coding methods for Malay names retrieval
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995776
Norhasimawati Abdul Mutalib, S. Noah
Searching for personal names is very popular among users of information systems and search engines, so the effectiveness, accuracy and appropriateness of search results are strongly emphasized. Information retrieval (IR) methods strongly influence the search results. Efforts have been made to improve IR methods because names are not unique and have many spelling variants, which causes errors when retrieving the correct names. Phonetic searching is said to be a suitable way to solve this problem because names have limited spelling standards. Phonetic methods are used to recognize and retrieve words that have the same pronunciation. The main aim of this paper is to test the effectiveness of phonetic coding methods for Malay name retrieval using Soundex and modified-Asoundex (Asoundex is an Arabic Soundex). The experimental approach consists of two stages: program development and testing on Malay name data sets. The programs were developed from existing algorithms for generating name codes. The codes generated by the programs are compared with the data contained in the test data. The effectiveness of the result is determined by comparing the output with the results obtained from both phonetic approaches. Evaluation is based on the precision and recall measures. The contribution of the research is to provide the comparative accuracy of Malay name retrieval using the Soundex and modified-Asoundex coding methods. The results show that an average improvement of 38.38% in the precision measure has been achieved.
Citations: 3
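The phonetic coding abstract above compares Soundex with a modified Asoundex. For orientation, here is a minimal sketch of the classic American Soundex code only; it does not reproduce the paper's modified-Asoundex, whose rules the abstract does not give. The example names are hypothetical spelling variants chosen for illustration.

```python
# Classic Soundex letter groups; vowels, Y, H and W carry no digit.
_GROUPS = (
    ("BFPV", "1"),
    ("CGJKQSXZ", "2"),
    ("DT", "3"),
    ("L", "4"),
    ("MN", "5"),
    ("R", "6"),
)
_CODES = {letter: digit for letters, digit in _GROUPS for letter in letters}


def soundex(name):
    """Return the 4-character classic Soundex code, or "" if name has no A-Z letters."""
    letters = [ch for ch in name.upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    code = letters[0]
    previous = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            # H and W are skipped and do not separate letters with the same digit.
            continue
        digit = _CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
            if len(code) == 4:
                break
        # Vowels reset `previous`, so a repeated digit after a vowel is kept.
        previous = digit
    return code.ljust(4, "0")


# Hypothetical spelling variants of one name map to the same code.
for variant in ("Muhammad", "Mohamad", "Mohamed"):
    print(variant, soundex(variant))
```

Names that share a code would be returned together as candidate matches for a query. How well that works for Malay names is exactly what the paper evaluates, so this sketch makes no claim about retrieval quality.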
Evaluation of Quranic text retrieval system based on manually indexed topics
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995781
A. M. Sultan, A. Azman, R. A. Kadir, M. T. Abdullah
This paper investigates the effectiveness of a state-of-the-art information retrieval (IR) system on the verse retrieval problem for Quranic text. The evaluation is based on manually indexed topics of the Quran, which provide both the queries and the relevance judgments. Furthermore, the system is evaluated in both Malay and English environments. The performance of the system is measured by MAP, precision at 1, 5 and 10, and MRR scores. The results of the evaluation are promising, showing that the IR system has considerable potential for Quranic text retrieval.
Citations: 8
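The Quranic retrieval abstract above reports MAP, precision at k, and MRR. The sketch below shows the standard definitions of these measures on made-up data. The verse identifiers (`v1`, `v2`, ...) and the relevance sets are placeholders, not results from the paper, and each ranked list is assumed to contain no duplicates.

```python
def precision_at_k(ranked, relevant, k):
    """Fraction of the top-k results that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for doc in ranked[:k] if doc in relevant) / k


def average_precision(ranked, relevant):
    """Mean of the precision values at the rank of each relevant result."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant result, or 0.0 if none is retrieved."""
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0


# Placeholder runs: (ranked list returned by the system, set of relevant verses).
runs = [
    (["v1", "v2", "v3", "v4"], {"v1", "v3"}),
    (["v5", "v6", "v7"], {"v7"}),
]

mean_ap = sum(average_precision(r, rel) for r, rel in runs) / len(runs)
mean_rr = sum(reciprocal_rank(r, rel) for r, rel in runs) / len(runs)
p_at_1 = sum(precision_at_k(r, rel, 1) for r, rel in runs) / len(runs)
print(f"MAP={mean_ap:.3f}  MRR={mean_rr:.3f}  P@1={p_at_1:.3f}")
```

Averaging over queries is what turns average precision into MAP and reciprocal rank into MRR. In the setting described above, the manually indexed topics would supply both the queries and the `relevant` sets.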
Enriching non-taxonomic relations extracted from domain texts
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995772
N. Nabila, Ali Mamat, M. Azmi-Murad, N. Mustapha
Extracting non-taxonomic relations is one of the important tasks in constructing an ontology from text. Most current methods for identifying and extracting non-taxonomic relations are based on predicates representing relationships between two concepts, namely the relation between a subject and an object that occur in a sentence. However, the relations identified do not properly represent the domain, as the methods identify only a portion of the total relations in the domain texts. In this paper, we present a method that increases the number of relations extracted and thus represents the domain more properly. In this method, all potential relations are first generated, and then the less significant ones, based on their frequency, are removed. The method has been tested on a collection of texts describing electronic voting machines, and the results are encouraging.
Citations: 5
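The relation extraction abstract above describes generating all potential relations and then removing the less significant ones by frequency. A very rough sketch of the pruning step alone might look like the following. It assumes candidate (subject, predicate, object) triples are already available and uses a simple count threshold; the function name `prune_relations`, the `min_count` parameter and the toy triples are all illustrative, and the paper's actual significance criterion may differ.

```python
from collections import Counter


def prune_relations(candidate_triples, min_count=2):
    """Keep only the triples that occur at least `min_count` times."""
    counts = Counter(candidate_triples)
    return {triple: n for triple, n in counts.items() if n >= min_count}


# Toy candidates (hypothetical), one tuple per occurrence found in the texts.
candidates = [
    ("voter", "casts", "ballot"),
    ("voter", "casts", "ballot"),
    ("machine", "records", "vote"),
    ("machine", "records", "vote"),
    ("machine", "records", "vote"),
    ("official", "opens", "door"),
]

kept = prune_relations(candidates, min_count=2)
for triple, n in kept.items():
    print(triple, n)
```

Raising `min_count` trades coverage for precision: fewer spurious relations survive, but rare yet valid domain relations are dropped as well.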
Mining user activity as a context source for search and retrieval
Pub Date : 2011-06-28 DOI: 10.1109/STAIR.2011.5995782
Zhengwei Qiu, A. Doherty, C. Gurrin, A. Smeaton
Nowadays in information retrieval it is generally accepted that if we can better understand the context of searchers then this could help the search process, either at indexing time by including more metadata or at retrieval time by better modelling the user needs. In this work we explore how activity recognition from tri-axial accelerometers can be employed to model a user's activity as a means of enabling context-aware information retrieval. In this paper we discuss how we can gather user activity automatically as a context source from a wearable mobile device and we evaluate the accuracy of our proposed user activity recognition algorithm. Our technique can recognise four kinds of activities which can be used to model part of an individual's current context. We discuss promising experimental results, possible approaches to improve our algorithms, and the impact of this work in modelling user context toward enhanced search and retrieval.
Citations: 13