首页 > 最新文献

2014 IEEE International Conference on Semantic Computing最新文献

英文 中文
Partitioning OWL Knowledge Bases for Parallel Reasoning 面向并行推理的OWL知识库划分
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.34
S. Priya, Yuanbo Guo, Michael F. Spear, J. Heflin
The ability to reason over large scale data and return responsive query results is widely seen as a critical step to achieving the Semantic Web vision. We describe an approach for partitioning OWL Lite datasets and then propose a strategy for parallel reasoning about concept instances and role instances on each partition. The partitions are designed such that each can be reasoned on independently to find answers to each query sub goal, and when the results are unioned together, a complete set of results are found for that sub goal. Our partitioning approach has a polynomial worst case time complexity in the size of the knowledge base. In our current implementation, we partition semantic web datasets and execute reasoning tasks on partitioned data in parallel on independent machines. We implement a master-slave architecture that distributes a given query to the slave processes on different machines. All slaves run in parallel, each performing sound and complete reasoning to execute each sub goal of its query on its own set of partitions. As a final step, master joins the results computed by the slaves. We study the impact of our parallel reasoning approach on query performance and show some promising results on LUBM data.
对大规模数据进行推理并返回响应性查询结果的能力被广泛视为实现语义Web愿景的关键一步。我们描述了一种划分OWL Lite数据集的方法,然后提出了一种对每个分区上的概念实例和角色实例进行并行推理的策略。分区的设计使得每个分区都可以独立地进行推理,以找到每个查询子目标的答案,并且当结果合并在一起时,可以找到该子目标的完整结果集。我们的划分方法在知识库的大小上具有多项式的最坏情况时间复杂度。在我们目前的实现中,我们对语义web数据集进行分区,并在独立的机器上并行地对分区数据执行推理任务。我们实现了一个主从架构,将给定的查询分发到不同机器上的从进程。所有从服务器并行运行,每个从服务器执行合理和完整的推理,在其自己的分区集上执行其查询的每个子目标。作为最后一步,主人加入奴隶计算的结果。我们研究了并行推理方法对查询性能的影响,并在LUBM数据上展示了一些有希望的结果。
{"title":"Partitioning OWL Knowledge Bases for Parallel Reasoning","authors":"S. Priya, Yuanbo Guo, Michael F. Spear, J. Heflin","doi":"10.1109/ICSC.2014.34","DOIUrl":"https://doi.org/10.1109/ICSC.2014.34","url":null,"abstract":"The ability to reason over large scale data and return responsive query results is widely seen as a critical step to achieving the Semantic Web vision. We describe an approach for partitioning OWL Lite datasets and then propose a strategy for parallel reasoning about concept instances and role instances on each partition. The partitions are designed such that each can be reasoned on independently to find answers to each query sub goal, and when the results are unioned together, a complete set of results are found for that sub goal. Our partitioning approach has a polynomial worst case time complexity in the size of the knowledge base. In our current implementation, we partition semantic web datasets and execute reasoning tasks on partitioned data in parallel on independent machines. We implement a master-slave architecture that distributes a given query to the slave processes on different machines. All slaves run in parallel, each performing sound and complete reasoning to execute each sub goal of its query on its own set of partitions. As a final step, master joins the results computed by the slaves. We study the impact of our parallel reasoning approach on query performance and show some promising results on LUBM data.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127495723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Feature Selection for Twitter Classification Twitter分类的特征选择
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.50
D. Ostrowski
Twitter-based messages have presented challenges in the identification of features as applied to classification. This paper explores filtering techniques for improved trend detection and information extraction. Starting with a pre-filtered source (Twitter), we will examine the application of both information theory and Natural Language Processing (NLP) based techniques as a means of preprocessing for classification. Results demonstrate that both means allow for improved results in classification among highly idiosyncratic data (Twitter).
基于twitter的消息在识别用于分类的特征方面提出了挑战。本文探讨了用于改进趋势检测和信息提取的过滤技术。从预过滤的源(Twitter)开始,我们将研究信息理论和基于自然语言处理(NLP)的技术作为分类预处理手段的应用。结果表明,这两种方法都允许在高度特质数据(Twitter)的分类中改进结果。
{"title":"Feature Selection for Twitter Classification","authors":"D. Ostrowski","doi":"10.1109/ICSC.2014.50","DOIUrl":"https://doi.org/10.1109/ICSC.2014.50","url":null,"abstract":"Twitter-based messages have presented challenges in the identification of features as applied to classification. This paper explores filtering techniques for improved trend detection and information extraction. Starting with a pre-filtered source (Twitter), we will examine the application of both information theory and Natural Language Processing (NLP) based techniques as a means of preprocessing for classification. Results demonstrate that both means allow for improved results in classification among highly idiosyncratic data (Twitter).","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125391775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Post-analysis of Keyword-Based Search Results Using Entity Mining, Linked Data, and Link Analysis at Query Time 使用实体挖掘、关联数据和查询时链接分析的基于关键字的搜索结果事后分析
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.11
P. Fafalios, Yannis Tzitzikas
The integration of the classical Web (of documents) with the emerging Web of Data is a challenging vision. In this paper we focus on an integration approach during searching which aims at enriching the responses of non-semantic search systems (e.g. professional search systems, web search engines) with semantic information, i.e. Linked Open Data (LOD), and exploiting the outcome for providing an overview of the search space and allowing the users (apart from restricting it) to explore the related LOD. We use named entities (e.g. persons, locations, etc.) as the "glue" for automatically connecting search hits with LOD. We consider a scenario where this entity-based integration is performed at query time with no human effort, and no a-priori indexing, which is beneficial in terms of configurability and freshness. To realize this scenario one has to tackle various challenges. One spiny issue is that the number of identified entities can be high, the same is true for the semantic information about these entities that can be fetched from the available LOD (i.e. their properties and associations with other entities). To this end, in this paper we propose a Link Analysis-based method which is used for (a) ranking (and thus selecting to show) the more important semantic information related to the search results, (b) deriving and showing top-K semantic graphs. In the sequel, we report the results of a survey regarding the marine domain with promising results, and comparative results that illustrate the effectiveness of the proposed (Page Rank-based) ranking scheme. Finally, we report experimental results regarding efficiency showing that the proposed functionality can be offered even at query time.
将经典的(文档的)Web与新兴的数据Web集成是一个具有挑战性的愿景。在本文中,我们专注于搜索过程中的集成方法,该方法旨在用语义信息(即链接开放数据(LOD))丰富非语义搜索系统(例如专业搜索系统,web搜索引擎)的响应,并利用结果提供搜索空间的概述,并允许用户(除了限制它)探索相关的LOD。我们使用命名实体(例如人员、位置等)作为“粘合剂”,将搜索结果与LOD自动连接起来。我们考虑这样一个场景,即在查询时执行基于实体的集成,无需人工操作,也无需先验索引,这在可配置性和新鲜度方面是有益的。要实现这一设想,我们必须应对各种挑战。一个棘手的问题是,已识别实体的数量可能很高,从可用LOD中获取这些实体的语义信息(即它们的属性和与其他实体的关联)也是如此。为此,在本文中,我们提出了一种基于链接分析的方法,该方法用于(a)对与搜索结果相关的更重要的语义信息进行排序(从而选择显示),(b)导出并显示top-K语义图。在续文中,我们报告了一项关于海洋领域的调查结果,结果很有希望,并比较了结果,说明了所提出的(基于页面排名的)排名方案的有效性。最后,我们报告了关于效率的实验结果,表明所提出的功能甚至可以在查询时提供。
{"title":"Post-analysis of Keyword-Based Search Results Using Entity Mining, Linked Data, and Link Analysis at Query Time","authors":"P. Fafalios, Yannis Tzitzikas","doi":"10.1109/ICSC.2014.11","DOIUrl":"https://doi.org/10.1109/ICSC.2014.11","url":null,"abstract":"The integration of the classical Web (of documents) with the emerging Web of Data is a challenging vision. In this paper we focus on an integration approach during searching which aims at enriching the responses of non-semantic search systems (e.g. professional search systems, web search engines) with semantic information, i.e. Linked Open Data (LOD), and exploiting the outcome for providing an overview of the search space and allowing the users (apart from restricting it) to explore the related LOD. We use named entities (e.g. persons, locations, etc.) as the \"glue\" for automatically connecting search hits with LOD. We consider a scenario where this entity-based integration is performed at query time with no human effort, and no a-priori indexing, which is beneficial in terms of configurability and freshness. To realize this scenario one has to tackle various challenges. One spiny issue is that the number of identified entities can be high, the same is true for the semantic information about these entities that can be fetched from the available LOD (i.e. their properties and associations with other entities). To this end, in this paper we propose a Link Analysis-based method which is used for (a) ranking (and thus selecting to show) the more important semantic information related to the search results, (b) deriving and showing top-K semantic graphs. In the sequel, we report the results of a survey regarding the marine domain with promising results, and comparative results that illustrate the effectiveness of the proposed (Page Rank-based) ranking scheme. Finally, we report experimental results regarding efficiency showing that the proposed functionality can be offered even at query time.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"753 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122978493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A Statistical Approach to Semantic Analysis for Chinese Terms 汉语术语语义分析的统计方法
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.47
Dongfeng Cai, Na Ye, Guiping Zhang, Yan Song
We propose a statistical semantic analysis method for Chinese terms. We use words, part-of-speech (POS) tags, word distances, word contexts and the first sememe of a word in HowNet as features to train a Support Vector Machine (SVM) model for analyzing term semantics. The model is used to identify dependencies embedded inside a term. A Conditional Random Field (CRF) model is used afterwards to incorporate the dependencies and experimental results showed the effectiveness and validity of our approach.
提出了一种汉语术语统计语义分析方法。我们在HowNet中使用单词、词性标签、单词距离、单词上下文和单词的第一个义素作为特征来训练支持向量机(SVM)模型来分析术语语义。该模型用于识别嵌入在术语中的依赖关系。然后使用条件随机场(CRF)模型来合并依赖关系,实验结果表明了我们的方法的有效性和有效性。
{"title":"A Statistical Approach to Semantic Analysis for Chinese Terms","authors":"Dongfeng Cai, Na Ye, Guiping Zhang, Yan Song","doi":"10.1109/ICSC.2014.47","DOIUrl":"https://doi.org/10.1109/ICSC.2014.47","url":null,"abstract":"We propose a statistical semantic analysis method for Chinese terms. We use words, part-of-speech (POS) tags, word distances, word contexts and the first sememe of a word in HowNet as features to train a Support Vector Machine (SVM) model for analyzing term semantics. The model is used to identify dependencies embedded inside a term. A Conditional Random Field (CRF) model is used afterwards to incorporate the dependencies and experimental results showed the effectiveness and validity of our approach.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126903012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Refinement of Ontology-Constrained Human Pose Classification 基于本体约束的人体姿态分类的改进
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.20
Kazuhiro Tashiro, Takahiro Kawamura, Y. Sei, Hiroyuki Nakagawa, Yasuyuki Tahara, Akihiko Ohsuga
In this paper, we propose an image classification method that recognizes several poses of idol photographs. The proposed method takes unannotated idol photos as input, and classifies them according to their poses based on spatial layouts of the idol in the photos. Our method has two phases, the first one is to estimate the spatial layout of ten body parts (head, torso, upper and lower arms and legs) using Eichner's Stickman Pose Estimation. The second one is to classify the poses of the idols using Bayesian Network classifiers. In order to improve accuracy of the classification, we introduce Pose Guide Ontology (PGO). PGO contains useful background knowledge, such as semantic hierarchies and constraints related to the positional relationship between the body parts. The location information of body parts is amended by PGO. We also propose iterative procedures for making further refinements of PGO. Finally, we evaluated our method on a dataset consisting of 400 images in 8 poses, and the final results indicated that F-measure of the classification has become 15% higher than non-amended results.
在本文中,我们提出了一种图像分类方法来识别多个姿势的偶像照片。该方法以未标注的偶像照片为输入,根据照片中偶像的空间布局,对其进行姿势分类。我们的方法有两个阶段,第一个阶段是使用Eichner的stick - man Pose Estimation来估计十个身体部位(头、躯干、上臂和下臂以及腿)的空间布局。二是使用贝叶斯网络分类器对偶像的姿势进行分类。为了提高分类精度,引入姿态引导本体(Pose Guide Ontology, PGO)。PGO包含有用的背景知识,例如与身体部位之间位置关系相关的语义层次和约束。身体部位的位置信息通过PGO进行修正。我们还提出了进一步改进PGO的迭代过程。最后,我们在一个包含8个姿态的400张图像的数据集上对我们的方法进行了评估,最终结果表明,分类的F-measure比未修正的结果提高了15%。
{"title":"Refinement of Ontology-Constrained Human Pose Classification","authors":"Kazuhiro Tashiro, Takahiro Kawamura, Y. Sei, Hiroyuki Nakagawa, Yasuyuki Tahara, Akihiko Ohsuga","doi":"10.1109/ICSC.2014.20","DOIUrl":"https://doi.org/10.1109/ICSC.2014.20","url":null,"abstract":"In this paper, we propose an image classification method that recognizes several poses of idol photographs. The proposed method takes unannotated idol photos as input, and classifies them according to their poses based on spatial layouts of the idol in the photos. Our method has two phases, the first one is to estimate the spatial layout of ten body parts (head, torso, upper and lower arms and legs) using Eichner's Stickman Pose Estimation. The second one is to classify the poses of the idols using Bayesian Network classifiers. In order to improve accuracy of the classification, we introduce Pose Guide Ontology (PGO). PGO contains useful background knowledge, such as semantic hierarchies and constraints related to the positional relationship between the body parts. The location information of body parts is amended by PGO. We also propose iterative procedures for making further refinements of PGO. Finally, we evaluated our method on a dataset consisting of 400 images in 8 poses, and the final results indicated that F-measure of the classification has become 15% higher than non-amended results.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114816816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MapReduce Design Patterns for Social Networking Analysis 用于社交网络分析的MapReduce设计模式
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.61
D. Ostrowski
The MapReduce paradigm has become ubiquitous within Big Data Analytics. Within this field, Social Networks exist as an important area of applications as it relies on the large scale analysis of graphs. To enable the scalability of Social Networks, we consider the application of MapReduce design patterns for the determination of graph-based metrics. Specifically, we detail the application of a MapReduce-based solution for the metric of betweenness-centrality. The prevailing concept is separation of the graph topology from the actual graph analysis. Here, we consider the chaining of MapReduce jobs for the estimation of shortest paths in a graph as well as post processing statistics. Through our design pattern, we are able to leverage Big Data Technologies to determine metrics in the context of ever expanding internet-based data resources.
MapReduce范式在大数据分析中已经无处不在。在这个领域中,社交网络作为一个重要的应用领域存在,因为它依赖于大规模的图分析。为了实现社交网络的可扩展性,我们考虑应用MapReduce设计模式来确定基于图的指标。具体来说,我们详细介绍了基于mapreduce的间中心性度量解决方案的应用。流行的概念是图拓扑与实际图分析的分离。在这里,我们考虑MapReduce作业的链接,以估计图中的最短路径以及后处理统计。通过我们的设计模式,我们能够利用大数据技术在不断扩展的基于互联网的数据资源的背景下确定指标。
{"title":"MapReduce Design Patterns for Social Networking Analysis","authors":"D. Ostrowski","doi":"10.1109/ICSC.2014.61","DOIUrl":"https://doi.org/10.1109/ICSC.2014.61","url":null,"abstract":"The MapReduce paradigm has become ubiquitous within Big Data Analytics. Within this field, Social Networks exist as an important area of applications as it relies on the large scale analysis of graphs. To enable the scalability of Social Networks, we consider the application of MapReduce design patterns for the determination of graph-based metrics. Specifically, we detail the application of a MapReduce-based solution for the metric of betweenness-centrality. The prevailing concept is separation of the graph topology from the actual graph analysis. Here, we consider the chaining of MapReduce jobs for the estimation of shortest paths in a graph as well as post processing statistics. Through our design pattern, we are able to leverage Big Data Technologies to determine metrics in the context of ever expanding internet-based data resources.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123879939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Robotic Creativity Driven by Motivation and Semantic Analysis 动机与语义分析驱动的机器人创造力
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.58
A. Augello, Ignazio Infantino, G. Pilato, R. Rizzo, Filippo Vella
The paper proposes a system architecture for artificial creativity that enables a robot to perform portraits. The proposed cognitive architecture is inspired by the PSI model, and it requires that the motivation of the robot in the execution of its tasks is influenced by urges. Such parameters depend on both internal and external evaluation mechanisms. The system is a premise for the development of an artificial artist able to develop a personality and a behavior that depends on its experience of successes and failures (competence), and the availability of different painting techniques (certainty). The creative execution is driven by the motivation arising from the urges, and the perception of the work being executed or performed. The external evaluation is obtained by analyzing the opinions expressed in natural language from people watching the realized portrait.
本文提出了一种人工创造力的系统架构,使机器人能够执行肖像。所提出的认知架构受到PSI模型的启发,它要求机器人在执行任务时的动机受到冲动的影响。这些参数取决于内部和外部评价机制。该系统是人工艺术家发展的前提,能够发展个性和行为,这取决于其成功和失败的经验(能力),以及不同绘画技术的可用性(确定性)。创造性的执行是由冲动和对正在执行或执行的工作的感知所产生的动机所驱动的。外部评价是通过分析人们观看已实现的画像时用自然语言表达的意见而得出的。
{"title":"Robotic Creativity Driven by Motivation and Semantic Analysis","authors":"A. Augello, Ignazio Infantino, G. Pilato, R. Rizzo, Filippo Vella","doi":"10.1109/ICSC.2014.58","DOIUrl":"https://doi.org/10.1109/ICSC.2014.58","url":null,"abstract":"The paper proposes a system architecture for artificial creativity that enables a robot to perform portraits. The proposed cognitive architecture is inspired by the PSI model, and it requires that the motivation of the robot in the execution of its tasks is influenced by urges. Such parameters depend on both internal and external evaluation mechanisms. The system is a premise for the development of an artificial artist able to develop a personality and a behavior that depends on its experience of successes and failures (competence), and the availability of different painting techniques (certainty). The creative execution is driven by the motivation arising from the urges, and the perception of the work being executed or performed. The external evaluation is obtained by analyzing the opinions expressed in natural language from people watching the realized portrait.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126340905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
A Neurobiologically Plausible Vector Symbolic Architecture 神经生物学上似是而非的向量符号架构
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.40
Daniel E. Padilla, M. McDonnell
Vector Symbolic Architectures (VSA) are approaches to representing symbols and structured combinations of symbols as high-dimensional vectors. They have applications in machine learning and for understanding information processing in neurobiology. VSAs are typically described in an abstract mathematical form in terms of vectors and operations on vectors. In this work, we show that a machine learning approach known as hierarchical temporal memory, which is based on the anatomy and function of mammalian neocortex, is inherently capable of supporting important VSA functionality. This follows because the approach learns sequences of semantics-preserving sparse distributed representations.
向量符号架构(VSA)是将符号和符号的结构化组合表示为高维向量的方法。它们在机器学习和理解神经生物学中的信息处理方面有应用。vsa通常以抽象的数学形式用向量和对向量的运算来描述。在这项工作中,我们展示了一种被称为分层时间记忆的机器学习方法,它基于哺乳动物新皮层的解剖和功能,本质上能够支持重要的VSA功能。这是因为该方法学习了保持语义的稀疏分布表示序列。
{"title":"A Neurobiologically Plausible Vector Symbolic Architecture","authors":"Daniel E. Padilla, M. McDonnell","doi":"10.1109/ICSC.2014.40","DOIUrl":"https://doi.org/10.1109/ICSC.2014.40","url":null,"abstract":"Vector Symbolic Architectures (VSA) are approaches to representing symbols and structured combinations of symbols as high-dimensional vectors. They have applications in machine learning and for understanding information processing in neurobiology. VSAs are typically described in an abstract mathematical form in terms of vectors and operations on vectors. In this work, we show that a machine learning approach known as hierarchical temporal memory, which is based on the anatomy and function of mammalian neocortex, is inherently capable of supporting important VSA functionality. This follows because the approach learns sequences of semantics-preserving sparse distributed representations.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123816045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Mining Semantic Structures from Syntactic Structures in Free Text Documents 从句法结构中挖掘自由文本文档中的语义结构
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.31
Hamid Mousavi, Deirdre Kerr, Markus R Iseli, C. Zaniolo
The Web has made possible many advanced text-mining applications, such as news summarization, essay grading, question answering, and semantic search. For many of such applications, statistical text-mining techniques are ineffective since they do not utilize the morphological structure of the text. Thus, many approaches use NLP-based techniques, that parse the text and use patterns to mine and analyze the parse trees which are often unnecessarily complex. Therefore, we propose a weighted-graph representation of text, called Text Graphs, which captures the grammatical and semantic relations between words and terms in the text. Text Graphs are generated using a new text mining framework which is the main focus of this paper. Our framework, SemScape, uses a statistical parser to generate few of the most probable parse trees for each sentence and employs a novel two-step pattern-based technique to extract from parse trees candidate terms and their grammatical relations. Moreover, SemScape resolves co references by a novel technique, generates domain-specific Text Graphs by consulting ontologies, and provides a SPARQL-like query language and an optimized engine for semantically querying and mining Text Graphs.
Web使许多高级文本挖掘应用程序成为可能,例如新闻摘要、论文评分、问题回答和语义搜索。对于许多这样的应用程序,统计文本挖掘技术是无效的,因为它们不利用文本的形态结构。因此,许多方法使用基于nlp的技术来解析文本,并使用模式来挖掘和分析解析树,这通常是不必要的复杂。因此,我们提出了文本的加权图表示,称为文本图,它捕获了文本中单词和术语之间的语法和语义关系。文本图的生成使用了一种新的文本挖掘框架,这是本文的主要关注点。我们的框架SemScape使用统计解析器为每个句子生成几个最可能的解析树,并采用一种新颖的基于模式的两步技术从解析树中提取候选术语及其语法关系。此外,SemScape通过一种新技术解析co引用,通过咨询本体生成特定于领域的文本图,并提供类似sparql的查询语言和用于语义查询和挖掘文本图的优化引擎。
{"title":"Mining Semantic Structures from Syntactic Structures in Free Text Documents","authors":"Hamid Mousavi, Deirdre Kerr, Markus R Iseli, C. Zaniolo","doi":"10.1109/ICSC.2014.31","DOIUrl":"https://doi.org/10.1109/ICSC.2014.31","url":null,"abstract":"The Web has made possible many advanced text-mining applications, such as news summarization, essay grading, question answering, and semantic search. For many of such applications, statistical text-mining techniques are ineffective since they do not utilize the morphological structure of the text. Thus, many approaches use NLP-based techniques, that parse the text and use patterns to mine and analyze the parse trees which are often unnecessarily complex. Therefore, we propose a weighted-graph representation of text, called Text Graphs, which captures the grammatical and semantic relations between words and terms in the text. Text Graphs are generated using a new text mining framework which is the main focus of this paper. Our framework, SemScape, uses a statistical parser to generate few of the most probable parse trees for each sentence and employs a novel two-step pattern-based technique to extract from parse trees candidate terms and their grammatical relations. Moreover, SemScape resolves co references by a novel technique, generates domain-specific Text Graphs by consulting ontologies, and provides a SPARQL-like query language and an optimized engine for semantically querying and mining Text Graphs.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127908836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Parameterized Spatial SQL Translation for Geographic Question Answering 地理问答的参数化空间SQL转换
Pub Date : 2014-06-16 DOI: 10.1109/ICSC.2014.44
Wei Chen
Spatial SQL (structured query language) is a powerful tool for systematically solving geographic problems, however, it has not been widely applied to the problem of geographic question answering. This paper introduces a parameterized approach to translate natural language geographic questions into spatial SQLs. In particular, three types of complexity are introduced and initial solutions are proposed to deal with these complexities. The entire parameterization process is implemented to generate spatial SQL templates for five types of geographic questions. It is suggested that our approach is useful for solving natural geographic problems using spatial functions such as those in a GIS.
空间SQL (structured query language,结构化查询语言)是系统解决地理问题的有力工具,但在地理问答问题中尚未得到广泛应用。本文介绍了一种将自然语言地理问题转换为空间sql的参数化方法。特别地,介绍了三种类型的复杂性,并提出了处理这些复杂性的初步解决方案。整个参数化过程实现为五种类型的地理问题生成空间SQL模板。这表明我们的方法对于利用GIS中的空间函数来解决自然地理问题是有用的。
{"title":"Parameterized Spatial SQL Translation for Geographic Question Answering","authors":"Wei Chen","doi":"10.1109/ICSC.2014.44","DOIUrl":"https://doi.org/10.1109/ICSC.2014.44","url":null,"abstract":"Spatial SQL (structured query language) is a powerful tool for systematically solving geographic problems, however, it has not been widely applied to the problem of geographic question answering. This paper introduces a parameterized approach to translate natural language geographic questions into spatial SQLs. In particular, three types of complexity are introduced and initial solutions are proposed to deal with these complexities. The entire parameterization process is implemented to generate spatial SQL templates for five types of geographic questions. It is suggested that our approach is useful for solving natural geographic problems using spatial functions such as those in a GIS.","PeriodicalId":175352,"journal":{"name":"2014 IEEE International Conference on Semantic Computing","volume":"89 1-3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116916946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
2014 IEEE International Conference on Semantic Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1