首页 > 最新文献

Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)最新文献

英文 中文
Challenge: Processing web texts for classifying job offers 挑战:处理网络文本来分类工作机会
F. Amato, R. Boselli, M. Cesarini, Fabio Mercorio, Mario Mezzanzanica, V. Moscato, Fabio Persia, A. Picariello
Today the Web represents a rich source of labour market data for both public and private operators, as a growing number of job offers are advertised through Web portals and services. In this paper we apply and compare several techniques, namely explicit-rules, machine learning, and LDA-based algorithms to classify a real dataset of Web job offers collected from 12 heterogeneous sources against a standard classification system of occupations.
今天,随着越来越多的工作机会通过门户网站和服务发布广告,网络为公共和私营运营商提供了丰富的劳动力市场数据来源。在本文中,我们应用并比较了几种技术,即显式规则、机器学习和基于lda的算法,将从12个异构来源收集的Web工作机会的真实数据集与标准的职业分类系统进行分类。
{"title":"Challenge: Processing web texts for classifying job offers","authors":"F. Amato, R. Boselli, M. Cesarini, Fabio Mercorio, Mario Mezzanzanica, V. Moscato, Fabio Persia, A. Picariello","doi":"10.1109/ICOSC.2015.7050852","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050852","url":null,"abstract":"Today the Web represents a rich source of labour market data for both public and private operators, as a growing number of job offers are advertised through Web portals and services. In this paper we apply and compare several techniques, namely explicit-rules, machine learning, and LDA-based algorithms to classify a real dataset of Web job offers collected from 12 heterogeneous sources against a standard classification system of occupations.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125633256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
Performance analysis of Ensemble methods on Twitter sentiment analysis using NLP techniques 基于NLP技术的集成方法在Twitter情感分析中的性能分析
M. Kanakaraj, R. R. Guddeti
Mining opinions and analyzing sentiments from social network data help in various fields such as even prediction, analyzing overall mood of public on a particular social issue and so on. This paper involves analyzing the mood of the society on a particular news from Twitter posts. The key idea of the paper is to increase the accuracy of classification by including Natural Language Processing Techniques (NLP) especially semantics and Word Sense Disambiguation. The mined text information is subjected to Ensemble classification to analyze the sentiment. Ensemble classification involves combining the effect of various independent classifiers on a particular classification problem. Experiments conducted demonstrate that ensemble classifier outperforms traditional machine learning classifiers by 3-5%.
从社交网络数据中挖掘意见和分析情绪有助于各个领域,例如甚至预测,分析公众对特定社会问题的整体情绪等等。这篇论文涉及到从Twitter帖子中分析社会对某一特定新闻的情绪。本文的核心思想是通过引入自然语言处理技术,特别是语义和词义消歧技术来提高分类的准确性。对挖掘的文本信息进行集成分类,进行情感分析。集成分类涉及将各种独立分类器对特定分类问题的影响结合起来。实验表明,集成分类器优于传统机器学习分类器3-5%。
{"title":"Performance analysis of Ensemble methods on Twitter sentiment analysis using NLP techniques","authors":"M. Kanakaraj, R. R. Guddeti","doi":"10.1109/ICOSC.2015.7050801","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050801","url":null,"abstract":"Mining opinions and analyzing sentiments from social network data help in various fields such as even prediction, analyzing overall mood of public on a particular social issue and so on. This paper involves analyzing the mood of the society on a particular news from Twitter posts. The key idea of the paper is to increase the accuracy of classification by including Natural Language Processing Techniques (NLP) especially semantics and Word Sense Disambiguation. The mined text information is subjected to Ensemble classification to analyze the sentiment. Ensemble classification involves combining the effect of various independent classifiers on a particular classification problem. Experiments conducted demonstrate that ensemble classifier outperforms traditional machine learning classifiers by 3-5%.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115631453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 96
Chinese enterprise knowledge graph construction based on Linked Data 基于关联数据的中国企业知识图谱构建
Qingliang Miao, Yao Meng, Bo Zhang
Enterprise knowledge graph is crucial for both enterprises and their management agencies. However, enterprise knowledge graph construction faces several challenges such as heterogeneous taxonomies, knowledge inconsistencies or conflicts and lack of semantic links. In this paper, we use Linked Data paradigm to construct enterprise knowledge graph by integrating heterogeneous enterprise data itself as well as link enterprise data with external data. Preliminary experiment on real world dataset shows the proposed approach is effective.
企业知识图谱对企业及其管理机构来说都是至关重要的。然而,企业知识图谱的构建面临着异构分类、知识不一致或冲突、缺乏语义链接等挑战。本文采用关联数据范式,通过集成异构企业数据本身构建企业知识图谱,并将企业数据与外部数据进行链接。在真实数据集上的初步实验表明,该方法是有效的。
{"title":"Chinese enterprise knowledge graph construction based on Linked Data","authors":"Qingliang Miao, Yao Meng, Bo Zhang","doi":"10.1109/ICOSC.2015.7050795","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050795","url":null,"abstract":"Enterprise knowledge graph is crucial for both enterprises and their management agencies. However, enterprise knowledge graph construction faces several challenges such as heterogeneous taxonomies, knowledge inconsistencies or conflicts and lack of semantic links. In this paper, we use Linked Data paradigm to construct enterprise knowledge graph by integrating heterogeneous enterprise data itself as well as link enterprise data with external data. Preliminary experiment on real world dataset shows the proposed approach is effective.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117128015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
FC-MST: Feature correlation maximum spanning tree for multimedia concept classification FC-MST:多媒体概念分类的特征相关最大生成树
Hsin-Yu Ha, Shu‐Ching Chen, Min Chen
Feature selection is an actively researched topic in varies domains, mainly owing to its ability in greatly reducing feature space and associated computational time. Given the explosive growth of high-dimensional multimedia data, a well-designed feature selection method can be leveraged in classifying multimedia contents into high-level semantic concepts. In this paper we present a multi-phase feature selection method using maximum spanning tree built from feature correlation among multiple modalities (FC-MST). The method aims to first thoroughly explore not only the correlation between features within and across modalities, but also the association of features towards semantic concepts. Secondly, with the correlations, we identify important features and exclude redundant or irrelevant ones. The proposed method is tested on a well-known benchmark multimedia data set called NUS-WIDE and the experimental results show that it outperforms four well-known feature selection methods in all three important measurement metrics.
特征选择是一个被广泛研究的领域,主要是因为它能够大大减少特征空间和相关的计算时间。鉴于高维多媒体数据的爆炸性增长,设计良好的特征选择方法可以用于将多媒体内容分类为高级语义概念。本文提出了一种基于多模态特征相关性构建的最大生成树的多阶段特征选择方法(FC-MST)。该方法旨在首先深入探索模态内部和跨模态特征之间的相关性,以及特征与语义概念之间的关联。其次,通过相关性,我们可以识别重要的特征并排除冗余或不相关的特征。在一个著名的多媒体基准数据集NUS-WIDE上进行了测试,实验结果表明,该方法在三个重要度量指标上都优于四种知名的特征选择方法。
{"title":"FC-MST: Feature correlation maximum spanning tree for multimedia concept classification","authors":"Hsin-Yu Ha, Shu‐Ching Chen, Min Chen","doi":"10.1109/ICOSC.2015.7050820","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050820","url":null,"abstract":"Feature selection is an actively researched topic in varies domains, mainly owing to its ability in greatly reducing feature space and associated computational time. Given the explosive growth of high-dimensional multimedia data, a well-designed feature selection method can be leveraged in classifying multimedia contents into high-level semantic concepts. In this paper we present a multi-phase feature selection method using maximum spanning tree built from feature correlation among multiple modalities (FC-MST). The method aims to first thoroughly explore not only the correlation between features within and across modalities, but also the association of features towards semantic concepts. Secondly, with the correlations, we identify important features and exclude redundant or irrelevant ones. The proposed method is tested on a well-known benchmark multimedia data set called NUS-WIDE and the experimental results show that it outperforms four well-known feature selection methods in all three important measurement metrics.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117188327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Multi-cloud policy enforcement through semantic modeling and mapping 通过语义建模和映射实施多云策略
Zhengping Wu
In today's cloud service market, different providers have very different low-level mechanisms to accommodate various types of policies from their users. Enforcement of policies over multiple cloud provider domains is an intrinsically complex problem for both sides. In reality, cloud providers have to either manually update enforcement mechanisms or negotiate adjusted policies with their users for enforcement. To save these high-cost and error-prone manual updates or adjustments, an automatic and flexible solution is desired. This paper proposes a semantic modeling and mapping based approach to help enforce high-level user policies across cloud domain boundaries when applications or IT operations have to span over multiple cloud domains. This approach creates policy models and maps these models across cloud domain boundaries automatically or semi-automatically. Policy rules following these mappings can be tied to multiple enforcement mechanisms in different cloud domains. If a rule cannot be mapped, a manual adjustment solution will be suggested. A case study is also included to demonstrate the efficiency and accuracy of this approach.
在当今的云服务市场中,不同的提供商有非常不同的底层机制来适应来自其用户的各种类型的策略。在多个云提供商域上实施策略对双方来说本质上都是一个复杂的问题。实际上,云提供商必须手动更新执行机制,或者与用户协商调整后的策略来执行。为了节省这些高成本和容易出错的手动更新或调整,需要一个自动和灵活的解决方案。本文提出了一种基于语义建模和映射的方法,以帮助在应用程序或IT操作必须跨越多个云域时跨云域边界执行高级用户策略。这种方法创建策略模型,并自动或半自动地跨云域边界映射这些模型。遵循这些映射的策略规则可以绑定到不同云域中的多个执行机制。如果不能映射规则,则建议手动调整解决方案。最后,通过实例分析,验证了该方法的有效性和准确性。
{"title":"Multi-cloud policy enforcement through semantic modeling and mapping","authors":"Zhengping Wu","doi":"10.1109/ICOSC.2015.7050849","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050849","url":null,"abstract":"In today's cloud service market, different providers have very different low-level mechanisms to accommodate various types of policies from their users. Enforcement of policies over multiple cloud provider domains is an intrinsically complex problem for both sides. In reality, cloud providers have to either manually update enforcement mechanisms or negotiate adjusted policies with their users for enforcement. To save these high-cost and error-prone manual updates or adjustments, an automatic and flexible solution is desired. This paper proposes a semantic modeling and mapping based approach to help enforce high-level user policies across cloud domain boundaries when applications or IT operations have to span over multiple cloud domains. This approach creates policy models and maps these models across cloud domain boundaries automatically or semi-automatically. Policy rules following these mappings can be tied to multiple enforcement mechanisms in different cloud domains. If a rule cannot be mapped, a manual adjustment solution will be suggested. A case study is also included to demonstrate the efficiency and accuracy of this approach.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124707988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A syntactic approach for aspect based opinion mining 基于方面的意见挖掘的句法方法
T. C. Chinsha, Shibily Joseph
Opinion mining or sentiment analysis is the process of analysing the text about a topic written in a natural language and classify them as positive negative or neutral based on the humans sentiments, emotions, opinions expressed in it. Nowadays, the opinions expressed through reviews are increasing day by day on the web. It is practically impossible to analyse and extract opinions from such huge number of reviews manually. To solve this problem an automated opinion mining approach is needed. This task of automatic opinion mining can be done mainly at three different levels, which are document level, sentence level and aspect level. Most of the previous work is in the field of document or sentence level opinion mining. This paper focus on aspect level opinion mining and propose a new syntactic based approach for it, which uses syntactic dependency, aggregate score of opinion words, SentiWordNet and aspect table together for opinion mining. The experimental work was done on restaurant reviews. The dataset of restaurant reviews was collected from web and tagged manually. The proposed method achieved total accuracy of 78.04% on the annotated test set. The method was also compared with the method, which uses Part-Of-Speech tagger for feature extraction; the obtained results show that the proposed method gives 6% more accuracy than previous one on the annotated test set.
观点挖掘或情感分析是分析用自然语言写的关于某个主题的文本,并根据其中表达的人类情感、情绪、观点将其分类为积极、消极或中立的过程。如今,网络上通过评论表达的观点日益增多。从如此庞大的评论中手动分析和提取意见实际上是不可能的。为了解决这个问题,需要一种自动化的意见挖掘方法。该自动意见挖掘任务主要在三个不同的层次上完成,即文档层、句子层和方面层。以前的工作大多是在文档级或句子级的意见挖掘领域。本文以方面级意见挖掘为研究对象,提出了一种新的基于句法的意见挖掘方法,该方法将句法依赖性、意见词总分、SentiWordNet和方面表结合起来进行意见挖掘。实验工作是在餐馆评论上完成的。餐厅评论的数据集是从网上收集的,并手动标记。该方法在标注测试集上的总准确率为78.04%。并与使用词性标注器进行特征提取的方法进行了比较;结果表明,该方法在带注释的测试集上的准确率比之前的方法提高了6%。
{"title":"A syntactic approach for aspect based opinion mining","authors":"T. C. Chinsha, Shibily Joseph","doi":"10.1109/ICOSC.2015.7050774","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050774","url":null,"abstract":"Opinion mining or sentiment analysis is the process of analysing the text about a topic written in a natural language and classify them as positive negative or neutral based on the humans sentiments, emotions, opinions expressed in it. Nowadays, the opinions expressed through reviews are increasing day by day on the web. It is practically impossible to analyse and extract opinions from such huge number of reviews manually. To solve this problem an automated opinion mining approach is needed. This task of automatic opinion mining can be done mainly at three different levels, which are document level, sentence level and aspect level. Most of the previous work is in the field of document or sentence level opinion mining. This paper focus on aspect level opinion mining and propose a new syntactic based approach for it, which uses syntactic dependency, aggregate score of opinion words, SentiWordNet and aspect table together for opinion mining. The experimental work was done on restaurant reviews. The dataset of restaurant reviews was collected from web and tagged manually. The proposed method achieved total accuracy of 78.04% on the annotated test set. The method was also compared with the method, which uses Part-Of-Speech tagger for feature extraction; the obtained results show that the proposed method gives 6% more accuracy than previous one on the annotated test set.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121648619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 74
A study to assess and enhance educational specific search on web for school children 一项评估及加强学童在网上进行教育专题搜寻的研究
S. Gaurav, Y. Jithendranath, Aruna Adil, Sudhakar Yadav, B. Kasturi
Today, search engines play a vital role in accessing the online content. However, the data in the webpages are not clearly perceived by search engines. As a result, it provides a lot of irrelevant data with little desired information. In addition, it takes a lot of time in searching the appropriate result. By studying the online educational needs of Indian school children, we aim to retrieve appropriate educational information in less time through effective search. Schema.org [5] is a collection of markups which helps webmasters to mark up the webpages for retrieval of relevant information. But, properties related to education are not covered completely. Learning Resource Metadata Initiative (LRMI) [9] has created few properties for education and added in schema.org. We map our study with LRMI's work, and propose some new properties as an extension to the schema, which can be useful for students and teachers.
今天,搜索引擎在访问在线内容方面起着至关重要的作用。然而,网页中的数据并没有被搜索引擎清楚地感知到。因此,它提供了大量不相关的数据和很少需要的信息。此外,在搜索适当的结果时需要花费大量时间。通过研究印度学龄儿童的在线教育需求,我们的目标是通过有效的搜索,在更短的时间内检索到合适的教育信息。Schema.org[5]是一个标记集合,它可以帮助网站管理员标记网页,以便检索相关信息。但是,与教育相关的属性并没有完全涵盖。学习资源元数据倡议(LRMI)[9]为教育创建了一些属性,并添加到schema.org中。我们将我们的研究与LRMI的工作相结合,并提出了一些新的属性作为模式的扩展,这对学生和教师都很有用。
{"title":"A study to assess and enhance educational specific search on web for school children","authors":"S. Gaurav, Y. Jithendranath, Aruna Adil, Sudhakar Yadav, B. Kasturi","doi":"10.1109/ICOSC.2015.7050816","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050816","url":null,"abstract":"Today, search engines play a vital role in accessing the online content. However, the data in the webpages are not clearly perceived by search engines. As a result, it provides a lot of irrelevant data with little desired information. In addition, it takes a lot of time in searching the appropriate result. By studying the online educational needs of Indian school children, we aim to retrieve appropriate educational information in less time through effective search. Schema.org [5] is a collection of markups which helps webmasters to mark up the webpages for retrieval of relevant information. But, properties related to education are not covered completely. Learning Resource Metadata Initiative (LRMI) [9] has created few properties for education and added in schema.org. We map our study with LRMI's work, and propose some new properties as an extension to the schema, which can be useful for students and teachers.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132196765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Aggregating financial services data without assumptions: A semantic data reference architecture 聚合没有假设的金融服务数据:语义数据参考体系结构
Sunila Gollapudi
We are seeing a sea change down the pike in terms of financial information aggregation and consumption; this could potentially be a game changer in financial services space with focus on ability to commoditize data. Financial Services Industry deals with a tremendous amount of data that varies in its structure, volume and purpose. The data is generated in the ecosystem (its customers, its own accounts, partner trades, securities transactions etc.), is handled by many systems - each having its own perspective. Front-office systems handle transactional behavior of the data, middle office systems which typically work with a drop-copy of the data subject it to intense processing, business logic, computations (such as inventory positions, fee calculations, commissions) and the back office systems deal with reconciliation, cleansing, exception management etc. Then there are the analytic systems which are concerned with auditing, compliance reporting as well as business analytics. Data that flows through this ecosystem gets aggregated, transformed, and transported time and again. Traditional approaches to managing such data leverage Extract-Transform-Load (ETL) technologies to set up data marts where each data mart serves a specific purpose (such as reconciliation or analytics). The result is proliferation of transformations and marts in the Organization. The need is to have architectures and IT systems that can aggregate data from many such sources without making any assumptions on HOW, WHERE or WHEN this data will be used. The incoming data is semantically annotated and stored in the triple store within storage tier and offers the ability to store, query and draw inferences using the ontology. There is a probable need for a Big Data Solution here that helps ease data liberation and co-location. This paper is a summary of one such business case of the Financial Services Industry where traditional ETL silos was broken to support the structurally dynamic, ever expanding and changing data usage needs employing Ontology and Semantic techniques like RDF/RDFS, SPARQL, OWL and related stack.
我们看到金融信息的聚合和消费正在发生翻天覆地的变化;这可能会改变金融服务领域的游戏规则,重点是数据商品化的能力。金融服务业处理大量的数据,这些数据在结构、数量和用途上都有所不同。数据是在生态系统中生成的(客户、自己的账户、合作伙伴交易、证券交易等),由许多系统处理——每个系统都有自己的视角。前台系统处理数据的事务行为,中台系统通常处理数据的副本,并对其进行密集的处理、业务逻辑、计算(如库存位置、费用计算、佣金),后台系统处理对账、清理、异常管理等。然后是与审计、遵从性报告以及业务分析有关的分析系统。流经这个生态系统的数据被一次又一次地聚合、转换和传输。管理此类数据的传统方法利用提取-转换-加载(Extract-Transform-Load, ETL)技术来设置数据集市,其中每个数据集市都有特定的用途(如协调或分析)。其结果是本组织内变革和市场的扩散。需要的是架构和IT系统能够聚合来自许多此类来源的数据,而无需对如何、在何处或何时使用这些数据进行任何假设。输入的数据经过语义标注并存储在存储层内的三重存储中,并提供使用本体存储、查询和推断的功能。这里可能需要一个大数据解决方案来帮助简化数据解放和托管。本文总结了金融服务行业的一个这样的业务案例,在这个案例中,传统的ETL竖井被打破,以支持结构动态的、不断扩展和变化的数据使用需求,采用了本体和语义技术,如RDF/RDFS、SPARQL、OWL和相关堆栈。
{"title":"Aggregating financial services data without assumptions: A semantic data reference architecture","authors":"Sunila Gollapudi","doi":"10.1109/ICOSC.2015.7050825","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050825","url":null,"abstract":"We are seeing a sea change down the pike in terms of financial information aggregation and consumption; this could potentially be a game changer in financial services space with focus on ability to commoditize data. Financial Services Industry deals with a tremendous amount of data that varies in its structure, volume and purpose. The data is generated in the ecosystem (its customers, its own accounts, partner trades, securities transactions etc.), is handled by many systems - each having its own perspective. Front-office systems handle transactional behavior of the data, middle office systems which typically work with a drop-copy of the data subject it to intense processing, business logic, computations (such as inventory positions, fee calculations, commissions) and the back office systems deal with reconciliation, cleansing, exception management etc. Then there are the analytic systems which are concerned with auditing, compliance reporting as well as business analytics. Data that flows through this ecosystem gets aggregated, transformed, and transported time and again. Traditional approaches to managing such data leverage Extract-Transform-Load (ETL) technologies to set up data marts where each data mart serves a specific purpose (such as reconciliation or analytics). The result is proliferation of transformations and marts in the Organization. The need is to have architectures and IT systems that can aggregate data from many such sources without making any assumptions on HOW, WHERE or WHEN this data will be used. The incoming data is semantically annotated and stored in the triple store within storage tier and offers the ability to store, query and draw inferences using the ontology. There is a probable need for a Big Data Solution here that helps ease data liberation and co-location. This paper is a summary of one such business case of the Financial Services Industry where traditional ETL silos was broken to support the structurally dynamic, ever expanding and changing data usage needs employing Ontology and Semantic techniques like RDF/RDFS, SPARQL, OWL and related stack.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114791171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Enriching mobile semantic search with web services 用web服务丰富移动语义搜索
Minjae Song, Sungkwang Eom, Sangjin Shin, Kyong-Ho Lee
With the increasing number of mobile devices, there have been many researches on searching and managing a large volume of mobile data. Most of the mobile platforms today provide users with keyword-based full text search (FTS) in order to search for mobile data. Recently, voice search interfaces have been deployed. These search methods, however, query only the keywords given as an input to local databases in mobile devices. Therefore, it is quite difficult to figure out and to provide what a user really wants. To overcome this limitation, we propose a semantic search method for mobile platforms. The proposed method augments the results of semantic search on local databases with their related useful Web information according to the intention and context information of a user. Although there are various semantic search techniques, it is hard to apply the existing methods to mobile devices due to the characteristics of mobile devices such as isolated database structures and limited computing resources. To enable semantic search on mobile devices, we also propose a lightweight mobile ontology. The proposed mobile ontology is also aligned with related Web information to enrich search results. Experimental results from prototype implementation of the proposed method verify that our approach provides more accurate results than the conventional FTS does. In addition, the proposed method shows an acceptable amount of response time and battery consumption.
随着移动设备数量的不断增加,大量移动数据的搜索和管理已经成为人们研究的热点。目前,大多数移动平台都为用户提供基于关键字的全文搜索(FTS),以便搜索移动数据。最近部署了语音搜索界面。然而,这些搜索方法只查询作为移动设备本地数据库输入的关键字。因此,很难弄清楚并提供用户真正想要的东西。为了克服这一限制,我们提出了一种面向移动平台的语义搜索方法。该方法根据用户的意图和上下文信息,对本地数据库的语义搜索结果进行扩充,使其具有相关的有用Web信息。虽然有各种各样的语义搜索技术,但由于移动设备数据库结构孤立、计算资源有限等特点,现有的方法很难应用于移动设备。为了在移动设备上实现语义搜索,我们还提出了一个轻量级的移动本体。提议的移动本体还与相关的Web信息保持一致,以丰富搜索结果。实验结果表明,该方法比传统的傅立叶变换方法提供了更精确的结果。此外,所提出的方法显示了可接受的响应时间和电池消耗量。
{"title":"Enriching mobile semantic search with web services","authors":"Minjae Song, Sungkwang Eom, Sangjin Shin, Kyong-Ho Lee","doi":"10.1109/ICOSC.2015.7050850","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050850","url":null,"abstract":"With the increasing number of mobile devices, there have been many researches on searching and managing a large volume of mobile data. Most of the mobile platforms today provide users with keyword-based full text search (FTS) in order to search for mobile data. Recently, voice search interfaces have been deployed. These search methods, however, query only the keywords given as an input to local databases in mobile devices. Therefore, it is quite difficult to figure out and to provide what a user really wants. To overcome this limitation, we propose a semantic search method for mobile platforms. The proposed method augments the results of semantic search on local databases with their related useful Web information according to the intention and context information of a user. Although there are various semantic search techniques, it is hard to apply the existing methods to mobile devices due to the characteristics of mobile devices such as isolated database structures and limited computing resources. To enable semantic search on mobile devices, we also propose a lightweight mobile ontology. The proposed mobile ontology is also aligned with related Web information to enrich search results. Experimental results from prototype implementation of the proposed method verify that our approach provides more accurate results than the conventional FTS does. In addition, the proposed method shows an acceptable amount of response time and battery consumption.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122157489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SemRank: Semantic rank learning for multimedia retrieval 语义秩:多媒体检索的语义秩学习
David Etter, C. Domeniconi
Multimedia retrieval suffers from the lack of common feature representation between a text based query and the visual content of a video repository. One approach to bridging this representation gap is known as query-by-concept, where a query and video are mapped into a common semantic feature space. One of the challenges with using semantic concepts for multimedia retrieval, is that the available vocabulary size is generally not sufficient for representing the content of the query and video. In addition, the lack of training data and visual feature representation often leads to low precision models. In this work, we explore the use of a query-by-concept approach for the multimedia Known Item Search (KIS) problem. We propose a semantic rank learning model, called SemRank, to overcome the challenges of the vocabulary size and lack of training data. First, we construct a semantic fusion model to combine the output from many noisy classifiers. Next, we train a gradient boosted regression tree model, using a semantic feature space derived from the query, video, and query-video similarity. Our approach is evaluated over a large internet video repository, and the results show that query-by-concept can be an effective model for multimedia KIS.
多媒体检索在基于文本的查询和视频存储库的可视内容之间缺乏共同的特征表示。弥合这种表示差距的一种方法是按概念查询,其中将查询和视频映射到公共语义特征空间。使用语义概念进行多媒体检索的挑战之一是可用的词汇表大小通常不足以表示查询和视频的内容。此外,缺乏训练数据和视觉特征表示往往导致模型精度低。在这项工作中,我们探索了多媒体已知项目搜索(KIS)问题中按概念查询方法的使用。我们提出了一种语义排名学习模型,称为SemRank,以克服词汇量大小和缺乏训练数据的挑战。首先,我们构建了一个语义融合模型,将多个噪声分类器的输出组合在一起。接下来,我们使用从查询、视频和查询-视频相似度派生的语义特征空间训练梯度增强回归树模型。我们的方法在一个大型互联网视频库上进行了评估,结果表明按概念查询可以作为多媒体KIS的有效模型。
{"title":"SemRank: Semantic rank learning for multimedia retrieval","authors":"David Etter, C. Domeniconi","doi":"10.1109/ICOSC.2015.7050778","DOIUrl":"https://doi.org/10.1109/ICOSC.2015.7050778","url":null,"abstract":"Multimedia retrieval suffers from the lack of common feature representation between a text based query and the visual content of a video repository. One approach to bridging this representation gap is known as query-by-concept, where a query and video are mapped into a common semantic feature space. One of the challenges with using semantic concepts for multimedia retrieval, is that the available vocabulary size is generally not sufficient for representing the content of the query and video. In addition, the lack of training data and visual feature representation often leads to low precision models. In this work, we explore the use of a query-by-concept approach for the multimedia Known Item Search (KIS) problem. We propose a semantic rank learning model, called SemRank, to overcome the challenges of the vocabulary size and lack of training data. First, we construct a semantic fusion model to combine the output from many noisy classifiers. Next, we train a gradient boosted regression tree model, using a semantic feature space derived from the query, video, and query-video similarity. Our approach is evaluated over a large internet video repository, and the results show that query-by-concept can be an effective model for multimedia KIS.","PeriodicalId":126701,"journal":{"name":"Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127805088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1