2009 International Conference on Information Management and Engineering: Latest Articles

The Factors Affecting the Performance of Data Fusion Algorithms

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.44

M. Nassar, G. Kanaan

The enormous amount of data distributed across the World Wide Web can be very useful if users are able to access it easily and appropriately; search engines help users find what they need in this mass of data. Metasearch is the application of data fusion to document retrieval. A metasearch engine takes as input the N ranked lists output by each of N search engines in response to a given query. As output, it computes a single ranked list, which is hopefully an improvement over any input list as measured by standard information retrieval performance metrics such as mean average precision (MAP). Our goal in this paper is to answer the following question: what are the factors affecting the performance of data fusion algorithms? The motivation for presenting these factors is the absence of a single source in the literature that covers all of them in an organized and complete manner. This work is needed to integrate the research findings on data fusion performance. The paper contributes to the data fusion literature in two ways. First, it presents all the factors affecting the performance of data fusion algorithms in an organized and complete manner. Second, it offers recommendations on how and when to deal with those factors.
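The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not name a specific fusion algorithm, so the Borda-count scoring below is a swapped-in, well-known technique chosen only for illustration; the function names and the toy document ids are assumptions. Average precision for a single query is included because MAP is the mean of this quantity over a set of queries.

```python
from collections import defaultdict

def borda_fuse(ranked_lists):
    """Fuse ranked lists with a Borda-count style score (illustrative choice)."""
    # Every document returned by any engine is a candidate.
    candidates = {doc for ranking in ranked_lists for doc in ranking}
    n = len(candidates)
    scores = defaultdict(float)
    for ranking in ranked_lists:
        # A document at 0-based position p earns n - p points.
        for position, doc in enumerate(ranking):
            scores[doc] += n - position
        # Documents this engine did not return share the leftover points equally.
        missing = candidates - set(ranking)
        if missing:
            leftover = sum(range(1, n - len(ranking) + 1))
            for doc in missing:
                scores[doc] += leftover / len(missing)
    # Highest score first; ties are broken by document id for determinism.
    return sorted(candidates, key=lambda d: (-scores[d], d))

def average_precision(ranking, relevant):
    """Average precision of one ranked list against a set of relevant documents."""
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

# Hypothetical outputs of three search engines for one query.
engine_lists = [["a", "b", "c"], ["b", "a", "d"], ["b", "c", "a"]]
fused = borda_fuse(engine_lists)
```

With these toy lists, `fused` ranks `b` first because two of the three engines place it at the top. Whether a fused list actually beats every input list depends on the data, which is the kind of question the paper's factors address.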

Citations: 7

Joint Optimization for Knowledge Mining: Evaluating Parameters of Manufacturing Processes

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.119

C.X.H. Tang, H. Lau

In many kinds of manufacturing, predicting the influence of process parameters on machine performance is necessary, as these parameters may have a serious impact on product quality as well as on the probability of machine failure. To address this issue, this paper presents a novel knowledge-based algorithm, embedded with Artificial Intelligence, for evaluating the overall suitability of adopting the predicted control parameters suggested by domain experts. The originality of this research is that the proposed knowledge-based system is equipped with a fuzzy-guided genetic algorithm, enabling the identification of the best set of process parameters. A simulation using the RIE machine is provided to validate the practicability of the proposed approach.

Citations: 0

Evaluation of Prepayment and Default Behaviour of Mortgage Customers: With a Case Study of the Banking Industry in Taiwan

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.116

Shuo-fen Hsu, Po-Sheng Ko, Cheng-Chung Wu

This study analyzes the mortgage loans of five Taiwanese commercial banks to identify the key factors that influence prepayments and defaults. Using a total of 16,215 mortgage loan records from these banks covering 2002 through 2007, the study first conducts logistic regression to analyze prepayment and default behavior. As far as overall predictability is concerned, the paper finds that the logistic regression model is able to provide simplified results in the measurement of model variables concerning defaults and prepayments.

Citations: 0

Speed up the Search in Bitmap Based Compressed Sparse Arrays

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.43

J. Zalaket

MOLAP (multidimensional OLAP) systems store data as cubes in multidimensional arrays. Data cubes can be sparse, which slows down the performance of MOLAP systems and requires unnecessary additional storage. Many compression algorithms have been introduced to deal with the sparsity of MOLAP data cubes. In this paper we present a new compression algorithm based on the bitmap compression technique. Instead of the linear structure used by the classical bitmap, we use a balanced tree structure to store the compressed data in order to reduce the search time. We demonstrate that our algorithm performs a search in the compressed structure in logarithmic time, improving on the linear time needed by classical bitmap compression methods. Finally, we show empirical results in which the proposed algorithm has been tested over multiple datasets and compared to the classical bitmap algorithm.
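The contrast between linear and logarithmic search can be sketched as follows. This is not the paper's balanced tree structure, whose details the abstract does not give; as a stand-in, the second class keeps the positions of non-empty cells in a sorted list and searches it by bisection, which also gives logarithmic lookup on static data. Both class names and the sample cells are assumptions made for the example.

```python
from bisect import bisect_left

class BitmapSparseArray:
    """Classical layout: one bit per cell plus a dense list of non-empty values."""

    def __init__(self, cells):
        self.bitmap = [0 if v is None else 1 for v in cells]
        self.values = [v for v in cells if v is not None]

    def get(self, index):
        if not self.bitmap[index]:
            return None
        # Linear step: count the set bits before `index` to locate the value slot.
        return self.values[sum(self.bitmap[:index])]

class SortedPositionSparseArray:
    """Stand-in for a tree index: sorted non-empty positions searched by bisection."""

    def __init__(self, cells):
        self.positions = [i for i, v in enumerate(cells) if v is not None]
        self.values = [v for v in cells if v is not None]

    def get(self, index):
        # Logarithmic step: binary search over the sorted positions.
        slot = bisect_left(self.positions, index)
        if slot < len(self.positions) and self.positions[slot] == index:
            return self.values[slot]
        return None

# A hypothetical sparse row of a data cube; None marks an empty cell.
cells = [None, 5, None, None, 7, None, 9]
linear = BitmapSparseArray(cells)
logarithmic = SortedPositionSparseArray(cells)
```

Both structures return the same values for every index. A balanced tree, as used in the paper, would additionally keep insertions and deletions cheap, which a sorted list does not.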

Citations: 2

Quality Model for Web Services from Multi-stakeholders' Perspective

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.11

Zain Balfagih, M. Hassan

In the current implementation of SOA using web services, where different services can provide the same functionality for consumers, the quality of service (QoS) of these services is essential in determining the most suitable services for consumers. Although QoS for web services has attracted wide attention in the literature over the past few years, most current efforts do not consider the multi-stakeholder nature of web services. Unlike applications in traditional software paradigms, SOA applications are owned, developed, and/or used by different stakeholders. Those stakeholders are the provider, consumer, developer, and broker. As a result, non-functional requirements differ from one stakeholder to another. This paper presents a quality model that classifies non-functional characteristics based on the different stakeholders' requirements. In the discussion of the stakeholders we focus on the developer, provider, and consumer. The model also presents some metrics for these characteristics. We argue that this model gives a better view of SOA and web services quality requirements from different perspectives.

Citations: 47

CPB: A Model for Biclustering

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.48

Debahuti Mishra, A. Rath

Mining biclusters that exhibit both consistent trends and trends with similar degrees of fluctuation is vital to bioinformatics research. However, existing biclustering methods are not very efficient or effective at mining such biclusters. Most biclustering models, including those used in subspace clustering, define similarity among different objects by distances over either all or only a subset of dimensions in gene expression data. However, distance functions are not always adequate for capturing correlations among the objects. In fact, strong correlations may still exist among a set of objects even if they are far apart from each other as measured by the distance function. Under the CPB (Coherent Pattern Biclustering) model that we propose, two objects are similar if they exhibit a coherent pattern on a subset of dimensions. For instance, in DNA microarray analysis, the expression levels of two genes may rise or fall synchronously in response to a set of environmental stimuli. Though the magnitudes of their expression levels may not be close, the patterns they exhibit can be very similar. Our proposed model aims to find such coherent patterns in biclusters of genes, in keeping with the general understanding that many genes participate in multiple different biological processes.
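The point that two objects can be far apart by distance yet strongly correlated can be checked on a toy example. The abstract does not define CPB's coherence measure, so Pearson correlation is used below purely as a familiar stand-in; the two expression profiles are made-up values, not data from the paper.

```python
from math import sqrt

def euclidean(a, b):
    """Euclidean distance between two equal-length profiles."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)

# Hypothetical expression levels of two genes under four conditions.
gene_a = [1.0, 3.0, 2.0, 5.0]
gene_b = [101.0, 103.0, 102.0, 105.0]  # gene_a shifted up by 100

distance = euclidean(gene_a, gene_b)   # large: the magnitudes differ
correlation = pearson(gene_a, gene_b)  # close to 1: the pattern is identical
```

A distance-based model would treat these two genes as dissimilar, while a pattern-based view groups them, since they rise and fall together across all four conditions.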

Citations: 3

An Approach to Tracking Data Lineage in Mediator Based Information Integration Systems

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.75

Keramat Hassani, R. Roustaei, H. Zafari, E. Zohrevandi, M. Shiri, O. M. Talab

The problem of providing an explanation for a query answer is referred to as lineage tracing. This problem has been studied extensively in data warehouse systems, but for mediator-based systems it is still identified as a research problem. In such a system, the mediator does not store data. This means that for query processing as well as for tracing, the mediator has to communicate with the data sources, and this communication could be expensive or impossible. To resolve this, we clearly define forward lineage tracing and show its properties. We propose a tracing method that computes data lineage without storing any data and effectively supports aggregation and variable-granularity lineage. We also illustrate that our method is more efficient than methods that compute the lineage by executing the reverse query.

Citations: 3

PersistOM: An Objects-to-Multivalued Database Mapping Mechanism

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.10

Fuguo Wei, S. Lee

Most modern business applications are developed using object technology such as Java to build the application software and a relational or multivalued database to store data. Nevertheless, an impedance mismatch exists between objects and data store mechanisms. Hence, object persistence has become a necessary practice for mapping database records into objects for further in-memory processing. However, most research focuses on object-to-relational database mapping, and very little focuses on object-to-multivalued database (O/M) mapping. Moreover, the few existing O/M mapping mechanisms are either hard to extend or difficult to manage. This paper presents the design of an O/M mapping mechanism called PersistOM using design patterns. A set of design patterns has been applied to make PersistOM easy to extend and reuse. The Layers architectural pattern was applied to structure the whole mapping mechanism, ensuring that each mapping sub-layer sits at a particular level of abstraction. Simulation results show that PersistOM not only shortens the overall development period but is also comparatively easy to modify and extend.

Citations: 1

Prediction of Sunspot Series Using BiLinear Recurrent Neural Network

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.90

Dong-Chul Park, Dong-Min Woo

A prediction scheme for sunspot series using a BiLinear Recurrent Neural Network (BLRNN) is proposed in this paper. Since the BLRNN is based on the bilinear polynomial, it has been successfully used in modeling highly nonlinear systems with time-series characteristics, and it can be a natural choice for predicting sunspot series. The performance of the proposed BLRNN-based predictor is evaluated and compared with that of a conventional MultiLayer Perceptron Type Neural Network (MLPNN)-based predictor. Experiments are conducted on the Wolf sunspot number series. The results show that the proposed BLRNN-based predictor outperforms the MLPNN-based one in terms of the Normalized Mean Squared Error (NMSE).
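For readers unfamiliar with the metric, the sketch below computes NMSE under one common definition: the sum of squared prediction errors divided by the sum of squared deviations of the actual series from its mean. The abstract does not state which normalization the authors use, so this definition is an assumption, and the sample numbers are made up.

```python
def nmse(actual, predicted):
    """Normalized mean squared error (assumed definition: error / variance of actual)."""
    n = len(actual)
    mean = sum(actual) / n
    squared_error = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    total_variation = sum((a - mean) ** 2 for a in actual)
    return squared_error / total_variation

# Toy series, not sunspot data.
series = [1.0, 2.0, 3.0]
perfect_score = nmse(series, [1.0, 2.0, 3.0])   # a perfect predictor
baseline_score = nmse(series, [2.0, 2.0, 2.0])  # always predicting the mean
```

Under this definition a perfect predictor scores 0 and a predictor that always outputs the series mean scores 1, so lower values are better and values below 1 beat the trivial baseline.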

Citations: 5

A Fault Tolerant Comparison Internet Shopping System: BestDeal by Using Mobile Agent

2009 International Conference on Information Management and Engineering

Pub Date : 2009-04-03 DOI: 10.1109/ICIME.2009.77

H. Pathak, Nipur, K. Garg

Mobile agents have been advocated to support electronic commerce over the Internet. While this is a promising paradigm, many intricate problems such as security and fault tolerance need to be solved to make this vision a reality. In this paper we propose BestDeal, a fault-tolerant comparison Internet shopping system. We assume that both the mobile agent and the host responsible for executing it are trustworthy, and that the mobile agent is not tampered with, kidnapped, or robbed on its way. The Hierarchical Fault Tolerance Protocol (HFTP) is used to make this application fault tolerant; that is, the user who launches the mobile agent receives it back with the correct result within the time limit in spite of hardware and software faults such as link failure, host failure, or a crash of the mobile agent or the mobile agent system. The proposed protocol has been modeled using CPN tools and analyzed using simulation and data gathering tools.

Citations: 6
