2019 Federated Conference on Computer Science and Information Systems (FedCSIS)最新文献

英文中文

Towards Data Quality Runtime Verification 迈向数据质量运行时验证

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F168

J. Bičevskis, Zane Bicevska, Anastasija Nikiforova, Ivo Oditis

This paper discusses data quality checking during business process execution by using runtime verification. While runtime verification verifies the correctness of business process execution, data quality checks assure that particular process did not negatively impact the stored data. Both, runtime verification and data quality checks run in parallel with the base processes affecting them insignificantly. The proposed idea allows verifying (a) if the process was ended correctly as well as (b) whether the results of the correct process did not negatively impact the stored data in result of its modification caused by the specific process. The desired result will be achieved by use of domain specific languages that would describe runtime verification and data quality checks at every stage of business process execution.

本文通过使用运行时验证来讨论业务流程执行过程中的数据质量检查。运行时验证验证业务流程执行的正确性，而数据质量检查确保特定流程不会对存储的数据产生负面影响。运行时验证和数据质量检查都与对它们影响不大的基本进程并行运行。所提议的想法允许验证(a)工艺是否正确结束，以及(b)正确工艺的结果是否不会由于特定工艺引起的修改而对存储的数据产生负面影响。期望的结果将通过使用特定于领域的语言来实现，这些语言将描述业务流程执行的每个阶段的运行时验证和数据质量检查。

引用次数: 3

Towards Big Data Solutions for Industrial Tomography Data Processing 面向工业层析成像数据处理的大数据解决方案

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F310

Aleksandra Kowalska, Piotr Łuczak, Dawid Sielski, T. Kowalski, A. Romanowski, D. Sankowski

This paper presents an overview of what Big Data can bring to the modern industry. Through following the history of contemporary Big Data frameworks the authors observe that the tools available have reached sufficient maturity so as to be usable in an industrial setting. The authors propose the concept of a system for collecting, organising, processing and analysing experimental data obtained from measurements with process tomography. Process tomography is used for noninvasive flow monitoring and data acquisition. The measurement data is collected, stored and processed to identify process regimes and process threats. Further general examples of solutions that aim to take advantage of the existence of such tools are presented as proof of viability of such approach. As the first step in the process of creating the proposed system, a scalable, distributed, containerisation-based cluster has been constructed, with consumer-grade hardware.

本文概述了大数据对现代工业的影响。通过跟踪当代大数据框架的历史，作者观察到可用的工具已经达到足够的成熟度，可以在工业环境中使用。作者提出了一个系统的概念，用于收集、组织、处理和分析从过程层析成像测量中获得的实验数据。过程断层扫描用于无创血流监测和数据采集。测量数据被收集、存储和处理，以识别过程制度和过程威胁。进一步的解决方案的一般例子旨在利用这些工具的存在，以证明这种方法的可行性。作为创建所建议的系统过程的第一步，已经使用消费者级硬件构建了一个可伸缩的、分布式的、基于容器的集群。

引用次数: 3

Big Data Platform for Smart Grids Power Consumption Anomaly Detection 智能电网用电异常检测大数据平台

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F210

Jakub Lipcak, M. Macák, B. Rossi

Big data processing in the Smart Grid context has many large-scale applications that require real-time data analysis (e.g., intrusion and data injection attacks detection, electric device health monitoring). In this paper, we present a big data platform for anomaly detection of power consumption data. The platform is based on an ingestion layer with data densification options, Apache Flink as part of the speed layer and HDFS/KairosDB as data storage layers. We showcase the application of the platform to a scenario of power consumption anomaly detection, benchmarking different alternative frameworks used at the speed layer level (Flink, Storm, Spark).

智能电网背景下的大数据处理有许多需要实时数据分析的大规模应用(例如，入侵和数据注入攻击检测，电气设备健康监测)。本文提出了一个用于电力消耗数据异常检测的大数据平台。该平台基于具有数据致密化选项的摄取层，Apache Flink作为速度层的一部分，HDFS/KairosDB作为数据存储层。我们展示了该平台在功耗异常检测场景中的应用，对速度层使用的不同替代框架(Flink, Storm, Spark)进行基准测试。

引用次数: 11

Languages’ Impact on Emotional Classification Methods 语言对情绪分类方法的影响

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F143

A. Eilertsen, Dennis Højbjerg Rose, Peter Langballe Erichsen, Rasmus Engesgaard Christensen, Rudra Pratap Deb Nath

There is currently a lack of research concerning whether Emotional Classification (EC) research on a language is applicable to other languages. If this is the case then we can greatly reduce the amount of research needed for different languages. Therefore, we propose a framework to answer the following null hypothesis: The change in classification accuracy for Emotional Classification caused by changing a single preprocessor or classifier is independent of the target language within a significance level of p= 0.05. We test this hypothesis using an English and a Danish data set, and the classification algorithms: Support-Vector Machine, Naive Bayes, and Random Forest. From our statistical test, we got a p We define this area as cross-languagetalic-value of 0.12852 and could therefore not reject our hypothesis. Thus, our hypothesis could still be true. More research is therefore needed within the field of cross-language EC in order to benefit EC for different languages.

情感分类对一种语言的研究是否适用于其他语言，目前还缺乏相关研究。如果是这样的话，那么我们可以大大减少不同语言所需的研究量。因此，我们提出了一个框架来回答以下零假设:在p= 0.05的显著性水平下，改变单个预处理器或分类器导致的情绪分类的分类精度变化与目标语言无关。我们使用英语和丹麦数据集以及分类算法:支持向量机、朴素贝叶斯和随机森林来检验这一假设。我们将该区域定义为跨语言值0.12852，因此不能拒绝我们的假设。因此，我们的假设仍然是正确的。因此，跨语言电子商务领域需要进行更多的研究，以使不同语言的电子商务受益。

引用次数: 1

Supporting Source Code Annotations with Metadata-Aware Development Environment 用元数据感知开发环境支持源代码注释

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F161

Ján Juhár

To augment source code with high-level metadata with the intent to facilitate program comprehension, a programmer can use annotations. There are several types of annotations: either those put directly in the code or external ones. Each type comes with a unique workflow and inherent limitations. In this paper, we present a tool providing uniform annotation process, which also adds custom metadata-awareness for an industrial IDE. We also report an experiment in which we sought whether the created annotating support helps programmers to annotate code with comments faster and more consistently. The experiment showed that with the tool the annotating consistency was significantly higher but also that the increase in annotating speed was not statistically significant.

为了用高级元数据增强源代码，以促进程序理解，程序员可以使用注释。有几种类型的注释:直接放在代码中的注释或外部注释。每种类型都有独特的工作流程和固有的限制。在本文中，我们提出了一个提供统一注释过程的工具，该工具还为工业IDE增加了自定义元数据感知。我们还报告了一个实验，在这个实验中，我们寻求创建的注释支持是否可以帮助程序员更快、更一致地用注释注释代码。实验表明，使用该工具标注的一致性显著提高，但标注速度的提高无统计学意义。

引用次数: 0

Knowledge Extraction and Applications utilizing Context Data in Knowledge Graphs 知识图中上下文数据的知识提取与应用

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F3

Jens Dörpinghaus, Andreas Stefan

Context is widely considered for NLP and knowledge discovery since it highly influences the exact meaning of natural language. The scientific challenge is not only to extract such context data, but also to store this data for further NLP approaches. Here, we propose a multiple step knowledge graphbased approach to utilize context data for NLP and knowledge expression and extraction. We introduce the graph-theoretic foundation for a general context concept within semantic networks and show a proof-of-concept-based on biomedical literature and text mining. We discuss the impact of this novel approach on text analysis, various forms of text recognition and knowledge extraction and retrieval.

上下文在自然语言处理和知识发现中被广泛考虑，因为它高度影响自然语言的确切含义。科学上的挑战不仅在于提取这样的上下文数据，还在于存储这些数据以供进一步的NLP方法使用。在此，我们提出了一种基于知识图的多步骤方法来利用上下文数据进行自然语言处理和知识表达与提取。我们介绍了语义网络中一般上下文概念的图论基础，并展示了基于生物医学文献和文本挖掘的概念验证。我们讨论了这种新方法对文本分析、各种形式的文本识别以及知识提取和检索的影响。

引用次数: 11

Network Effects in Online Marketplaces: The Case of Kiva 在线市场中的网络效应:Kiva案例

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F76

H. Mendelson, Yuanyuan Shen

Advanced information technologies have enabled the development of online marketplaces that connect businesses and people on a global scale. Much of the analysis of the adoption, growth and engagement on these marketplaces in the extant literature is based on the premise that they are characterized by network effects%a premise that has major implications for their deployment, implementation and management. In this paper we test this premise using data from Kiva, the world’s largest online, peer-to-peer social lending marketplace. We find that while network effects are strong and significant during the early growth phase of the marketplace, they become weak or disappear once the marketplace stabilizes.

先进的信息技术使在线市场得以发展，在全球范围内将企业和人们联系起来。在现有文献中，对这些市场的采用、增长和参与的大部分分析都是基于这样一个前提，即它们具有网络效应的特征，这一前提对它们的部署、实施和管理具有重大影响。在本文中，我们使用Kiva(世界上最大的在线点对点社交借贷市场)的数据来验证这一假设。我们发现，虽然网络效应在市场的早期成长阶段是强大和显著的，但一旦市场稳定，它们就会变弱或消失。

引用次数: 0

Accurate Retrieval of Corporate Reputation from Online Media Using Machine Learning 利用机器学习从在线媒体中准确检索企业声誉

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F169

Achim Klein, Martin Riekert, Velizar Dinev

Corporate reputation is an economic asset and its accurate measurement is of increasing interest in practice and science. This measurement task is difficult because reputation depends on numerous factors and stakeholders. Traditional measurement approaches have focused on human ratings and surveys, which are costly, can be conducted only infrequently and emphasize financial aspects of a corporation. Nowadays, online media with comments related to products, services, and corporations provides an abundant source for measuring reputation more comprehensively. Against this backdrop, we propose an information retrieval approach to automatically collect reputation-related text content from online media and analyze this content by machine learning-based sentiment analysis. We contribute an ontology for identifying corporations and a unique dataset of online media texts labelled by corporations’ reputation. Our approach achieves an overall accuracy of 84.4%. Our results help corporations to quickly identify their reputation from online media at low cost.

企业声誉是一种经济资产，它的准确测量在实践和科学上越来越受到关注。这个测量任务很困难，因为声誉取决于许多因素和利益相关者。传统的衡量方法侧重于人力评价和调查，这是昂贵的，只能很少进行，并强调公司的财务方面。如今，与产品、服务和企业相关的网络媒体评论为更全面地衡量声誉提供了丰富的来源。在此背景下，我们提出了一种信息检索方法，自动从网络媒体中收集与声誉相关的文本内容，并通过基于机器学习的情感分析对这些内容进行分析。我们提供了一个用于识别企业的本体和一个由企业声誉标记的在线媒体文本的独特数据集。我们的方法达到了84.4%的总体准确率。我们的研究结果帮助企业以较低的成本从网络媒体中快速识别自己的声誉。

引用次数: 2

Improving Real-Time Performance of U-Nets for Machine Vision in Laser Process Control 提高U-Nets在激光过程控制中的实时性

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F190

Przemyslaw Dolata, J. Reiner

Many industrial machine vision problems, particularly real-time control of manufacturing processes such as laser cladding, require robust and fast image processing. The inherent disturbances in images acquired during these processes makes classical segmentation algorithms uncertain. Among many convolutional neural networks introduced recently to solve such difficult problems, U-Net balances simplicity with segmentation accuracy. However, it is too computationally intensive for usage in many real-time processing pipelines.In this work we present a method of identifying the most informative levels of detail in the U-Net. By only processing the image at the selected levels, we reduce the total computation time by 80%, while still preserving adequate quality of segmentation.

许多工业机器视觉问题，特别是激光熔覆等制造过程的实时控制，需要鲁棒和快速的图像处理。在这些过程中获取的图像存在固有的干扰，使得经典分割算法具有不确定性。在最近推出的许多解决此类难题的卷积神经网络中，U-Net平衡了分割的简单性和准确性。然而，对于在许多实时处理管道中使用，它的计算量太大。在这项工作中，我们提出了一种识别U-Net中最具信息量的细节级别的方法。通过只在选定的级别上处理图像，我们减少了80%的总计算时间，同时仍然保持足够的分割质量。

引用次数: 0

Predicting Automotive Sales using Pre-Purchase Online Search Data 使用预购在线搜索数据预测汽车销售

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

Pub Date : 2019-09-26 DOI: 10.15439/2019F239

Philipp Wachter, Tobias Widmer, Achim Klein

Sales forecasting is an essential element for implementing sustainable business strategies in the automotive industry. Accurate sales forecasts enhance the competitive edge of car manufacturers in the effort to optimize their production planning processes. We propose a forecasting technique that combines keyword-specific customer online search data with economic variables to predict monthly car sales. To isolate online search data related to pre-purchase information search, we follow a backward induction approach and identify those keywords that are frequently applied by search engine users. In a set of experiments using real-world sales data and Google Trends, we find that our keyword-specific forecasting technique reduces the out-of-sample error by 5% as compared to existing techniques without systematic keyword selection. We also find that our regression models outperform the benchmark model by an out-of-sample prediction accuracy of up to 27%.

销售预测是汽车行业实施可持续经营战略的基本要素。准确的销售预测可以增强汽车制造商的竞争优势，从而优化其生产计划流程。我们提出了一种预测技术，将特定关键字的客户在线搜索数据与经济变量相结合，以预测每月的汽车销量。为了分离与预购信息搜索相关的在线搜索数据，我们采用逆向归纳方法并识别搜索引擎用户经常使用的关键字。在一组使用真实世界销售数据和谷歌趋势的实验中，我们发现，与没有系统关键字选择的现有技术相比，我们的关键字特定预测技术将样本外误差降低了5%。我们还发现，我们的回归模型优于基准模型的样本外预测精度高达27%。

引用次数: 6

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2019 Federated Conference on Computer Science and Information Systems (FedCSIS)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀