首页 > 最新文献

2016 Eleventh International Conference on Digital Information Management (ICDIM)最新文献

英文 中文
S4Q: Searching for QoE in P2P streaming neighborhoods S4Q: P2P流媒体社区的QoE搜索
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829783
Peron R. de Sousa, Antonio A. Rocha, S. C. D. Lucena, Morganna C. Diniz, D. Menasché
In P2P video streaming systems, the neighbor selection algorithm is one of the fundamental components that affects performance. Although a few neighbor selection methodologies have been proposed, none of them takes into consideration QoE metrics reported by peers as a parameter for decisions. In this work, we present a new metric that quantifies the QoE as a function of the number of absences of pieces of a video during playback, referred to as stress level. A peer is stable if its stress level remains constant over a certain interval of time. Then, we propose a new neighbor selection algorithm for P2P streaming. The algorithm promotes exchanges of lists of stable peers, in order to get shorter download times and minimum interruption during video playback. The performance results show that our proposal is competitive against other state-of-the-art algorithms, reducing the number of absences of video pieces during playback by at least 19% and the average stress level by up to 32%.
在P2P视频流系统中,邻居选择算法是影响系统性能的基本组成部分之一。虽然已经提出了一些邻居选择方法,但它们都没有考虑到对等体报告的QoE度量作为决策的参数。在这项工作中,我们提出了一个新的度量,将QoE量化为播放期间视频片段缺失次数的函数,称为压力水平。如果一个对等体的压力水平在一段时间内保持不变,那么它就是稳定的。然后,我们提出了一种新的P2P流媒体邻居选择算法。该算法促进稳定节点列表的交换,以缩短下载时间和最小化视频播放期间的中断。性能结果表明,我们的提议与其他最先进的算法相比具有竞争力,在播放过程中至少减少了19%的视频片段缺失次数,平均压力水平减少了32%。
{"title":"S4Q: Searching for QoE in P2P streaming neighborhoods","authors":"Peron R. de Sousa, Antonio A. Rocha, S. C. D. Lucena, Morganna C. Diniz, D. Menasché","doi":"10.1109/ICDIM.2016.7829783","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829783","url":null,"abstract":"In P2P video streaming systems, the neighbor selection algorithm is one of the fundamental components that affects performance. Although a few neighbor selection methodologies have been proposed, none of them takes into consideration QoE metrics reported by peers as a parameter for decisions. In this work, we present a new metric that quantifies the QoE as a function of the number of absences of pieces of a video during playback, referred to as stress level. A peer is stable if its stress level remains constant over a certain interval of time. Then, we propose a new neighbor selection algorithm for P2P streaming. The algorithm promotes exchanges of lists of stable peers, in order to get shorter download times and minimum interruption during video playback. The performance results show that our proposal is competitive against other state-of-the-art algorithms, reducing the number of absences of video pieces during playback by at least 19% and the average stress level by up to 32%.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125344547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Examination of effective features for CRF-based bibliography extraction from reference strings 从参考字符串中提取基于crf的书目的有效特征检验
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829774
Daiki Matsuoka, Manabu Ohta, A. Takasu, J. Adachi
Metadata such as bibliographic information about documents are indispensable in the effective use of digital libraries. In particular, the reference fields of academic papers contain much bibliographic information such as authors' names and document titles. We are therefore developing a method for automatically extracting bibliographic information from reference strings using a conditional random field (CRF). The features used by the CRF determine the accuracy of this method. We examine effective features for accurate extraction by experimentally changing the features used. The experiments showed that lexical features were quite effective in accurate extraction and augmenting lexicons properly could lead to further improvements in accuracy.
文献书目信息等元数据在数字图书馆的有效利用中是不可或缺的。特别是,学术论文的参考字段包含许多书目信息,如作者姓名和文献标题。因此,我们正在开发一种使用条件随机场(CRF)从参考字符串中自动提取书目信息的方法。CRF使用的特征决定了该方法的准确性。我们通过实验改变所使用的特征来检查准确提取的有效特征。实验表明,词汇特征在准确提取中是非常有效的,适当增加词汇可以进一步提高准确性。
{"title":"Examination of effective features for CRF-based bibliography extraction from reference strings","authors":"Daiki Matsuoka, Manabu Ohta, A. Takasu, J. Adachi","doi":"10.1109/ICDIM.2016.7829774","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829774","url":null,"abstract":"Metadata such as bibliographic information about documents are indispensable in the effective use of digital libraries. In particular, the reference fields of academic papers contain much bibliographic information such as authors' names and document titles. We are therefore developing a method for automatically extracting bibliographic information from reference strings using a conditional random field (CRF). The features used by the CRF determine the accuracy of this method. We examine effective features for accurate extraction by experimentally changing the features used. The experiments showed that lexical features were quite effective in accurate extraction and augmenting lexicons properly could lead to further improvements in accuracy.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128842146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
HSDD: A hybrid sampling strategy for class imbalance in defect prediction data sets HSDD:缺陷预测数据集中类不平衡的混合采样策略
Pub Date : 2016-08-01 DOI: 10.1109/FGCT.2016.7605093
M. Öztürk, A. Zengin
Class imbalance is a common problem in defect prediction data sets. In order to cope with this problem, over-sampling and under sampling methods are employed. However, these methods are designed for instance based alteration and not specialized for feature space. Also there is not any distinctive approach to cope with class imbalance in defect prediction data sets. We develop HSDD (hybrid sampling for defect data sets) to solve this problem. HSDD comprises not only derivation of low-level metrics, but also reduction processes of repeated data points. The method was evaluated on industrial and open source project data sets by using Bayes, naive Bayes, random forest, and J48 in terms of g-mean and training time. Obtained results show that HSDD produces promising training performance especially in large-scale data sets.
类不平衡是缺陷预测数据集中常见的问题。为了解决这一问题,采用了过采样和欠采样的方法。然而,这些方法是针对基于实例的改变而设计的,而不是专门针对特征空间的。此外,对于缺陷预测数据集中的类不平衡问题,目前还没有一种独特的解决方法。我们开发HSDD(缺陷数据集的混合采样)来解决这个问题。HSDD不仅包括低级指标的推导,还包括重复数据点的减少过程。利用贝叶斯、朴素贝叶斯、随机森林和J48在工业和开源项目数据集上对该方法进行了g均值和训练时间的评估。实验结果表明,HSDD在大规模数据集上的训练效果非常好。
{"title":"HSDD: A hybrid sampling strategy for class imbalance in defect prediction data sets","authors":"M. Öztürk, A. Zengin","doi":"10.1109/FGCT.2016.7605093","DOIUrl":"https://doi.org/10.1109/FGCT.2016.7605093","url":null,"abstract":"Class imbalance is a common problem in defect prediction data sets. In order to cope with this problem, over-sampling and under sampling methods are employed. However, these methods are designed for instance based alteration and not specialized for feature space. Also there is not any distinctive approach to cope with class imbalance in defect prediction data sets. We develop HSDD (hybrid sampling for defect data sets) to solve this problem. HSDD comprises not only derivation of low-level metrics, but also reduction processes of repeated data points. The method was evaluated on industrial and open source project data sets by using Bayes, naive Bayes, random forest, and J48 in terms of g-mean and training time. Obtained results show that HSDD produces promising training performance especially in large-scale data sets.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132772124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Churn perdiction in the telecom business 电信行业的客户流失预测
Pub Date : 2016-07-14 DOI: 10.1109/ICDIM.2016.7829775
Georgina Esteves, João Mendes-Moreira
Telecommunication companies are acknowledging the existing connection between customer satisfaction and company revenues. Customer churn in telecom refers to a customer that ceases his relationship with a company. Churn prediction in telecom has recently gained substantial interest of stakeholders, who noticed that retaining a customer is substantially cheaper that gaining a new one. This research compares six approaches using different algorithms that identify the clients who are closer to abandon their telecom provider. Those algorithms are: KNN, Naive Bayes, C4.5, Random Forest, Ada Boost and ANN. The use of real data provided by We Do technologies extended the refinement time necessary, but ensured that the developed algorithm and model can be applied to real world situations. The models are evaluated according to three criteria: are under curve, sensitivity and specificity, with special weight to the first two criteria. The Random Forest algorithm proved to be the most adequate in all the test cases.
电信公司正在认识到客户满意度和公司收入之间存在的联系。在电信行业,客户流失是指客户终止与公司的关系。电信客户流失预测最近引起了利益相关者的极大兴趣,他们注意到留住一个客户比获得一个新客户要便宜得多。这项研究比较了六种方法,使用不同的算法来识别哪些客户更接近放弃他们的电信提供商。这些算法是:KNN,朴素贝叶斯,C4.5,随机森林,Ada Boost和ANN。使用We Do技术提供的真实数据,延长了必要的细化时间,但确保了所开发的算法和模型可以应用于真实世界的情况。根据曲线下、敏感性和特异性三个指标对模型进行评价,对前两个指标给予特殊权重。在所有的测试用例中,随机森林算法被证明是最合适的。
{"title":"Churn perdiction in the telecom business","authors":"Georgina Esteves, João Mendes-Moreira","doi":"10.1109/ICDIM.2016.7829775","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829775","url":null,"abstract":"Telecommunication companies are acknowledging the existing connection between customer satisfaction and company revenues. Customer churn in telecom refers to a customer that ceases his relationship with a company. Churn prediction in telecom has recently gained substantial interest of stakeholders, who noticed that retaining a customer is substantially cheaper that gaining a new one. This research compares six approaches using different algorithms that identify the clients who are closer to abandon their telecom provider. Those algorithms are: KNN, Naive Bayes, C4.5, Random Forest, Ada Boost and ANN. The use of real data provided by We Do technologies extended the refinement time necessary, but ensured that the developed algorithm and model can be applied to real world situations. The models are evaluated according to three criteria: are under curve, sensitivity and specificity, with special weight to the first two criteria. The Random Forest algorithm proved to be the most adequate in all the test cases.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126841461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Private photo recommendation system for in-person conversation 私人照片推荐系统,用于面对面的交谈
Pub Date : 1900-01-01 DOI: 10.1109/ICDIM.2016.7829768
Toki Takeda, Taketoshi Ushiama
Mobile devices equipped with cameras, such as smartphones, have become extremely popular in recent years, and have motivated users to capture an increasing number of digital photos in their daily lives. Users often invoke these photos during in-person interactions or conversations with others. In the context of a conversation between two people related to or involving reference to a photo, one of them might be inclined to show the photograph on his/her smartphone. However, it is difficult for users to search for a photo that they want to reference in the brief period during a conversation for which it might be relevant. Hence, in this study, we propose a system that shows users the photos relevant to the topic that they want to discuss in a given conversation according to their preferences. In the method of ranking photos used in our system, the index of the adequacy of a photo in the context of a conversation is defined as the degree of similarity to the photo that is used in the current conversation. Photos on each user's device are sorted by their degree of similarity with a given photo. We show that there is a strong relationship between our proposed ranking and those based on subjective assessment tasks, and confirmed the effectiveness of our system through experiments involving seven subjects.
配有摄像头的移动设备,如智能手机,近年来变得非常流行,并促使用户在日常生活中拍摄越来越多的数码照片。用户经常在面对面的互动或与他人的对话中调用这些照片。在两个人之间与照片相关或涉及到照片的对话中,其中一人可能倾向于在他/她的智能手机上显示照片。然而,用户很难在短暂的对话中搜索到他们想要参考的照片,而这张照片可能与对话相关。因此,在本研究中,我们提出了一个系统,根据用户的偏好,在给定的对话中向用户显示与他们想要讨论的主题相关的照片。在我们系统中使用的照片排名方法中,照片在对话上下文中的充分性指数被定义为与当前对话中使用的照片的相似程度。每个用户设备上的照片会根据与给定照片的相似程度进行排序。我们证明了我们提出的排名与基于主观评估任务的排名之间存在很强的关系,并通过涉及七个受试者的实验证实了我们系统的有效性。
{"title":"Private photo recommendation system for in-person conversation","authors":"Toki Takeda, Taketoshi Ushiama","doi":"10.1109/ICDIM.2016.7829768","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829768","url":null,"abstract":"Mobile devices equipped with cameras, such as smartphones, have become extremely popular in recent years, and have motivated users to capture an increasing number of digital photos in their daily lives. Users often invoke these photos during in-person interactions or conversations with others. In the context of a conversation between two people related to or involving reference to a photo, one of them might be inclined to show the photograph on his/her smartphone. However, it is difficult for users to search for a photo that they want to reference in the brief period during a conversation for which it might be relevant. Hence, in this study, we propose a system that shows users the photos relevant to the topic that they want to discuss in a given conversation according to their preferences. In the method of ranking photos used in our system, the index of the adequacy of a photo in the context of a conversation is defined as the degree of similarity to the photo that is used in the current conversation. Photos on each user's device are sorted by their degree of similarity with a given photo. We show that there is a strong relationship between our proposed ranking and those based on subjective assessment tasks, and confirmed the effectiveness of our system through experiments involving seven subjects.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126510317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Message from the chairs 来自椅子的信息
P. Olivier, A. Krüger
The aim of the SCAM working conference is to bring together researchers and practitioners working on theory, techniques and applications that concern analysis and manipulation of the source code of computer systems. Software plays an enormous role in our lives, in ways both obvious and subtle. This role will continue to expand and evolve. While much attention in the wider software engineering community is directed towards other aspects of systems development and evolution, such as specification, design and requirements engineering, it is the source code that contains the precise, and sometimes only, definitive description of the behaviour of the system. SCAM focuses on the techniques and tools themselves what they can achieve, how they can be improved, refined and combined. SCAM is a working conference: paper presentations are kept short and focused, with ample time reserved for general discussion of issues raised during each session. This always leads to interesting and stimulating interactive discussions.
诈骗工作会议的目的是汇集研究人员和从业人员的理论,技术和应用,有关计算机系统的源代码的分析和操作。软件在我们的生活中扮演着巨大的角色,无论是以明显的还是微妙的方式。这一角色将继续扩大和发展。虽然在更广泛的软件工程社区中,许多注意力都集中在系统开发和进化的其他方面,例如规格说明、设计和需求工程,但是源代码包含了对系统行为的精确的、有时只是明确的描述。骗局专注于技术和工具本身,他们可以实现什么,他们如何可以改进,完善和结合。骗局是一个工作会议:论文报告保持简短和重点,有充足的时间用于一般性讨论在每次会议期间提出的问题。这总是导致有趣和刺激的互动讨论。
{"title":"Message from the chairs","authors":"P. Olivier, A. Krüger","doi":"10.1109/SCAM.2006.19","DOIUrl":"https://doi.org/10.1109/SCAM.2006.19","url":null,"abstract":"The aim of the SCAM working conference is to bring together researchers and practitioners working on theory, techniques and applications that concern analysis and manipulation of the source code of computer systems. Software plays an enormous role in our lives, in ways both obvious and subtle. This role will continue to expand and evolve. While much attention in the wider software engineering community is directed towards other aspects of systems development and evolution, such as specification, design and requirements engineering, it is the source code that contains the precise, and sometimes only, definitive description of the behaviour of the system. SCAM focuses on the techniques and tools themselves what they can achieve, how they can be improved, refined and combined. SCAM is a working conference: paper presentations are kept short and focused, with ample time reserved for general discussion of issues raised during each session. This always leads to interesting and stimulating interactive discussions.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"178 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134319488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2016 Eleventh International Conference on Digital Information Management (ICDIM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1