首页 > 最新文献

Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence最新文献

英文 中文
The Challenge and Prospect of Scalability of Blockchain Technology 区块链技术可扩展性的挑战与展望
Lizhi Wang
Blockchain is a decentralized technology proposed by Satoshi Nakamoto in 2008, without relying on trust, irrevocable and modified, based on a consensus mechanism. Blockchain technology was not created out of thin air, but was born as the underlying technology of bitcoin, a digital currency. Now it has received widespread attention for its implementation in distributed ledger. However, in the current era, the application of blockchain technology in actual scenarios has not yet matured, mainly due to some characteristics of blockchain technology itself. After weighing its advantages and disadvantages, it is increasingly becoming its limitation. One of the main aspects is the poor scalability of the blockchain, which not only limits the functional expansion of the blockchain and makes it impossible to apply the blockchain technology to a wider range of scenarios, but also limits the throughput of the blockchain system to a certain extent Promotion. Taking into account the characteristics and bottlenecks of the above-mentioned blockchain technology, different from previous studies, this article from the perspective of blockchain expansion, introduces some existing expansion technologies in solving the transaction rate of the blockchain, and analyzes its principles and feasibility. Through the classification and comparison of different expansion technologies, the advantages and possible disadvantages of various expansion technologies are described. Finally, on the basis of fully understanding the existing blockchain expansion technology, the outlook for the future development of blockchain expansion technology is proposed, with a view to providing some suggestions for the development of the blockchain expansion technology.
区块链是中本聪在2008年提出的一种去中心化技术,不依赖信任,不可撤销和修改,基于共识机制。区块链技术不是凭空创造出来的,而是作为数字货币比特币的基础技术诞生的。目前,它在分布式账本中的实现受到了广泛的关注。然而,在当前时代,区块链技术在实际场景中的应用尚未成熟,这主要是由于区块链技术本身的一些特点。在权衡了它的利弊之后,它越来越成为它的局限性。其中一个主要方面是区块链的可扩展性差,这不仅限制了区块链的功能扩展,使得区块链技术无法应用到更广泛的场景中,也在一定程度上限制了区块链系统的吞吐量推广。考虑到上述区块链技术的特点和瓶颈,与以往的研究不同,本文从区块链扩容的角度,介绍了解决区块链交易率的一些现有扩容技术,并分析了其原理和可行性。通过对不同扩展技术的分类和比较,阐述了各种扩展技术的优点和可能存在的缺点。最后,在充分了解现有区块链扩展技术的基础上,对区块链扩展技术的未来发展进行展望,以期为区块链扩展技术的发展提供一些建议。
{"title":"The Challenge and Prospect of Scalability of Blockchain Technology","authors":"Lizhi Wang","doi":"10.1145/3507548.3507594","DOIUrl":"https://doi.org/10.1145/3507548.3507594","url":null,"abstract":"Blockchain is a decentralized technology proposed by Satoshi Nakamoto in 2008, without relying on trust, irrevocable and modified, based on a consensus mechanism. Blockchain technology was not created out of thin air, but was born as the underlying technology of bitcoin, a digital currency. Now it has received widespread attention for its implementation in distributed ledger. However, in the current era, the application of blockchain technology in actual scenarios has not yet matured, mainly due to some characteristics of blockchain technology itself. After weighing its advantages and disadvantages, it is increasingly becoming its limitation. One of the main aspects is the poor scalability of the blockchain, which not only limits the functional expansion of the blockchain and makes it impossible to apply the blockchain technology to a wider range of scenarios, but also limits the throughput of the blockchain system to a certain extent Promotion. Taking into account the characteristics and bottlenecks of the above-mentioned blockchain technology, different from previous studies, this article from the perspective of blockchain expansion, introduces some existing expansion technologies in solving the transaction rate of the blockchain, and analyzes its principles and feasibility. Through the classification and comparison of different expansion technologies, the advantages and possible disadvantages of various expansion technologies are described. Finally, on the basis of fully understanding the existing blockchain expansion technology, the outlook for the future development of blockchain expansion technology is proposed, with a view to providing some suggestions for the development of the blockchain expansion technology.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123420930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Research on human-computer interaction portability evaluation model in complex environment 复杂环境下人机交互可移植性评价模型研究
Zhuxin Xue, Yang Bai, Haixin Wang, Chenyu He, Jian Tan
Human-computer interaction is a technology to study the relationship between users and systems. Good user experience can greatly enhance user stickiness. With the development of Internet technology, users begin to face a variety of systems, but the interaction methods involved are different. Learning new interaction means an increase in user learning costs. As a part of product design, the design of interaction also needs to cover special user groups as much as possible, such as blind people. Thus, it is necessary to study whether different ways of interaction can migrate to each other. Traditional research on user experience mainly focuses on qualitative aspects, such as questionnaire survey. In this paper, we propose a quantitative model to evaluate the portability of interaction. For software interaction design, the macro concept of user experience is quantified by different dimensions, and a unified index model and calculation method are output to guide and evaluate the portability of different software interactions.
人机交互是研究用户与系统之间关系的技术。良好的用户体验可以大大增强用户粘性。随着互联网技术的发展,用户开始面对各种各样的系统,但所涉及的交互方式却各不相同。学习新的交互意味着用户学习成本的增加。作为产品设计的一部分,交互的设计也需要尽可能地覆盖特殊的用户群体,比如盲人。因此,有必要研究不同的交互方式是否可以相互迁移。传统的用户体验研究主要集中在定性方面,如问卷调查。在本文中,我们提出了一个定量模型来评估交互的可移植性。对于软件交互设计,将用户体验的宏观概念通过不同维度进行量化,输出统一的指标模型和计算方法,指导和评价不同软件交互的可移植性。
{"title":"Research on human-computer interaction portability evaluation model in complex environment","authors":"Zhuxin Xue, Yang Bai, Haixin Wang, Chenyu He, Jian Tan","doi":"10.1145/3507548.3507615","DOIUrl":"https://doi.org/10.1145/3507548.3507615","url":null,"abstract":"Human-computer interaction is a technology to study the relationship between users and systems. Good user experience can greatly enhance user stickiness. With the development of Internet technology, users begin to face a variety of systems, but the interaction methods involved are different. Learning new interaction means an increase in user learning costs. As a part of product design, the design of interaction also needs to cover special user groups as much as possible, such as blind people. Thus, it is necessary to study whether different ways of interaction can migrate to each other. Traditional research on user experience mainly focuses on qualitative aspects, such as questionnaire survey. In this paper, we propose a quantitative model to evaluate the portability of interaction. For software interaction design, the macro concept of user experience is quantified by different dimensions, and a unified index model and calculation method are output to guide and evaluate the portability of different software interactions.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"211 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123018770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Portfolio Model Based on Temporal Convolution Networks and the Empirical Research on Chinese Stock Market 基于时间卷积网络的投资组合模型及中国股票市场实证研究
Rui Zhang, Zuoquan Zhang, Marui Du, Xiaomin Wang
Aiming at determining the weights of selected investment targets to obtain higher and more stable investment returns, a Temporal Convolution Networks (TCN) based portfolio model, namely, TCNportfolio model is proposed. TCN-portfolio model combines TCNbased time series processing and MLP-based cross-sectional data processing, and finally outputs the investment target weights which changes every ten trading days. We optimize the TCN-portfolio model using a Multi-Objective Genetic Algorithm (MOGA) which optimizes the rate of return and variance at the same time. The component stocks of Shanghai Securities Composite 50 index (SSEC 50) are selected as the investment targets. Experimental results on the test sets reveal that TCN-portfolio model performs well. Its average daily return rate is obviously greater than those of SSEC and SSEC 50, and the cumulative return rate of TCN-portfolio model is always greater than those of SSEC and SSEC 50 on the test data set.
为了确定所选投资目标的权重,以获得更高、更稳定的投资回报,提出了一种基于时间卷积网络(Temporal Convolution Networks, TCN)的投资组合模型,即TCNportfolio模型。TCN-portfolio模型结合了基于tcn的时间序列处理和基于mlp的横截面数据处理,最终输出每10个交易日变化一次的投资目标权重。采用多目标遗传算法(MOGA)对tcn -投资组合模型进行优化,同时优化收益率和方差。选取上证50指数成分股作为投资标的。在测试集上的实验结果表明,tcn -组合模型具有良好的性能。其平均日收益率明显大于SSEC和SSEC 50,在测试数据集上,TCN-portfolio模型的累计收益率始终大于SSEC和SSEC 50。
{"title":"The Portfolio Model Based on Temporal Convolution Networks and the Empirical Research on Chinese Stock Market","authors":"Rui Zhang, Zuoquan Zhang, Marui Du, Xiaomin Wang","doi":"10.1145/3507548.3507593","DOIUrl":"https://doi.org/10.1145/3507548.3507593","url":null,"abstract":"Aiming at determining the weights of selected investment targets to obtain higher and more stable investment returns, a Temporal Convolution Networks (TCN) based portfolio model, namely, TCNportfolio model is proposed. TCN-portfolio model combines TCNbased time series processing and MLP-based cross-sectional data processing, and finally outputs the investment target weights which changes every ten trading days. We optimize the TCN-portfolio model using a Multi-Objective Genetic Algorithm (MOGA) which optimizes the rate of return and variance at the same time. The component stocks of Shanghai Securities Composite 50 index (SSEC 50) are selected as the investment targets. Experimental results on the test sets reveal that TCN-portfolio model performs well. Its average daily return rate is obviously greater than those of SSEC and SSEC 50, and the cumulative return rate of TCN-portfolio model is always greater than those of SSEC and SSEC 50 on the test data set.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126879775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalized Thread Recommendation on Thai Internet Forum 泰国互联网论坛的个性化主题推荐
Bundit Manaskasemsak, Sarita Puttitanun, Jirateep Tantisuwankul, A. Rungsawang
The rise of user-generated content on the Internet today has led to the problem of data overload. Therefore, recommender systems have been introduced in various social platforms to automatically serve interesting content to users. Pantip.com is the most popular Thai Internet forum where people can discuss ideas, tips, and news on a variety of topics. Although Pantip has many recommendation services, these are not specific for individual users. In this paper, we proposed a personalized thread recommender system that is applicable to the Pantip site. The approach finds out appropriate threads for each user based on three aspects: user interests, thread trends, and thread freshness along with the analysis in changing of user behavior over time. We conducted experiments on the Pantip clickstream dataset and evaluated the performance by real users. Experimental results show that the proposed approach recommends threads that are significantly more satisfying for users than the baseline approaches.
如今,互联网上用户生成内容的兴起导致了数据过载的问题。因此,各种社交平台都引入了推荐系统,自动为用户提供感兴趣的内容。Pantip.com是泰国最受欢迎的互联网论坛,人们可以在这里讨论各种主题的想法、技巧和新闻。尽管Pantip有很多推荐服务,但这些都不是针对个人用户的。本文提出了一种适用于Pantip网站的个性化主题推荐系统。该方法基于用户兴趣、线程趋势和线程新鲜度三个方面,并分析用户行为随时间的变化,为每个用户找到合适的线程。我们在Pantip点击流数据集上进行了实验,并对真实用户的性能进行了评估。实验结果表明,该方法推荐的线程明显比基线方法更令人满意。
{"title":"Personalized Thread Recommendation on Thai Internet Forum","authors":"Bundit Manaskasemsak, Sarita Puttitanun, Jirateep Tantisuwankul, A. Rungsawang","doi":"10.1145/3507548.3507589","DOIUrl":"https://doi.org/10.1145/3507548.3507589","url":null,"abstract":"The rise of user-generated content on the Internet today has led to the problem of data overload. Therefore, recommender systems have been introduced in various social platforms to automatically serve interesting content to users. Pantip.com is the most popular Thai Internet forum where people can discuss ideas, tips, and news on a variety of topics. Although Pantip has many recommendation services, these are not specific for individual users. In this paper, we proposed a personalized thread recommender system that is applicable to the Pantip site. The approach finds out appropriate threads for each user based on three aspects: user interests, thread trends, and thread freshness along with the analysis in changing of user behavior over time. We conducted experiments on the Pantip clickstream dataset and evaluated the performance by real users. Experimental results show that the proposed approach recommends threads that are significantly more satisfying for users than the baseline approaches.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133608085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Study of Music Visualization based on CiteSpace at China and the World 基于CiteSpace的中国与世界音乐可视化比较研究
Hai-Yan Zheng, Zhengqing Jiang
Music visualization is a visual art form for understanding, analyzing and comparing the internal structure and expressive features of music. It meets the aesthetic demand of the masses in the digital age. This paper reviews the development and research status of the music visualization literature in the past 20 years, comprehensively analyzes the research process and current hotspots of music visualization, and speculates the future development trend. We have used Web of Science (WoS) and China National Knowledge Infrastructure (CNKI) as data sources, used CiteSpace software to compare and analyze the year, country, subject distribution and hot keywords of music visualization literature at China and the world from 2000 to 2020 by the method of Mapping Knowledge Domain. The results show that the research on music visualization at China and other countries is showing an upward trend, and it presents the characteristics of multi-disciplinary integration. Different application scenarios, research methods and development stages lead to different research hotspots between different countries. The shortcomings of Chinese research in this field lies in that the research content needs to be deepened, the interdisciplinary content needs to be integrated, and applications of music visualization needs to be popularized.
音乐可视化是一种理解、分析和比较音乐的内在结构和表现特征的视觉艺术形式。它满足了数字时代大众的审美需求。本文回顾了近20年来音乐可视化文献的发展和研究现状,综合分析了音乐可视化的研究过程和当前热点,并对未来的发展趋势进行了推测。以WoS (Web of Science)和CNKI (China National Knowledge Infrastructure)为数据源,利用CiteSpace软件,采用知识图谱的方法,对2000 - 2020年中国和世界音乐可视化文献的年份、国家、学科分布和热点关键词进行了比较分析。结果表明,国内外对音乐可视化的研究呈现出上升趋势,呈现出多学科融合的特点。不同的应用场景、研究方法和发展阶段导致不同国家的研究热点不同。中国在该领域研究的不足在于研究内容有待深化,跨学科内容有待整合,音乐可视化应用有待推广。
{"title":"Comparative Study of Music Visualization based on CiteSpace at China and the World","authors":"Hai-Yan Zheng, Zhengqing Jiang","doi":"10.1145/3507548.3507604","DOIUrl":"https://doi.org/10.1145/3507548.3507604","url":null,"abstract":"Music visualization is a visual art form for understanding, analyzing and comparing the internal structure and expressive features of music. It meets the aesthetic demand of the masses in the digital age. This paper reviews the development and research status of the music visualization literature in the past 20 years, comprehensively analyzes the research process and current hotspots of music visualization, and speculates the future development trend. We have used Web of Science (WoS) and China National Knowledge Infrastructure (CNKI) as data sources, used CiteSpace software to compare and analyze the year, country, subject distribution and hot keywords of music visualization literature at China and the world from 2000 to 2020 by the method of Mapping Knowledge Domain. The results show that the research on music visualization at China and other countries is showing an upward trend, and it presents the characteristics of multi-disciplinary integration. Different application scenarios, research methods and development stages lead to different research hotspots between different countries. The shortcomings of Chinese research in this field lies in that the research content needs to be deepened, the interdisciplinary content needs to be integrated, and applications of music visualization needs to be popularized.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115460282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Short Text Classification Model Based on BERT and Fusion Network 基于BERT和融合网络的短文本分类模型
Dongxue Bao, Donghong Qin, Xianye Liang, Lila Hong
Abstract: Aiming at short texts lacking contextual information, large amount of text data, sparse features, and traditional text feature representations that cannot dynamically obtain the key classification information of a word polysemous and contextual semantics. this paper proposes a pre-trained language model based on BERT. The network model B-BAtt-MPC (BERT-BiLSTM-Attention-Max-Pooling-Concat) that integrates BiLSTM, Attention mechanism and Max-Pooling mechanism. Firstly, obtain multi-dimensional and rich feature information such as text context semantics, grammar, and context through the BERT model; Secondly, use the BERT output vector to obtain the most important feature information worth noting through the BiLSTM, Attention layer and Max-Pooling layer; In order to optimize the classification model, the BERT and BiLSTM output vectors are fused and input into Max-Pooling; Finally, the classification results are obtained by fusing two feature vectors with Max-Pooling. The experimental results of two data sets show that the model proposed in this paper can obtain the importance and key rich semantic features of short text classification, and can improve the text classification effect.
摘要:针对缺乏上下文信息的短文本、文本数据量大、特征稀疏、传统文本特征表示不能动态获取词的多义和上下文语义的关键分类信息等问题。本文提出了一种基于BERT的预训练语言模型。集成了BiLSTM、Attention机制和Max-Pooling机制的网络模型b - bat - mpc (BERT-BiLSTM-Attention-Max-Pooling-Concat)。首先,通过BERT模型获得文本上下文语义、语法、上下文等多维、丰富的特征信息;其次,利用BERT输出向量,通过BiLSTM、Attention层和Max-Pooling层获得最重要的值得注意的特征信息;为了优化分类模型,将BERT和BiLSTM输出向量融合并输入到Max-Pooling中;最后,利用Max-Pooling对两个特征向量进行融合,得到分类结果。两个数据集的实验结果表明,本文提出的模型能够获得短文本分类的重要性和关键丰富的语义特征,能够提高文本分类效果。
{"title":"Short Text Classification Model Based on BERT and Fusion Network","authors":"Dongxue Bao, Donghong Qin, Xianye Liang, Lila Hong","doi":"10.1145/3507548.3507574","DOIUrl":"https://doi.org/10.1145/3507548.3507574","url":null,"abstract":"Abstract: Aiming at short texts lacking contextual information, large amount of text data, sparse features, and traditional text feature representations that cannot dynamically obtain the key classification information of a word polysemous and contextual semantics. this paper proposes a pre-trained language model based on BERT. The network model B-BAtt-MPC (BERT-BiLSTM-Attention-Max-Pooling-Concat) that integrates BiLSTM, Attention mechanism and Max-Pooling mechanism. Firstly, obtain multi-dimensional and rich feature information such as text context semantics, grammar, and context through the BERT model; Secondly, use the BERT output vector to obtain the most important feature information worth noting through the BiLSTM, Attention layer and Max-Pooling layer; In order to optimize the classification model, the BERT and BiLSTM output vectors are fused and input into Max-Pooling; Finally, the classification results are obtained by fusing two feature vectors with Max-Pooling. The experimental results of two data sets show that the model proposed in this paper can obtain the importance and key rich semantic features of short text classification, and can improve the text classification effect.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125693208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Distribution Consistency Penalty in the Quadratic Kappa Loss for Ordinal Regression of Imbalanced Datasets 不平衡数据集有序回归的二次Kappa损失中的分布一致性惩罚
Bin-Bin Yang, Shengjie Zhao, Kenan Ye, Rongqing Zhang
Ordinal regression is a typical deep learning problem, which involves inherently ordered labels that are common in practical applications, especially in medical diagnosis tasks. To overcome the neglect of ordered or non-stationary property by merely exploiting classification or regression, quadratic weighted kappa (QWK) is proposed to be employed in the QWK loss function design as an efficient evaluation metric for ordinal regression. However, the paradox that kappa will be higher with an asymmetrical marginal histogram leads the QWK loss function to get the local optimal solution with all-zero-column in the confusion matrices during training. In practice, the all-zero column problem will result in a certain category not being detected at all, which can have serious consequences for the exclusion of pathology. To address this limitation, a new form of penalty term is proposed for the QWK loss function by penalizing the distance of marginal histogram to effectively avoid all-zero-column of the models. The experiments on the category-imbalanced datasets demonstrate that our penalty terms solve all-zero-column problem. On Adience dataset our penalty terms achieve 0.915 QWK, 0.446 MAE and 0.612 accuracy, while on DR dataset our penalty terms achieve 0.744 QWK, 0.281 MAE and 0.810 accuracy. Besides, experiments on the category-balanced datasets HCI show that our penalty terms achieve 0.810 QWK, 0.499 MAE and 0.610 accuracy.
有序回归是一个典型的深度学习问题,它涉及到在实际应用中常见的固有有序标签,特别是在医疗诊断任务中。为了克服单纯利用分类或回归而忽略有序或非平稳性质的问题,提出在QWK损失函数设计中采用二次加权kappa (quadratic weighted kappa, QWK)作为有序回归的有效评价指标。然而,由于边缘直方图不对称时kappa会更高的悖论,导致QWK损失函数在训练时只能得到混淆矩阵中列全为零的局部最优解。在实践中,全零列问题将导致某个类别根本没有被检测到,这可能对排除病理产生严重后果。针对这一局限性,提出了一种新的QWK损失函数惩罚项形式,通过惩罚边缘直方图的距离,有效避免模型的全零列。在类别不平衡数据集上的实验表明,我们的惩罚项解决了全零列问题。在Adience数据集上,我们的惩罚项达到了0.915 QWK、0.446 MAE和0.612的精度,而在DR数据集上,我们的惩罚项达到了0.744 QWK、0.281 MAE和0.810的精度。此外,在类别平衡数据集HCI上的实验表明,我们的惩罚项达到了0.810 QWK, 0.499 MAE和0.610准确率。
{"title":"Distribution Consistency Penalty in the Quadratic Kappa Loss for Ordinal Regression of Imbalanced Datasets","authors":"Bin-Bin Yang, Shengjie Zhao, Kenan Ye, Rongqing Zhang","doi":"10.1145/3507548.3507612","DOIUrl":"https://doi.org/10.1145/3507548.3507612","url":null,"abstract":"Ordinal regression is a typical deep learning problem, which involves inherently ordered labels that are common in practical applications, especially in medical diagnosis tasks. To overcome the neglect of ordered or non-stationary property by merely exploiting classification or regression, quadratic weighted kappa (QWK) is proposed to be employed in the QWK loss function design as an efficient evaluation metric for ordinal regression. However, the paradox that kappa will be higher with an asymmetrical marginal histogram leads the QWK loss function to get the local optimal solution with all-zero-column in the confusion matrices during training. In practice, the all-zero column problem will result in a certain category not being detected at all, which can have serious consequences for the exclusion of pathology. To address this limitation, a new form of penalty term is proposed for the QWK loss function by penalizing the distance of marginal histogram to effectively avoid all-zero-column of the models. The experiments on the category-imbalanced datasets demonstrate that our penalty terms solve all-zero-column problem. On Adience dataset our penalty terms achieve 0.915 QWK, 0.446 MAE and 0.612 accuracy, while on DR dataset our penalty terms achieve 0.744 QWK, 0.281 MAE and 0.810 accuracy. Besides, experiments on the category-balanced datasets HCI show that our penalty terms achieve 0.810 QWK, 0.499 MAE and 0.610 accuracy.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132031111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecast of the Development of COVID-19 Based on the Small-World Network 基于小世界网络的新冠肺炎疫情发展预测
Xingye Bu, Naijie Gu
This world has faced a severe challenge since the breakout of the novel Coronavirus-2019 (COVID-19) has started for more than one year. With the mutation of the virus, the measures of epidemic prevention are keeping upgrading. Various vaccines have been created and brought into operation. To accurately describe and predict the spread of COVID-19, we improve the traditional Susceptible-Exposed-Infected-Removed-Dead model(SEIRD), forecast the development of COVID-19 based on small-world network. A small-world network is a type of mathematical graph in which most nodes are not neighbors of one another, but the neighbors of any given node are likely to be neighbors of each other, and most nodes can be reached from every other node by a small number of hops or steps. We introduce new parameters, Vaccination(V) and Quarantine(Q), into this model. Based on this, through regressing and analyzing the epidemic in the UK, we get the simulation that fits well with the observed data in other countries.
2019年新型冠状病毒(COVID-19)爆发一年多来,世界面临严峻挑战。随着病毒的变异,防疫措施也在不断升级。已经研制出各种疫苗并投入使用。为了准确地描述和预测新冠肺炎的传播,我们改进了传统的易感-暴露-感染-移除-死亡模型(SEIRD),基于小世界网络预测新冠肺炎的发展。小世界网络是一种数学图,其中大多数节点彼此之间不是邻居,但任何给定节点的邻居都可能彼此相邻,并且大多数节点可以通过少量跳数或步骤从每个其他节点到达。我们在这个模型中引入了新的参数,疫苗接种(V)和检疫(Q)。在此基础上,通过对英国疫情的回归分析,得到了与其他国家观测数据吻合较好的模拟结果。
{"title":"Forecast of the Development of COVID-19 Based on the Small-World Network","authors":"Xingye Bu, Naijie Gu","doi":"10.1145/3507548.3507575","DOIUrl":"https://doi.org/10.1145/3507548.3507575","url":null,"abstract":"This world has faced a severe challenge since the breakout of the novel Coronavirus-2019 (COVID-19) has started for more than one year. With the mutation of the virus, the measures of epidemic prevention are keeping upgrading. Various vaccines have been created and brought into operation. To accurately describe and predict the spread of COVID-19, we improve the traditional Susceptible-Exposed-Infected-Removed-Dead model(SEIRD), forecast the development of COVID-19 based on small-world network. A small-world network is a type of mathematical graph in which most nodes are not neighbors of one another, but the neighbors of any given node are likely to be neighbors of each other, and most nodes can be reached from every other node by a small number of hops or steps. We introduce new parameters, Vaccination(V) and Quarantine(Q), into this model. Based on this, through regressing and analyzing the epidemic in the UK, we get the simulation that fits well with the observed data in other countries.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130864257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Use Machine Learning to Predict the Running Time of the Program 使用机器学习来预测程序的运行时间
Xinyi Li, Yiyuan Wang, Ying Qian, Liang Dou
The prediction of program running time can be used to improve scheduling performance of distributed systems. In 2011, Google released a data set documenting the vast amount of information in the Google cluster. However, most of the existing running time prediction models only consider the coarse-grained characteristics of the running environment without considering the influence of the time series data of the running environment on the prediction results. Based on this, this paper innovatively proposes a model to predict the running time of the program, which predicts the future running time through historical information. At the same time, we also propose a new data processing and feature extraction scheme for Google cluster data sets. The results show that our model greatly outperforms the classical model on the Google cluster data set, and the root-mean-square error index of running time under different prediction modes is reduced by more than 60% and 40%, respectively. We hope that the model proposed in this paper can provide new research ideas for cloud computing system design.
程序运行时间的预测可用于提高分布式系统的调度性能。2011年,谷歌发布了一个数据集,记录了谷歌集群中的大量信息。然而,现有的运行时间预测模型大多只考虑了运行环境的粗粒度特征,而没有考虑运行环境时间序列数据对预测结果的影响。在此基础上,本文创新性地提出了一种预测程序运行时间的模型,通过历史信息预测未来的运行时间。同时,我们还提出了一种新的针对Google聚类数据集的数据处理和特征提取方案。结果表明,在Google聚类数据集上,我们的模型大大优于经典模型,不同预测模式下运行时间的均方根误差指数分别降低了60%以上和40%以上。我们希望本文提出的模型能够为云计算系统设计提供新的研究思路。
{"title":"Use Machine Learning to Predict the Running Time of the Program","authors":"Xinyi Li, Yiyuan Wang, Ying Qian, Liang Dou","doi":"10.1145/3507548.3507588","DOIUrl":"https://doi.org/10.1145/3507548.3507588","url":null,"abstract":"The prediction of program running time can be used to improve scheduling performance of distributed systems. In 2011, Google released a data set documenting the vast amount of information in the Google cluster. However, most of the existing running time prediction models only consider the coarse-grained characteristics of the running environment without considering the influence of the time series data of the running environment on the prediction results. Based on this, this paper innovatively proposes a model to predict the running time of the program, which predicts the future running time through historical information. At the same time, we also propose a new data processing and feature extraction scheme for Google cluster data sets. The results show that our model greatly outperforms the classical model on the Google cluster data set, and the root-mean-square error index of running time under different prediction modes is reduced by more than 60% and 40%, respectively. We hope that the model proposed in this paper can provide new research ideas for cloud computing system design.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121607015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-atlas segmentation of knee cartilage via Semi-supervised Regional Label Propagation 基于半监督区域标签传播的膝关节软骨多图谱分割
Christos G. Chadoulos, S. Moustakidis, D. Tsaopoulos, J. Theocharis
Multi-atlas based segmentation techniques have been proven to be effective in multiple automatic segmentation applications. However, mostly they rely on a non-deformable registration model followed by a voxel-wise classification process that incurs a large computational cost in terms of memory requirements and execution time. In this paper, a novel two-stage multi-atlas method is proposed, which combines constructively several concepts, including Semi-Supervised Learning (SSL), sparse graph constructions, voxel’s linear reconstructions via graph weights, and suitable sampling schemes for collecting data from target image and the atlas library. Representative global data sampled from target image are first classified according to SSL, using a newly proposed label propagation scheme. Next, out-of-sample data of yet unlabeled target voxels are iteratively generated through an iterative sampling based on mesh tetrahedralization. A thorough experimental investigation is conducted on 45 subjects provided by the publicly accessible Osteoarthritis Initiative (OAI) repository. Comparative analysis demonstrates that the proposed approach outperforms the existing state-of-the-art patch-based methods, across all evaluation metrics, exhibiting enhanced segmentation performance and reduced computational loads, respectively.
基于多图谱的图像分割技术已被证明在多种自动分割应用中是有效的。然而,它们大多依赖于不可变形的注册模型,然后是体素分类过程,这在内存需求和执行时间方面产生了大量的计算成本。本文提出了一种新的两阶段多地图集方法,该方法结合了半监督学习(SSL)、稀疏图构造、基于图权的体素线性重构以及从目标图像和地图集库中收集数据的合适采样方案等概念。首先,采用一种新提出的标签传播方案,对从目标图像中采样的具有代表性的全局数据进行SSL分类。其次,通过基于网格四面体化的迭代采样,迭代生成尚未标记的目标体素的样本外数据。对公开访问的骨关节炎倡议(OAI)知识库提供的45个受试者进行了彻底的实验调查。对比分析表明,该方法在所有评估指标上都优于现有的基于补丁的方法,分别表现出增强的分割性能和减少的计算负载。
{"title":"Multi-atlas segmentation of knee cartilage via Semi-supervised Regional Label Propagation","authors":"Christos G. Chadoulos, S. Moustakidis, D. Tsaopoulos, J. Theocharis","doi":"10.1145/3507548.3507557","DOIUrl":"https://doi.org/10.1145/3507548.3507557","url":null,"abstract":"Multi-atlas based segmentation techniques have been proven to be effective in multiple automatic segmentation applications. However, mostly they rely on a non-deformable registration model followed by a voxel-wise classification process that incurs a large computational cost in terms of memory requirements and execution time. In this paper, a novel two-stage multi-atlas method is proposed, which combines constructively several concepts, including Semi-Supervised Learning (SSL), sparse graph constructions, voxel’s linear reconstructions via graph weights, and suitable sampling schemes for collecting data from target image and the atlas library. Representative global data sampled from target image are first classified according to SSL, using a newly proposed label propagation scheme. Next, out-of-sample data of yet unlabeled target voxels are iteratively generated through an iterative sampling based on mesh tetrahedralization. A thorough experimental investigation is conducted on 45 subjects provided by the publicly accessible Osteoarthritis Initiative (OAI) repository. Comparative analysis demonstrates that the proposed approach outperforms the existing state-of-the-art patch-based methods, across all evaluation metrics, exhibiting enhanced segmentation performance and reduced computational loads, respectively.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114199461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1