首页 > 最新文献

World Wide Web-Internet and Web Information Systems最新文献

英文 中文
A generalized multi-skill aggregation method for cognitive diagnosis. 一种广义的多技能聚合认知诊断方法。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-01-01 DOI: 10.1007/s11280-021-00990-4
Suojuan Zhang, Song Huang, Xiaohan Yu, Enhong Chen, Fei Wang, Zhenya Huang

Online education brings more possibilities for personalized learning, in which identifying the cognitive state of learners is conducive to better providing learning services. Cognitive diagnosis is an effective measurement to assess the cognitive state of students through response data of answering the problems(e.g., right or wrong). Generally, the cognitive diagnosis framework includes the mastery of skills required by a specified problem and the aggregation of skills. The current multi-skill aggregation methods are mainly divided into conjunctive and compensatory methods and generally considered that each skill has the same effect on the correct response. However, in practical learning situations, there may be more complex interactions between skills, in which each skill has different weight impacting the final result. To this end, this paper proposes a generalized multi-skill aggregation method based on the Sugeno integral (SI-GAM) and introduces fuzzy measures to characterize the complex interactions between skills. We also provide a new idea for modeling multi-strategy problems. The cognitive diagnosis process is implemented by a more general and interpretable aggregation method. Finally, the feasibility and effectiveness of the model are verified on synthetic and real-world datasets.

在线教育为个性化学习带来了更多的可能性,识别学习者的认知状态有助于更好地提供学习服务。认知诊断是通过学生回答问题的反应数据来评估学生认知状态的一种有效手段。(对或错)。一般来说,认知诊断框架包括对特定问题所需技能的掌握和技能的聚合。目前的多技能聚合方法主要分为连接法和补偿法,一般认为每种技能对正确反应的影响是相同的。然而,在实际的学习情境中,技能之间可能存在更复杂的相互作用,其中每种技能对最终结果的影响程度不同。为此,本文提出了一种基于Sugeno积分(SI-GAM)的广义多技能聚合方法,并引入模糊度量来表征技能之间复杂的相互作用。为多策略问题的建模提供了新的思路。认知诊断过程采用一种更通用、可解释的聚合方法实现。最后,在合成数据集和实际数据集上验证了该模型的可行性和有效性。
{"title":"A generalized multi-skill aggregation method for cognitive diagnosis.","authors":"Suojuan Zhang,&nbsp;Song Huang,&nbsp;Xiaohan Yu,&nbsp;Enhong Chen,&nbsp;Fei Wang,&nbsp;Zhenya Huang","doi":"10.1007/s11280-021-00990-4","DOIUrl":"https://doi.org/10.1007/s11280-021-00990-4","url":null,"abstract":"<p><p>Online education brings more possibilities for personalized learning, in which identifying the cognitive state of learners is conducive to better providing learning services. Cognitive diagnosis is an effective measurement to assess the cognitive state of students through response data of answering the problems(e.g., right or wrong). Generally, the cognitive diagnosis framework includes the mastery of skills required by a specified problem and the aggregation of skills. The current multi-skill aggregation methods are mainly divided into conjunctive and compensatory methods and generally considered that each skill has the same effect on the correct response. However, in practical learning situations, there may be more complex interactions between skills, in which each skill has different weight impacting the final result. To this end, this paper proposes a generalized multi-skill aggregation method based on the Sugeno integral (SI-GAM) and introduces fuzzy measures to characterize the complex interactions between skills. We also provide a new idea for modeling multi-strategy problems. The cognitive diagnosis process is implemented by a more general and interpretable aggregation method. Finally, the feasibility and effectiveness of the model are verified on synthetic and real-world datasets.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 2","pages":"585-614"},"PeriodicalIF":3.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9106983/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9080628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Identifying informative tweets during a pandemic via a topic-aware neural language model. 通过主题感知神经语言模型识别大流行期间的信息性推文。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-01-01 DOI: 10.1007/s11280-022-01034-1
Wang Gao, Lin Li, Xiaohui Tao, Jing Zhou, Jun Tao

Every epidemic affects the real lives of many people around the world and leads to terrible consequences. Recently, many tweets about the COVID-19 pandemic have been shared publicly on social media platforms. The analysis of these tweets is helpful for emergency response organizations to prioritize their tasks and make better decisions. However, most of these tweets are non-informative, which is a challenge for establishing an automated system to detect useful information in social media. Furthermore, existing methods ignore unlabeled data and topic background knowledge, which can provide additional semantic information. In this paper, we propose a novel Topic-Aware BERT (TABERT) model to solve the above challenges. TABERT first leverages a topic model to extract the latent topics of tweets. Secondly, a flexible framework is used to combine topic information with the output of BERT. Finally, we adopt adversarial training to achieve semi-supervised learning, and a large amount of unlabeled data can be used to improve inner representations of the model. Experimental results on the dataset of COVID-19 English tweets show that our model outperforms classic and state-of-the-art baselines.

每一种流行病都影响到世界各地许多人的现实生活,并导致可怕的后果。最近,社交媒体平台上公开分享了许多关于COVID-19大流行的推文。对这些推文的分析有助于应急响应组织确定任务的优先级并做出更好的决策。然而,这些推文大多是非信息性的,这对建立一个自动化系统来检测社交媒体中有用的信息是一个挑战。此外,现有方法忽略了未标记数据和主题背景知识,这可以提供额外的语义信息。在本文中,我们提出了一个新的主题感知BERT (TABERT)模型来解决上述挑战。TABERT首先利用主题模型提取推文的潜在主题。其次,采用灵活的框架将主题信息与BERT的输出结合起来。最后,我们采用对抗性训练来实现半监督学习,并且可以使用大量的未标记数据来改进模型的内部表示。在COVID-19英文推文数据集上的实验结果表明,我们的模型优于经典和最先进的基线。
{"title":"Identifying informative tweets during a pandemic via a topic-aware neural language model.","authors":"Wang Gao,&nbsp;Lin Li,&nbsp;Xiaohui Tao,&nbsp;Jing Zhou,&nbsp;Jun Tao","doi":"10.1007/s11280-022-01034-1","DOIUrl":"https://doi.org/10.1007/s11280-022-01034-1","url":null,"abstract":"<p><p>Every epidemic affects the real lives of many people around the world and leads to terrible consequences. Recently, many tweets about the COVID-19 pandemic have been shared publicly on social media platforms. The analysis of these tweets is helpful for emergency response organizations to prioritize their tasks and make better decisions. However, most of these tweets are non-informative, which is a challenge for establishing an automated system to detect useful information in social media. Furthermore, existing methods ignore unlabeled data and topic background knowledge, which can provide additional semantic information. In this paper, we propose a novel Topic-Aware BERT (TABERT) model to solve the above challenges. TABERT first leverages a topic model to extract the latent topics of tweets. Secondly, a flexible framework is used to combine topic information with the output of BERT. Finally, we adopt adversarial training to achieve semi-supervised learning, and a large amount of unlabeled data can be used to improve inner representations of the model. Experimental results on the dataset of COVID-19 English tweets show that our model outperforms classic and state-of-the-art baselines.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 1","pages":"55-70"},"PeriodicalIF":3.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8924578/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10568308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Auxiliary signal-guided knowledge encoder-decoder for medical report generation. 用于医疗报告生成的辅助信号引导知识编解码器。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-01-01 DOI: 10.1007/s11280-022-01013-6
Mingjie Li, Rui Liu, Fuyu Wang, Xiaojun Chang, Xiaodan Liang

Medical reports have significant clinical value to radiologists and specialists, especially during a pandemic like COVID. However, beyond the common difficulties faced in the natural image captioning, medical report generation specifically requires the model to describe a medical image with a fine-grained and semantic-coherence paragraph that should satisfy both medical commonsense and logic. Previous works generally extract the global image features and attempt to generate a paragraph that is similar to referenced reports; however, this approach has two limitations. Firstly, the regions of primary interest to radiologists are usually located in a small area of the global image, meaning that the remainder parts of the image could be considered as irrelevant noise in the training procedure. Secondly, there are many similar sentences used in each medical report to describe the normal regions of the image, which causes serious data bias. This deviation is likely to teach models to generate these inessential sentences on a regular basis. To address these problems, we propose an Auxiliary Signal-Guided Knowledge Encoder-Decoder (ASGK) to mimic radiologists' working patterns. Specifically, the auxiliary patches are explored to expand the widely used visual patch features before fed to the Transformer encoder, while the external linguistic signals help the decoder better master prior knowledge during the pre-training process. Our approach performs well on common benchmarks, including CX-CHR, IU X-Ray, and COVID-19 CT Report dataset (COV-CTR), demonstrating combining auxiliary signals with transformer architecture can bring a significant improvement in terms of medical report generation. The experimental results confirm that auxiliary signals driven Transformer-based models are with solid capabilities to outperform previous approaches on both medical terminology classification and paragraph generation metrics.

医疗报告对放射科医生和专家具有重要的临床价值,特别是在COVID等大流行期间。然而,除了在自然图像字幕中面临的常见困难之外,医学报告生成特别要求模型用细粒度和语义连贯的段落来描述医学图像,这些段落应满足医学常识和逻辑。以前的作品一般提取全局图像特征,并尝试生成与参考报告相似的段落;然而,这种方法有两个限制。首先,放射科医生主要感兴趣的区域通常位于全局图像的一小块区域,这意味着图像的其余部分在训练过程中可以被视为无关的噪声。其次,每一份医学报告都使用了许多相似的句子来描述图像的正常区域,这导致了严重的数据偏差。这种偏差很可能会教会模型定期生成这些不必要的句子。为了解决这些问题,我们提出了一个辅助信号引导知识编码器-解码器(ASGK)来模拟放射科医生的工作模式。具体来说,我们探索了辅助补丁来扩展广泛使用的视觉补丁特征,然后再馈给变压器编码器,而外部语言信号帮助解码器在预训练过程中更好地掌握先验知识。我们的方法在CX-CHR、IU X-Ray和COVID-19 CT报告数据集(COV-CTR)等常用基准上表现良好,表明将辅助信号与变压器架构相结合可以显著改善医疗报告生成。实验结果证实,辅助信号驱动的基于变压器的模型在医学术语分类和段落生成指标上都具有优于先前方法的坚实能力。
{"title":"Auxiliary signal-guided knowledge encoder-decoder for medical report generation.","authors":"Mingjie Li,&nbsp;Rui Liu,&nbsp;Fuyu Wang,&nbsp;Xiaojun Chang,&nbsp;Xiaodan Liang","doi":"10.1007/s11280-022-01013-6","DOIUrl":"https://doi.org/10.1007/s11280-022-01013-6","url":null,"abstract":"<p><p>Medical reports have significant clinical value to radiologists and specialists, especially during a pandemic like COVID. However, beyond the common difficulties faced in the natural image captioning, medical report generation specifically requires the model to describe a medical image with a fine-grained and semantic-coherence paragraph that should satisfy both medical commonsense and logic. Previous works generally extract the global image features and attempt to generate a paragraph that is similar to referenced reports; however, this approach has two limitations. Firstly, the regions of primary interest to radiologists are usually located in a small area of the global image, meaning that the remainder parts of the image could be considered as irrelevant noise in the training procedure. Secondly, there are many similar sentences used in each medical report to describe the normal regions of the image, which causes serious data bias. This deviation is likely to teach models to generate these inessential sentences on a regular basis. To address these problems, we propose an Auxiliary Signal-Guided Knowledge Encoder-Decoder (ASGK) to mimic radiologists' working patterns. Specifically, the auxiliary patches are explored to expand the widely used visual patch features before fed to the Transformer encoder, while the external linguistic signals help the decoder better master prior knowledge during the pre-training process. Our approach performs well on common benchmarks, including CX-CHR, IU X-Ray, and COVID-19 CT Report dataset (COV-CTR), demonstrating combining auxiliary signals with transformer architecture can bring a significant improvement in terms of medical report generation. The experimental results confirm that auxiliary signals driven Transformer-based models are with solid capabilities to outperform previous approaches on both medical terminology classification and paragraph generation metrics.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 1","pages":"253-270"},"PeriodicalIF":3.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9417931/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9136361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip. 用于髋关节发育不良在线自动诊断的多任务沙漏网络。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-01-01 DOI: 10.1007/s11280-022-01051-0
Jingyuan Xu, Hongtao Xie, Qingfeng Tan, Hai Wu, Chuanbin Liu, Sicheng Zhang, Zhendong Mao, Yongdong Zhang

Developmental dysplasia of the hip (DDH) is one of the most common diseases in children. Due to the experience-requiring medical image analysis work, online automatic diagnosis of DDH has intrigued the researchers. Traditional implementation of online diagnosis faces challenges with reliability and interpretability. In this paper, we establish an online diagnosis tool based on a multi-task hourglass network, which can accurately extract landmarks to detect the extent of hip dislocation and predict the age of the femoral head. Our method utilizes a multi-task hourglass network, which trains an encoder-decoder network to regress the landmarks and predict the developmental age for online DDH diagnosis. With the support of precise image analysis and fast GPU computing, our method can help overcome the shortage of medical resources and enable telehealth for DDH diagnosis. Applying this approach to a dataset of DDH X-ray images, we demonstrate 4.64 mean pixel error of landmark detection compared to the results of human experts. Moreover, we can improve the accuracy of the age prediction of femoral heads to 89%. Our online automatic diagnosis system has provided service to 112 patients, and the results demonstrate the effectiveness of our method.

髋关节发育不良(DDH)是儿童最常见的疾病之一。由于需要经验的医学图像分析工作,DDH的在线自动诊断引起了研究人员的兴趣。传统的在线诊断实现面临可靠性和可解释性的挑战。本文建立了一种基于多任务沙漏网络的在线诊断工具,该工具可以准确地提取标志物,检测髋关节脱位程度,预测股骨头年龄。我们的方法利用多任务沙漏网络,该网络训练编码器-解码器网络来回归标记并预测在线DDH诊断的发育年龄。在精确的图像分析和快速的GPU计算的支持下,我们的方法可以帮助克服医疗资源的短缺,实现DDH诊断的远程医疗。将该方法应用于DDH x射线图像数据集,与人类专家的结果相比,我们证明了4.64个平均像素误差。此外,我们可以将股骨头年龄预测的准确率提高到89%。该在线自动诊断系统已为112例患者提供了服务,结果证明了该方法的有效性。
{"title":"Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip.","authors":"Jingyuan Xu,&nbsp;Hongtao Xie,&nbsp;Qingfeng Tan,&nbsp;Hai Wu,&nbsp;Chuanbin Liu,&nbsp;Sicheng Zhang,&nbsp;Zhendong Mao,&nbsp;Yongdong Zhang","doi":"10.1007/s11280-022-01051-0","DOIUrl":"https://doi.org/10.1007/s11280-022-01051-0","url":null,"abstract":"<p><p>Developmental dysplasia of the hip (DDH) is one of the most common diseases in children. Due to the experience-requiring medical image analysis work, online automatic diagnosis of DDH has intrigued the researchers. Traditional implementation of online diagnosis faces challenges with reliability and interpretability. In this paper, we establish an online diagnosis tool based on a multi-task hourglass network, which can accurately extract landmarks to detect the extent of hip dislocation and predict the age of the femoral head. Our method utilizes a multi-task hourglass network, which trains an encoder-decoder network to regress the landmarks and predict the developmental age for online DDH diagnosis. With the support of precise image analysis and fast GPU computing, our method can help overcome the shortage of medical resources and enable telehealth for DDH diagnosis. Applying this approach to a dataset of DDH X-ray images, we demonstrate 4.64 mean pixel error of landmark detection compared to the results of human experts. Moreover, we can improve the accuracy of the age prediction of femoral heads to 89<i>%</i>. Our online automatic diagnosis system has provided service to 112 patients, and the results demonstrate the effectiveness of our method.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 2","pages":"539-559"},"PeriodicalIF":3.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9064728/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9390760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Improving medical experts' efficiency of misinformation detection: an exploratory study. 提高医学专家误报检测效率的探索性研究。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-01-01 DOI: 10.1007/s11280-022-01084-5
Aleksandra Nabożny, Bartłomiej Balcerzak, Mikołaj Morzy, Adam Wierzbicki, Pavel Savov, Kamil Warpechowski

Fighting medical disinformation in the era of the pandemic is an increasingly important problem. Today, automatic systems for assessing the credibility of medical information do not offer sufficient precision, so human supervision and the involvement of medical expert annotators are required. Our work aims to optimize the utilization of medical experts' time. We also equip them with tools for semi-automatic initial verification of the credibility of the annotated content. We introduce a general framework for filtering medical statements that do not require manual evaluation by medical experts, thus focusing annotation efforts on non-credible medical statements. Our framework is based on the construction of filtering classifiers adapted to narrow thematic categories. This allows medical experts to fact-check and identify over two times more non-credible medical statements in a given time interval without applying any changes to the annotation flow. We verify our results across a broad spectrum of medical topic areas. We perform quantitative, as well as exploratory analysis on our output data. We also point out how those filtering classifiers can be modified to provide experts with different types of feedback without any loss of performance.

在大流行时代,打击医疗虚假信息是一个日益重要的问题。今天,用于评估医疗信息可信度的自动系统不能提供足够的精度,因此需要人类监督和医学专家注释者的参与。我们的工作旨在优化利用医学专家的时间。我们还为他们配备了工具,用于半自动初始验证注释内容的可信度。我们引入了一个通用框架,用于过滤不需要医学专家手动评估的医学陈述,从而将注释工作集中在不可信的医学陈述上。我们的框架是基于适应窄主题类别的过滤分类器的构造。这允许医学专家在给定的时间间隔内检查和识别超过两倍的不可信的医学陈述,而无需对注释流应用任何更改。我们在广泛的医学主题领域验证我们的结果。我们对输出数据进行定量和探索性分析。我们还指出如何修改这些过滤分类器,以便在不损失任何性能的情况下为专家提供不同类型的反馈。
{"title":"Improving medical experts' efficiency of misinformation detection: an exploratory study.","authors":"Aleksandra Nabożny,&nbsp;Bartłomiej Balcerzak,&nbsp;Mikołaj Morzy,&nbsp;Adam Wierzbicki,&nbsp;Pavel Savov,&nbsp;Kamil Warpechowski","doi":"10.1007/s11280-022-01084-5","DOIUrl":"https://doi.org/10.1007/s11280-022-01084-5","url":null,"abstract":"<p><p>Fighting medical disinformation in the era of the pandemic is an increasingly important problem. Today, automatic systems for assessing the credibility of medical information do not offer sufficient precision, so human supervision and the involvement of medical expert annotators are required. Our work aims to optimize the utilization of medical experts' time. We also equip them with tools for semi-automatic initial verification of the credibility of the annotated content. We introduce a general framework for filtering medical statements that do not require manual evaluation by medical experts, thus focusing annotation efforts on non-credible medical statements. Our framework is based on the construction of filtering classifiers adapted to narrow thematic categories. This allows medical experts to fact-check and identify over two times more non-credible medical statements in a given time interval without applying any changes to the annotation flow. We verify our results across a broad spectrum of medical topic areas. We perform quantitative, as well as exploratory analysis on our output data. We also point out how those filtering classifiers can be modified to provide experts with different types of feedback without any loss of performance.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 2","pages":"773-798"},"PeriodicalIF":3.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371952/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9390782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Clustering-enhanced stock price prediction using deep learning. 基于深度学习的聚类增强股价预测。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-01-01 DOI: 10.1007/s11280-021-01003-0
Man Li, Ye Zhu, Yuxin Shen, Maia Angelova

In recent years, artificial intelligence technologies have been successfully applied in time series prediction and analytic tasks. At the same time, a lot of attention has been paid to financial time series prediction, which targets the development of novel deep learning models or optimize the forecasting results. To optimize the accuracy of stock price prediction, in this paper, we propose a clustering-enhanced deep learning framework to predict stock prices with three matured deep learning forecasting models, such as Long Short-Term Memory (LSTM), Recurrent Neural Network (RNN) and Gated Recurrent Unit (GRU). The proposed framework considers the clustering as the forecasting pre-processing, which can improve the quality of the training models. To achieve the effective clustering, we propose a new similarity measure, called Logistic Weighted Dynamic Time Warping (LWDTW), by extending a Weighted Dynamic Time Warping (WDTW) method to capture the relative importance of return observations when calculating distance matrices. Especially, based on the empirical distributions of stock returns, the cost weight function of WDTW is modified with logistic probability density distribution function. In addition, we further implement the clustering-based forecasting framework with the above three deep learning models. Finally, extensive experiments on daily US stock price data sets show that our framework has achieved excellent forecasting performance with overall best results for the combination of Logistic WDTW clustering and LSTM model using 5 different evaluation metrics.

近年来,人工智能技术已成功地应用于时间序列预测和分析任务中。与此同时,金融时间序列预测也受到了很多关注,其目标是开发新的深度学习模型或优化预测结果。为了优化股票价格预测的准确性,本文提出了一个聚类增强的深度学习框架,利用长短期记忆(LSTM)、循环神经网络(RNN)和门控循环单元(GRU)三种成熟的深度学习预测模型来预测股票价格。该框架将聚类作为预测预处理,可以提高训练模型的质量。为了实现有效的聚类,我们提出了一种新的相似性度量,称为Logistic加权动态时间扭曲(LWDTW),通过扩展加权动态时间扭曲(WDTW)方法来捕获返回观测值在计算距离矩阵时的相对重要性。特别地,基于股票收益的经验分布,用logistic概率密度分布函数对WDTW的成本权重函数进行了修正。此外,我们利用上述三个深度学习模型进一步实现了基于聚类的预测框架。最后,在美国每日股票价格数据集上进行的大量实验表明,我们的框架取得了出色的预测性能,使用5种不同的评估指标将Logistic WDTW聚类和LSTM模型相结合,总体效果最好。
{"title":"Clustering-enhanced stock price prediction using deep learning.","authors":"Man Li,&nbsp;Ye Zhu,&nbsp;Yuxin Shen,&nbsp;Maia Angelova","doi":"10.1007/s11280-021-01003-0","DOIUrl":"https://doi.org/10.1007/s11280-021-01003-0","url":null,"abstract":"<p><p>In recent years, artificial intelligence technologies have been successfully applied in time series prediction and analytic tasks. At the same time, a lot of attention has been paid to financial time series prediction, which targets the development of novel deep learning models or optimize the forecasting results. To optimize the accuracy of stock price prediction, in this paper, we propose a clustering-enhanced deep learning framework to predict stock prices with three matured deep learning forecasting models, such as Long Short-Term Memory (LSTM), Recurrent Neural Network (RNN) and Gated Recurrent Unit (GRU). The proposed framework considers the clustering as the forecasting pre-processing, which can improve the quality of the training models. To achieve the effective clustering, we propose a new similarity measure, called Logistic Weighted Dynamic Time Warping (LWDTW), by extending a Weighted Dynamic Time Warping (WDTW) method to capture the relative importance of return observations when calculating distance matrices. Especially, based on the empirical distributions of stock returns, the cost weight function of WDTW is modified with logistic probability density distribution function. In addition, we further implement the clustering-based forecasting framework with the above three deep learning models. Finally, extensive experiments on daily US stock price data sets show that our framework has achieved excellent forecasting performance with overall best results for the combination of Logistic WDTW clustering and LSTM model using 5 different evaluation metrics.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 1","pages":"207-232"},"PeriodicalIF":3.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9009501/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10573909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Preface to the special issue on the Asia Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data (APWeb-WAIM) 2021 2021年亚太网络(APWeb)和网络时代信息管理(WAIM)网络与大数据联合国际会议(APWeb-WAIM)特刊前言
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-31 DOI: 10.1007/s11280-022-01133-z
Leong Hou U, Yasushi Sakurai, M. Spaniol
{"title":"Preface to the special issue on the Asia Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data (APWeb-WAIM) 2021","authors":"Leong Hou U, Yasushi Sakurai, M. Spaniol","doi":"10.1007/s11280-022-01133-z","DOIUrl":"https://doi.org/10.1007/s11280-022-01133-z","url":null,"abstract":"","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"26 1","pages":"635-636"},"PeriodicalIF":3.7,"publicationDate":"2022-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90971985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Durable queries over non-synchronized temporal data 对非同步时态数据的持久查询
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-24 DOI: 10.1007/s11280-022-01122-2
Yalan Xie, W. Weng, Jianmin Li
{"title":"Durable queries over non-synchronized temporal data","authors":"Yalan Xie, W. Weng, Jianmin Li","doi":"10.1007/s11280-022-01122-2","DOIUrl":"https://doi.org/10.1007/s11280-022-01122-2","url":null,"abstract":"","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"43 1","pages":"2099-2113"},"PeriodicalIF":3.7,"publicationDate":"2022-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83038882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MCGM: A multi-channel CTR model with hierarchical gated mechanism for precision marketing MCGM:基于分级门控机制的精准营销多渠道点击率模型
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-24 DOI: 10.1007/s11280-022-01125-z
Zilong Jiang, Lin Li, Dali Wang
{"title":"MCGM: A multi-channel CTR model with hierarchical gated mechanism for precision marketing","authors":"Zilong Jiang, Lin Li, Dali Wang","doi":"10.1007/s11280-022-01125-z","DOIUrl":"https://doi.org/10.1007/s11280-022-01125-z","url":null,"abstract":"","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":"7 1","pages":"2115-2141"},"PeriodicalIF":3.7,"publicationDate":"2022-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87321257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modeling the social influence of COVID-19 via personalized propagation with deep learning. 通过深度学习的个性化传播对 COVID-19 的社会影响力进行建模。
IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-12-17 DOI: 10.1007/s11280-022-01129-9
Yufei Liu, Jie Cao, Jia Wu, Dechang Pi

Social influence prediction has permeated many domains, including marketing, behavior prediction, recommendation systems, and more. However, traditional methods of predicting social influence not only require domain expertise, they also rely on extracting user features, which can be very tedious. Additionally, graph convolutional networks (GCNs), which deals with graph data in non-Euclidean space, are not directly applicable to Euclidean space. To overcome these problems, we extended DeepInf such that it can predict the social influence of COVID-19 via the transition probability of the page rank domain. Furthermore, our implementation gives rise to a deep learning-based personalized propagation algorithm, called DeepPP. The resulting algorithm combines the personalized propagation of a neural prediction model with the approximate personalized propagation of a neural prediction model from page rank analysis. Four social networks from different domains as well as two COVID-19 datasets were used to analyze the proposed algorithm's efficiency and effectiveness. Compared to other baseline methods, DeepPP provides more accurate social influence predictions. Further, experiments demonstrate that DeepPP can be applied to real-world prediction data for COVID-19.

社会影响力预测已经渗透到许多领域,包括市场营销、行为预测、推荐系统等。然而,预测社会影响力的传统方法不仅需要领域专业知识,还依赖于提取用户特征,这可能非常繁琐。此外,处理非欧几里得空间图数据的图卷积网络(GCN)并不能直接适用于欧几里得空间。为了克服这些问题,我们对 DeepInf 进行了扩展,使其能够通过页面排名域的过渡概率预测 COVID-19 的社会影响力。此外,我们的实现还产生了一种基于深度学习的个性化传播算法,称为 DeepPP。该算法将神经预测模型的个性化传播与来自页面排名分析的神经预测模型的近似个性化传播相结合。我们使用了四个不同领域的社交网络以及两个 COVID-19 数据集来分析所提出算法的效率和效果。与其他基准方法相比,DeepPP 提供了更准确的社会影响力预测。此外,实验证明 DeepPP 可以应用于 COVID-19 的真实世界预测数据。
{"title":"Modeling the social influence of COVID-19 via personalized propagation with deep learning.","authors":"Yufei Liu, Jie Cao, Jia Wu, Dechang Pi","doi":"10.1007/s11280-022-01129-9","DOIUrl":"10.1007/s11280-022-01129-9","url":null,"abstract":"<p><p>Social influence prediction has permeated many domains, including marketing, behavior prediction, recommendation systems, and more. However, traditional methods of predicting social influence not only require domain expertise, they also rely on extracting user features, which can be very tedious. Additionally, graph convolutional networks (GCNs), which deals with graph data in non-Euclidean space, are not directly applicable to Euclidean space. To overcome these problems, we extended DeepInf such that it can predict the social influence of COVID-19 via the transition probability of the page rank domain. Furthermore, our implementation gives rise to a deep learning-based personalized propagation algorithm, called DeepPP. The resulting algorithm combines the personalized propagation of a neural prediction model with the approximate personalized propagation of a neural prediction model from page rank analysis. Four social networks from different domains as well as two COVID-19 datasets were used to analyze the proposed algorithm's efficiency and effectiveness. Compared to other baseline methods, DeepPP provides more accurate social influence predictions. Further, experiments demonstrate that DeepPP can be applied to real-world prediction data for COVID-19.</p>","PeriodicalId":49356,"journal":{"name":"World Wide Web-Internet and Web Information Systems","volume":" ","pages":"1-23"},"PeriodicalIF":3.7,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9758027/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10443669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
World Wide Web-Internet and Web Information Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1