Data Technologies and Applications: latest articles, page 2

Practice challenge recommendations in online judge using implicit rating extraction and utility sequence patterns

IF 1.6 | Zone 4, Computer Science | Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2024-05-29 DOI: 10.1108/dta-10-2023-0688

Ramesh P Natarajan, Kannimuthu S, Bhanu D

Purpose

Traditional recommenders based on content-based filtering (CBF), collaborative filtering (CF) and hybrid approaches are inadequate for recommending practice challenges in a programming online judge (POJ). These systems consider only the preferences of the target user or of similar users when recommending items. In a learning environment, a recommender system should also consider the learning path, knowledge level and ability of the learner. Another major problem is that POJ learners do not rate practice challenges the way users of e-commerce and video streaming portals rate items. The purpose of the proposed approach is to overcome these shortcomings.

Design/methodology/approach

To achieve context-aware practice challenge recommendation, the proposed approach uses data preparation techniques including implicit rating extraction, data preprocessing to remove outliers, sequence-based learner clustering and utility sequence pattern mining. The approach ensures that the recommender system considers the knowledge level, learning path and learning goals of the learner when recommending practice challenges.
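As an illustration only, implicit rating extraction can be sketched as follows. The abstract does not state the authors' mathematical model, so the scoring rule used here (full marks for a first-attempt solve, one point lost per extra attempt, lowest score for an unsolved challenge), the `Submission` record and the `implicit_ratings` function are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    learner: str
    challenge: str
    accepted: bool  # verdict of this single attempt


def implicit_ratings(submissions, max_rating=5.0):
    """Map each (learner, challenge) pair to a rating in [1, max_rating].

    Hypothetical rule, not the paper's model: a solved challenge scores
    max_rating on the first attempt and loses one point per extra attempt
    (never below 2); an unsolved challenge scores 1.
    """
    attempts = {}
    solved = {}
    for s in submissions:  # submissions are assumed to be in time order
        key = (s.learner, s.challenge)
        if solved.get(key):
            continue  # ignore resubmissions after the first accepted one
        attempts[key] = attempts.get(key, 0) + 1
        solved[key] = s.accepted

    ratings = {}
    for key, n in attempts.items():
        if solved[key]:
            ratings[key] = max(2.0, max_rating - (n - 1))
        else:
            ratings[key] = 1.0
    return ratings


log = [
    Submission("alice", "p1", False),
    Submission("alice", "p1", True),
    Submission("alice", "p2", True),
    Submission("bob", "p1", False),
]
for pair, rating in sorted(implicit_ratings(log).items()):
    print(pair, rating)
```

Ratings of this kind could then stand in for the explicit ratings that a conventional recommender expects, or serve as the utility values in a utility sequence miner.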

Findings

Experiments on practice challenge recommendation conducted using a real-world POJ dataset show that the proposed system outperforms traditional approaches. The experiments also demonstrate that the proposed system recommends challenges based on the learner's current context. The implicit ratings extracted using the proposed approach work accurately in the recommender system.

Originality/value

The proposed system contains the following novel approaches to address the lack of ratings and of context-aware recommendations. A mathematical model is used to extract ratings from learner submissions. A statistical approach is used in data preprocessing. Sequence similarity-based learner clustering is used in the transition matrix. Using the rating as the utility in the USPAN algorithm provides useful insights into learner–challenge relationships.


Citations: 0

A novel neural network architecture and cross-model transfer learning for multi-task autonomous driving

Pub Date : 2024-04-12 DOI: 10.1108/dta-08-2022-0307

Youwei Li, Jian Qu

Purpose

The purpose of this research is to achieve multi-task autonomous driving by adjusting the network architecture of the model. After achieving multi-task autonomous driving, the authors found that the trained neural network model performed poorly in untrained scenarios. They therefore proposed using transfer learning to improve the transfer efficiency of the model in new scenarios.

Design/methodology/approach

First, the authors achieved multi-task autonomous driving by training a model that combines a convolutional neural network with differently structured long short-term memory (LSTM) layers. Second, the authors achieved fast transfer of neural network models to new scenarios through cross-model transfer learning. Finally, the authors combined data collection and data labeling to improve the efficiency of deep learning. Furthermore, the authors verified through light and shadow tests that the model has good robustness.

Findings

This research achieved road tracking, real-time acceleration–deceleration, obstacle avoidance and left/right sign recognition. The model proposed by the authors (UniBiCLSTM) outperforms the existing models tested with model cars in terms of autonomous driving performance. Furthermore, the CMTL-UniBiCL-RL model trained by the authors through cross-model transfer learning improves the efficiency of model adaptation to new scenarios. Meanwhile, this research proposed an automatic data annotation method, which can save 1/4 of the time for deep learning.

Originality/value

This research provided novel solutions for achieving multi-task autonomous driving and for transferring neural network models to new scenarios. The experiment was carried out using a single camera with an embedded chip and a scale model car, which is expected to simplify the hardware for autonomous driving.


Citations: 0

Application of deep learning model incorporating domain knowledge in international migration forecasting

Pub Date : 2024-04-12 DOI: 10.1108/dta-08-2023-0523

Tongzheng Pu, Chongxing Huang, Haimo Zhang, Jingjing Yang, Ming Huang

Purpose

Forecasting population movement trends is crucial for implementing effective policies to regulate labor force growth and understand demographic changes. Combining migration theory expertise and neural network technology can bring a fresh perspective to international migration forecasting research.

Design/methodology/approach

This study proposes a conditional generative adversarial network model that incorporates migration knowledge: the migration knowledge conditional generative adversarial network (MK-CGAN). By using migration knowledge to design the parameters, MK-CGAN can effectively address the limited data problem, thereby enhancing the accuracy of migration forecasts.

Findings

The model was tested by forecasting migration flows between different countries and showed good generalizability and validity. The results are robust: compared with long short-term memory (LSTM), gated recurrent unit, generative adversarial network (GAN) and the traditional gravity model, the proposed solution achieves lower mean absolute error, mean squared error, root mean square error and mean absolute percentage error, with an R² value reaching 0.9855.
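The error measures named in these findings have standard definitions, which a short sketch makes concrete. The function name `regression_metrics` and the sample flows below are illustrative assumptions, not data from the study.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE, MAPE (in percent) and R^2 for two equal-length lists.

    MAPE divides by the true values, so it assumes none of them is zero.
    """
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    mape = 100.0 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n

    # R^2 = 1 - (residual sum of squares / total sum of squares)
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot

    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "MAPE": mape, "R2": r2}


# Made-up observed and forecast migration flows, for illustration only.
observed = [100.0, 200.0, 300.0, 400.0]
forecast = [110.0, 190.0, 310.0, 390.0]
for name, value in regression_metrics(observed, forecast).items():
    print(f"{name}: {value:.4f}")
```

Lower values are better for the four error measures, whereas R² is better the closer it is to 1.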

Originality/value

This study is significant because it demonstrates a highly effective technique for predicting international migration using conditional GANs. By incorporating migration knowledge into our models, we can improve prediction accuracy and gain valuable insights into the differences between various model characteristics. We used SHapley Additive exPlanations to enhance our understanding of these differences and provide clear and concise explanations for our model predictions. The results demonstrated the theoretical significance and practical value of the MK-CGAN model in predicting international migration.


Citations: 0

Exploring cross-cultural disparities in tourists' perceived images: a text mining and sentiment analysis study using LDA and BERT-BILSTM models

Pub Date : 2024-03-20 DOI: 10.1108/dta-10-2023-0645

Qiuying Chen, Ronghui Liu, Qingquan Jiang, Shangyue Xu

Purpose

Tourists with different cultural backgrounds think and behave differently. Accurately capturing and correctly understanding cultural differences will help tourist destinations in product/service planning, marketing communication and attracting and retaining tourists. This research employs Hofstede's cultural dimensions theory to analyse the variations in destination image perceptions of Chinese-speaking and English-speaking tourists to Xiamen, a prominent tourist attraction in China.

Design/methodology/approach

The evaluation utilizes a two-stage approach, incorporating LDA and BERT-BILSTM models. By leveraging text mining, sentiment analysis and t-tests, this research investigates the variations in tourists' perceptions of Xiamen across different cultures.
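A two-sample t-test of the kind mentioned here can be sketched as below. The abstract does not say which variant the authors used, so this sketch assumes Welch's unequal-variance form; the function name `welch_t` and the sentiment scores are made-up placeholders.

```python
import math
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    # Squared standard error of each sample mean (sample variance / n).
    se2_a = variance(sample_a) / na
    se2_b = variance(sample_b) / nb

    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (na - 1) + se2_b ** 2 / (nb - 1))
    return t, df


# Hypothetical per-review sentiment scores for one topic, by language group.
chinese_reviews = [0.8, 0.6, 0.7, 0.9]
english_reviews = [0.4, 0.5, 0.3, 0.6]
t_stat, dof = welch_t(chinese_reviews, english_reviews)
print(f"t = {t_stat:.3f}, df = {dof:.1f}")
```

The statistic would then be compared against a t distribution with `dof` degrees of freedom to judge whether the two groups' mean sentiment differs significantly.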

Findings

The results reveal that cultural disparities significantly impact tourists' perceived image of Xiamen, particularly regarding their preferences for renowned tourist destinations and the factors influencing their travel experience.

Originality/value

This research pioneers the application of natural language processing methods and machine learning techniques to affirm, based on Hofstede's cultural theory, the substantial differences in the perceptions of tourist destinations between Chinese-speaking and English-speaking tourists. The findings furnish theoretical insights for destination marketing organizations to target culturally diverse tourists through precise marketing strategies and illuminate the practical application of Hofstede's cultural theory in tourism and hospitality.


Citations: 0

Light field image coding using a residual channel attention network–based view synthesis

Pub Date : 2024-02-21 DOI: 10.1108/dta-03-2023-0071

Faguo Liu, Qian Zhang, Tao Yan, Bin Wang, Ying Gao, Jiaqi Hou, Feiniu Yuan

Purpose

Light field images (LFIs) have gained popularity as a technology to increase the field of view (FoV) of plenoptic cameras since they can capture information about light rays with a large FoV. Wide FoV causes light field (LF) data to increase rapidly, which restricts the use of LF imaging in image processing, visual analysis and user interface. Effective LFI coding methods become of paramount importance. This paper aims to eliminate more redundancy by exploring sparsity and correlation in the angular domain of LFIs, as well as mitigate the loss of perceptual quality of LFIs caused by encoding.

Design/methodology/approach

This work proposes a new efficient LF coding framework. On the coding side, a new sampling scheme and a hierarchical prediction structure are used to eliminate redundancy in the LFI's angular and spatial domains. At the decoding side, high-quality dense LF is reconstructed using a view synthesis method based on the residual channel attention network (RCAN).

Findings

On three different LF datasets, our proposed coding framework not only reduces the transmitted bit rate but also maintains a higher view quality than current advanced methods.

Originality/value

(1) A new sampling scheme is designed to synthesize high-quality LFIs while better ensuring LF angular domain sparsity. (2) To further eliminate redundancy in the spatial domain, new ranking schemes and hierarchical prediction structures are designed. (3) A synthesis network based on RCAN, together with a novel loss function, is designed to mitigate the perceptual quality loss due to the coding process.


Citations: 0

False alarm detection in intensive care unit for monitoring arrhythmia condition using bio-signals

Pub Date : 2024-02-13 DOI: 10.1108/dta-08-2023-0437

Aleena Swetapadma, Tishya Manna, Maryam Samami

Purpose

A novel method has been proposed to reduce the rate of false alarms for life-threatening conditions in arrhythmia patients in the intensive care unit. For this purpose, the arterial blood pressure, photoplethysmogram (PLETH), electrocardiogram (ECG) and respiratory (RESP) signals are considered as input signals.

Design/methodology/approach

Three machine learning approaches, namely a feed-forward artificial neural network (ANN), an ensemble learning method and a k-nearest neighbors search method, are used to detect false alarms. The proposed method has been implemented using Arduino and MATLAB/SIMULINK on real-time monitoring data from ICU arrhythmia patients.
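Of the three classifiers, k-nearest neighbors is the simplest to illustrate. The following is a minimal sketch in Python rather than the authors' MATLAB/SIMULINK implementation; the two-dimensional feature vectors, the label names and the `knn_predict` function are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Label a query point by majority vote among its k nearest training points."""
    # Pair each training point's Euclidean distance to the query with its label.
    by_distance = sorted(
        (math.dist(x, query), label) for x, label in zip(train_x, train_y)
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical normalized features per alarm, e.g. deviations of two signals
# from their normal pattern; the values are invented for illustration.
features = [(0.1, 0.2), (0.2, 0.1), (0.9, 0.8), (0.8, 0.9), (0.85, 0.95)]
labels = ["false_alarm", "false_alarm", "true_alarm", "true_alarm", "true_alarm"]

print(knn_predict(features, labels, (0.15, 0.15)))
print(knn_predict(features, labels, (0.9, 0.9)))
```

An odd `k` avoids tied votes when there are only two classes.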

Findings

The proposed method detects false alarms with an accuracy of 99.4 per cent during asystole, 100 per cent during ventricular flutter, 98.5 per cent during ventricular tachycardia, 99.6 per cent during bradycardia and 100 per cent during tachycardia. The proposed framework is adaptable to many scenarios, easy to implement, computationally friendly, highly accurate and robust to overfitting.

Originality/value

As ECG signals consist of PQRST waves, any deviation from the normal pattern may signify an alarming condition. These deviations can be used as input to classifiers for the detection of false alarms; hence, there is no need for other feature extraction techniques. The feed-forward ANN with the Levenberg–Marquardt algorithm has shown a higher rate of convergence than other neural network algorithms, which helps provide better accuracy with no overfitting.


Citations: 0

Community relations discovery methods for users in Fancircle based on sentiment analysis in China

Pub Date : 2024-01-29 DOI: 10.1108/dta-09-2023-0570

Kai Wang

Purpose

Identifying network user relationships in Fancircle helps quantify the violence index of user text and mine the internal correlations of network behaviors among users, which provides necessary data support for the construction of a knowledge graph.

Design/methodology/approach

A correlation identification method based on sentiment analysis (CRDM-SA) is put forward by extracting user semantic information and introducing violent sentiment membership. Specifically, user text information is extracted and, based on a self-built domain violent sentiment dictionary (VSD), the topics on which topology mapping is implemented in the community are obtained. Afterward, the violence index of the user text is calculated to quantify the fuzzy sentiment representation between the user and the topic. Finally, multi-granularity mining of violence association rules in user text is realized by constructing a violence fuzzy concept lattice.

Findings

The method helps reveal the internal relationships of online violence in a complex network environment, so that users' sentiment dependence can be characterized from a granular perspective.

Originality/value

The membership degree of violent sentiment is introduced into user relationship recognition in the Fancircle community, and a text sentiment association recognition method based on the VSD is proposed. Calculating the violent sentiment value of the user text yields an annotation of violent sentiment along the topic dimension of the text, and the partial order relation between fuzzy concepts of violence under the effective confidence threshold is used to obtain the association relation.


Citations: 0

A Bayesian Inference-based approach for extracting driving data with implicit intention

IF 1.6, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2024-01-19 DOI: 10.1108/dta-03-2023-0074

Ping Huang, Haitao Ding, Hong Chen, Jianwei Zhang, Zhenjia Sun

Purpose

The growing availability of naturalistic driving datasets (NDDs) presents a valuable opportunity to develop various models for autonomous driving. However, while current NDDs include data on vehicles with and without intended driving behavior changes, they do not explicitly identify data on vehicles that intend to change their driving behavior but do not execute the change because of safety, efficiency or other factors. This missing data is essential for autonomous driving decisions. This study aims to extract driving data with implicit intentions to support the development of decision-making models.

Design/methodology/approach

According to Bayesian inference, drivers who have the same intended changes likely share similar influencing factors and states. Building on this principle, this study proposes an approach to extract data on vehicles that intended to execute specific behaviors but failed to do so. This is achieved by computing driving similarities between candidate vehicles and benchmark vehicles using standard similarity metrics that take into account the location topology of the surrounding vehicles and the motion state of each individual vehicle. By doing so, the method enables a more comprehensive analysis of driving behavior and intention.
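The abstract does not name the similarity metrics used, so the sketch below substitutes cosine similarity as one standard choice. The feature layout (gaps to surrounding vehicles plus the vehicle's own motion state), the function `flag_implicit_intent` and the threshold of 0.95 are assumptions for illustration only.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def flag_implicit_intent(candidates: Dict[str, List[float]],
                         benchmarks: List[List[float]],
                         threshold: float = 0.95) -> List[str]:
    """Return ids of candidates whose best benchmark match reaches `threshold`.

    Benchmarks are vehicles that actually executed the behavior; a candidate
    that looks similar but did not execute it is flagged as a possible
    "intended but not executed" case.
    """
    if not benchmarks:
        return []
    flagged = []
    for vehicle_id, features in candidates.items():
        best = max(cosine_similarity(features, b) for b in benchmarks)
        if best >= threshold:
            flagged.append(vehicle_id)
    return flagged


# Hypothetical features: [gap to leader (m), gap to adjacent-lane follower (m),
#                         relative speed (m/s), lateral offset (m)]
benchmarks = [[12.0, 25.0, -3.0, 0.4]]
candidates = {
    "veh_17": [11.0, 24.0, -2.5, 0.35],
    "veh_42": [40.0, 5.0, 1.0, 0.0],
}
flagged = flag_implicit_intent(candidates, benchmarks)
```

In practice the features would need scaling before comparison, since cosine similarity on raw values is dominated by the largest-magnitude dimensions.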

Findings

The proposed method is verified on the Next Generation SIMulation dataset (NGSim), which confirms its ability to reveal similarities between vehicles executing similar behaviors during the natural decision-making process. The approach is also validated using simulated data, achieving an accuracy of 96.3 per cent in recognizing vehicles with specific driving behavior intentions that are not executed.

Originality/value

This study provides an innovative approach to extract driving data with implicit intentions and strongly supports the development of data-driven decision-making models for autonomous driving. With this approach, autonomous vehicle development can capture more real driving experience from human drivers, moving towards a safer and more efficient future.


Citations: 0

ID-SF-Fusion: a cooperative model of intent detection and slot filling for natural language understanding

IF 1.6, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2024-01-19 DOI: 10.1108/dta-03-2023-0088

Meng Zhu, Xiaolong Xu

Purpose

Intent detection (ID) and slot filling (SF) are two important tasks in natural language understanding. ID identifies the main intent of a passage of text, while SF extracts from the input sentence the information that is important to that intent. However, most existing methods use sentence-level intent recognition, which carries a risk of error propagation, and do not explicitly model the relationship between intent recognition and SF. To address this problem, this paper proposes a collaborative model of ID and SF for intelligent spoken language understanding called ID-SF-Fusion.

Design/methodology/approach

ID-SF-Fusion uses Bidirectional Encoder Representations from Transformers (BERT) and Bidirectional Long Short-Term Memory (BiLSTM) to extract, respectively, effective word embeddings and context vectors containing whole-sentence information. A fusion layer provides intent–slot fusion information for the SF task, so that the relationship between the ID and SF tasks is modeled explicitly. This layer takes the ID result and the slot context vectors as input and produces fusion information that contains both the ID result and the slot information. To further reduce error propagation, the ID-SF-Fusion model uses word-level ID. Finally, the ID and SF tasks are trained by joint optimization.

Findings

We conducted experiments on two public datasets, Airline Travel Information Systems (ATIS) and Snips. The results show that the Intent ACC score and Slot F1 score of ID-SF-Fusion on ATIS are 98.0 per cent and 95.8 per cent, respectively, and the two indicators on the Snips dataset are 98.6 per cent and 96.7 per cent, respectively. ID-SF-Fusion outperforms slot-gated, SF-ID NetWork, stack-Prop and other models. In addition, ablation experiments were performed to further analyze and discuss the proposed model.
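For readers unfamiliar with the two indicators, the sketch below shows how Intent ACC and Slot F1 are commonly computed. It assumes slots are annotated with BIO tags and scored at span level (a predicted slot counts only if its boundaries and label both match); the abstract does not state the exact evaluation script, and the function names are illustrative.

```python
from typing import List, Set, Tuple


def intent_accuracy(gold: List[str], pred: List[str]) -> float:
    """Fraction of utterances whose predicted intent equals the gold intent."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def bio_spans(tags: List[str]) -> Set[Tuple[int, int, str]]:
    """Extract (start, end_exclusive, label) spans from one BIO tag sequence.

    An I- tag that does not continue an open span of the same label is ignored.
    """
    spans = set()
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        continues = tag.startswith("I-") and label == tag[2:]
        if start is not None and not continues:
            spans.add((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


def slot_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Micro-averaged span-level F1 over all utterances."""
    tp = n_gold = n_pred = 0
    for g_tags, p_tags in zip(gold, pred):
        g, p = bio_spans(g_tags), bio_spans(p_tags)
        tp += len(g & p)
        n_gold += len(g)
        n_pred += len(p)
    if tp == 0:
        return 0.0
    precision, recall = tp / n_pred, tp / n_gold
    return 2 * precision * recall / (precision + recall)


# Toy example: one of two gold slots is recovered, one of two intents is right.
gold_tags = [["O", "B-from", "I-from", "O", "B-to"]]
pred_tags = [["O", "B-from", "I-from", "O", "O"]]
f1 = slot_f1(gold_tags, pred_tags)
acc = intent_accuracy(["flight", "fare"], ["flight", "flight"])
```

Here precision is 1.0 and recall is 0.5, so the F1 score is 2/3, while intent accuracy is 0.5.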

Originality/value

This paper uses word-level intent recognition and introduces intent information into the SF process, which yields a significant improvement on both datasets.


Citations: 0

A hybrid method for forecasting coal price based on ensemble learning and deep learning with data decomposition and data enhancement

IF 1.6, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications

Pub Date : 2024-01-18 DOI: 10.1108/dta-07-2023-0377

Jing Tang, Yida Guo, Yilin Han

Purpose

Coal is a critical global energy source, and fluctuations in its price significantly impact related enterprises' profitability. This study aims to develop a robust model for predicting the coal price index to enhance coal purchase strategies for coal-consuming enterprises and provide crucial information for global carbon emission reduction.

Design/methodology/approach

The proposed coal price forecasting system combines data decomposition, semi-supervised feature engineering, ensemble learning and deep learning. It addresses the challenge of merging low-resolution and high-resolution data by adaptively combining both types of data and filling gaps through interpolation for internal missing data and self-supervision for initial/terminal missing data. The system employs self-supervised learning to fill complex missing data.
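The distinction between internal and initial/terminal gaps can be made concrete with a short sketch. It assumes plain linear interpolation for internal gaps, which the abstract does not specify, and it deliberately leaves leading and trailing gaps unfilled, since the paper handles those with a self-supervised model that is not reproduced here. The function name `fill_internal_gaps` is illustrative.

```python
from typing import List, Optional


def fill_internal_gaps(series: List[Optional[float]]) -> List[Optional[float]]:
    """Linearly interpolate runs of None bounded by observations on both sides.

    Leading and trailing runs of None have only one neighbouring observation,
    so interpolation is undefined there and they are returned unchanged.
    """
    filled = list(series)
    known = [i for i, value in enumerate(series) if value is not None]
    for left, right in zip(known, known[1:]):
        step = (series[right] - series[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = series[left] + step * (i - left)
    return filled


# Toy price series aligned to a daily grid; None marks a missing day.
prices = [None, 700.0, None, None, 730.0, 735.0, None]
filled = fill_internal_gaps(prices)
```

The two internal gaps become 710.0 and 720.0, while the first and last entries stay `None` for a separate model to fill.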

Findings

The ensemble model, which combines long short-term memory, XGBoost and support vector regression, demonstrated the best prediction performance among the tested models. It exhibited superior accuracy and stability across multiple indices in two datasets, namely the Bohai-Rim steam-coal price index and coal daily settlement price.
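The abstract does not say how the three base models are combined. As a hedged illustration, the sketch below uses a weighted average of their forecasts, which is one common combination rule; the weights, model keys and the function `weighted_ensemble` are assumptions, and the numbers are invented.

```python
from typing import Dict, List


def weighted_ensemble(predictions: Dict[str, List[float]],
                      weights: Dict[str, float]) -> List[float]:
    """Combine per-model forecasts into one forecast by weighted averaging.

    `predictions` maps a model name to its forecast for each time step;
    all forecasts must cover the same horizon.
    """
    horizons = {len(p) for p in predictions.values()}
    if len(horizons) != 1:
        raise ValueError("all models must forecast the same number of steps")
    total = sum(weights[name] for name in predictions)
    horizon = horizons.pop()
    return [
        sum(weights[name] * preds[t] for name, preds in predictions.items()) / total
        for t in range(horizon)
    ]


# Invented two-step forecasts from the three base learners named in the abstract.
base_forecasts = {
    "lstm": [700.0, 710.0],
    "xgboost": [704.0, 706.0],
    "svr": [696.0, 714.0],
}
equal_weights = {"lstm": 1.0, "xgboost": 1.0, "svr": 1.0}
combined = weighted_ensemble(base_forecasts, equal_weights)
```

Weights could instead be tuned on a validation set, or replaced by a stacking model trained on the base forecasts.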

Originality/value

The proposed coal price forecasting system stands out as it integrates data decomposition, semi-supervised feature engineering, ensemble learning and deep learning. Moreover, the system pioneers the use of self-supervised learning for filling in complex missing data, contributing to its originality and effectiveness.


Citations: 0