首页 > 最新文献

Data and information management最新文献

英文 中文
Trustworthy AI: AI developers’ lens to implementation challenges and opportunities 值得信赖的人工智能:人工智能开发人员实现挑战和机遇的视角
Pub Date : 2025-06-01 DOI: 10.1016/j.dim.2024.100082
Carter Cousineau , Rozita Dara , Ataharul Chowdhury
As organizations continue to embrace the use of artificial intelligence (AI) systems, it is crucial to ensure that these AI systems can be trusted. However, there is still a significant gap between research on trustworthy AI and its implementation in real-world applications. To address this issue, we sought to explore the perspectives of AI developers and the challenges they face in creating trustworthy AI systems. This exploratory study involved conducting interviews with 19 AI developers. We identified key challenges faced by AI developers due to the immature state of trustworthy AI, inconsistent global regulatory landscape, a lack of standardized definitions of key concepts, limited tools and standards for practical implementation in organizations. This paper provides recommendations for organizations to invest in trustworthy AI processes and practices, this includes building a foundation for trustworthy AI specific to their organization, adopting an organizational approach to trustworthy AI culture, and providing proper data infrastructures to support AI developers in creating trustworthy AI systems. By investing in trustworthy AI practices, organizations can prepare for evolving regulations and ensure that their AI systems are reliable and trustworthy.
随着组织继续接受人工智能(AI)系统的使用,确保这些人工智能系统是值得信任的至关重要。然而,可信人工智能的研究与其在现实应用中的实现之间仍然存在着巨大的差距。为了解决这个问题,我们试图探索人工智能开发人员的观点,以及他们在创建值得信赖的人工智能系统时面临的挑战。这项探索性研究包括对19名人工智能开发人员进行采访。我们确定了人工智能开发人员面临的主要挑战,这是由于可信赖的人工智能的不成熟状态、不一致的全球监管环境、缺乏关键概念的标准化定义、组织中实际实施的工具和标准有限。本文为组织提供了投资于可信赖的人工智能流程和实践的建议,包括为其组织特定的可信赖的人工智能建立基础,采用可信赖的人工智能文化的组织方法,并提供适当的数据基础设施来支持人工智能开发人员创建可信赖的人工智能系统。通过投资可信赖的人工智能实践,组织可以为不断变化的法规做好准备,并确保其人工智能系统可靠且值得信赖。
{"title":"Trustworthy AI: AI developers’ lens to implementation challenges and opportunities","authors":"Carter Cousineau ,&nbsp;Rozita Dara ,&nbsp;Ataharul Chowdhury","doi":"10.1016/j.dim.2024.100082","DOIUrl":"10.1016/j.dim.2024.100082","url":null,"abstract":"<div><div>As organizations continue to embrace the use of artificial intelligence (AI) systems, it is crucial to ensure that these AI systems can be trusted. However, there is still a significant gap between research on trustworthy AI and its implementation in real-world applications. To address this issue, we sought to explore the perspectives of AI developers and the challenges they face in creating trustworthy AI systems. This exploratory study involved conducting interviews with 19 AI developers. We identified key challenges faced by AI developers due to the immature state of trustworthy AI, inconsistent global regulatory landscape, a lack of standardized definitions of key concepts, limited tools and standards for practical implementation in organizations. This paper provides recommendations for organizations to invest in trustworthy AI processes and practices, this includes building a foundation for trustworthy AI specific to their organization, adopting an organizational approach to trustworthy AI culture, and providing proper data infrastructures to support AI developers in creating trustworthy AI systems. By investing in trustworthy AI practices, organizations can prepare for evolving regulations and ensure that their AI systems are reliable and trustworthy.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 2","pages":"Article 100082"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144204863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting changes in task difficulty perception based on visual behavior in mobile health information search 根据移动健康信息搜索中的视觉行为预测任务难度感知的变化
Pub Date : 2025-06-01 DOI: 10.1016/j.dim.2024.100074
Jing Chen , Hongli Chen , Shubin Zhou , Quan Lu
With the proliferation of online health resources, mobile health information search has become the new norm, in which the task difficulty perception in search affects the user's search experience. This study aimed to investigate the relationship between visual behavior that reflects users' cognitive processing and changes in perceived task difficulty, thereby predicting such changes.
This study was conducted through a controlled experiment. 46 participants were recruited to complete four tasks. Visual behavior data were collected through eye-tracking technology, and changes in task difficulty perception were measured through pre-task and post-task questionnaires. The mobile health information search process is divided into three search activities: querying, browsing, and viewing activities. Predictors were inspected from the overall session and individual search activity levels using the Mann-Whitney U test, and then K-Nearest Neighbor, Extreme-Trees, Naive Bayesian, Support Vector Machine, Logistic Regression algorithms were used to predict and evaluate prediction effects.
The results showed significant differences in participants' fixation and saccade behaviors between increases and decreases in task difficulty, both at the overall session and individual search activity level. The logistic regression algorithm demonstrated the highest predictive performance, Furthermore, visual behavioral indicators for the browsing activity proved to perform better relative to the other search activities.
This study highlights the importance of visual behavioral indicators as reliable predictors of changes in users' perceived task difficulty in mobile health information search. It can help health information providers and administrators to provide timely and targeted assistance and implement effective guidance strategies.
随着在线健康资源的激增,移动健康信息搜索成为新常态,搜索任务难度感知影响着用户的搜索体验。本研究旨在探讨反映用户认知加工的视觉行为与感知任务难度变化之间的关系,从而预测感知任务难度的变化。这项研究是通过对照实验进行的。46名参与者被招募来完成四项任务。通过眼动追踪技术收集视觉行为数据,通过任务前问卷和任务后问卷测量任务难度感知的变化。移动健康信息搜索过程分为三个搜索活动:查询、浏览和查看活动。使用Mann-Whitney U检验从整体会话和个人搜索活动水平检查预测因子,然后使用k -最近邻,极端树,朴素贝叶斯,支持向量机,逻辑回归算法来预测和评估预测效果。结果表明,在任务难度增加和降低的情况下,被试的注视和扫视行为在整体会话和个体搜索活动水平上都存在显著差异。逻辑回归算法表现出最高的预测性能,此外,相对于其他搜索活动,浏览活动的视觉行为指标表现得更好。本研究强调了视觉行为指标作为移动健康信息搜索中用户感知任务难度变化的可靠预测因素的重要性。它可以帮助卫生信息提供者和管理者提供及时和有针对性的援助,并实施有效的指导策略。
{"title":"Predicting changes in task difficulty perception based on visual behavior in mobile health information search","authors":"Jing Chen ,&nbsp;Hongli Chen ,&nbsp;Shubin Zhou ,&nbsp;Quan Lu","doi":"10.1016/j.dim.2024.100074","DOIUrl":"10.1016/j.dim.2024.100074","url":null,"abstract":"<div><div>With the proliferation of online health resources, mobile health information search has become the new norm, in which the task difficulty perception in search affects the user's search experience. This study aimed to investigate the relationship between visual behavior that reflects users' cognitive processing and changes in perceived task difficulty, thereby predicting such changes.</div><div>This study was conducted through a controlled experiment. 46 participants were recruited to complete four tasks. Visual behavior data were collected through eye-tracking technology, and changes in task difficulty perception were measured through pre-task and post-task questionnaires. The mobile health information search process is divided into three search activities: querying, browsing, and viewing activities. Predictors were inspected from the overall session and individual search activity levels using the Mann-Whitney <em>U</em> test, and then K-Nearest Neighbor, Extreme-Trees, Naive Bayesian, Support Vector Machine, Logistic Regression algorithms were used to predict and evaluate prediction effects.</div><div>The results showed significant differences in participants' fixation and saccade behaviors between increases and decreases in task difficulty, both at the overall session and individual search activity level. The logistic regression algorithm demonstrated the highest predictive performance, Furthermore, visual behavioral indicators for the browsing activity proved to perform better relative to the other search activities.</div><div>This study highlights the importance of visual behavioral indicators as reliable predictors of changes in users' perceived task difficulty in mobile health information search. It can help health information providers and administrators to provide timely and targeted assistance and implement effective guidance strategies.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 2","pages":"Article 100074"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141391065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing team creativity among information technology professionals through knowledge sharing and motivational rewards: A self-determination perspective 通过知识共享和激励奖励提高信息技术专业人员的团队创造力:自我决定的视角
Pub Date : 2025-06-01 DOI: 10.1016/j.dim.2024.100075
Xiling Cui , Xuan Yang , Jifan Ren , Paul Benjamin Lowry , Timon Chih-ting Du
This study aims to investigate how to leverage knowledge sharing (KS) to boost team creativity among information technology (IT) professionals. We examine the effects of intrinsic and intangible extrinsic rewards on in-role and extra-role KS, which increases team creativity. We use data collected from 322 employees in 80 teams from organizations in the IT industry to test the research model and confirm the important roles of KS and motivational rewards. The two types of KS show different patterns in terms of their antecedents and outcomes. Specifically, in-role KS does not affect team creativity directly, while extra-role KS does. Intrinsic rewards significantly affect both in-role and extra-role KS, and the effect on the latter is greater. Image rewards have a greater effect on in-role KS than on extra-role KS. In addition, the two forms of intangible extrinsic rewards exhibit internalization. The study pioneers in addressing a pressing research gap by investigating and comparing the effects of the two types of KS—in-role and extra-role KS—on team creativity.
本研究旨在探讨资讯科技(IT)专业人员如何利用知识分享(KS)来提升团队创造力。我们考察了内在和无形的外在奖励对角色内和角色外KS的影响,从而提高团队创造力。我们使用来自IT行业组织的80个团队的322名员工的数据来检验研究模型,并证实了KS和激励性奖励的重要作用。两种类型的KS在其前因和结果方面表现出不同的模式。具体而言,角色内KS对团队创造力没有直接影响,而角色外KS对团队创造力有直接影响。内在奖励对角色内KS和角色外KS均有显著影响,且对角色外KS的影响更大。形象奖励对角色内KS的影响大于对角色外KS的影响。此外,这两种形式的无形外在奖励都表现出内化。本研究通过调查和比较角色内ks和角色外ks对团队创造力的影响,率先解决了一个紧迫的研究空白。
{"title":"Enhancing team creativity among information technology professionals through knowledge sharing and motivational rewards: A self-determination perspective","authors":"Xiling Cui ,&nbsp;Xuan Yang ,&nbsp;Jifan Ren ,&nbsp;Paul Benjamin Lowry ,&nbsp;Timon Chih-ting Du","doi":"10.1016/j.dim.2024.100075","DOIUrl":"10.1016/j.dim.2024.100075","url":null,"abstract":"<div><div>This study aims to investigate how to leverage knowledge sharing (KS) to boost team creativity among information technology (IT) professionals. We examine the effects of intrinsic and intangible extrinsic rewards on in-role and extra-role KS, which increases team creativity. We use data collected from 322 employees in 80 teams from organizations in the IT industry to test the research model and confirm the important roles of KS and motivational rewards. The two types of KS show different patterns in terms of their antecedents and outcomes. Specifically, in-role KS does not affect team creativity directly, while extra-role KS does. Intrinsic rewards significantly affect both in-role and extra-role KS, and the effect on the latter is greater. Image rewards have a greater effect on in-role KS than on extra-role KS. In addition, the two forms of intangible extrinsic rewards exhibit internalization. The study pioneers in addressing a pressing research gap by investigating and comparing the effects of the two types of KS—in-role and extra-role KS—on team creativity.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 2","pages":"Article 100075"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141408578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparative analysis of China-themed books in three ASEAN countries: Implications for resource development and intercultural communication 东盟三国中国主题图书的比较分析:对资源开发和跨文化交流的启示
Pub Date : 2025-06-01 DOI: 10.1016/j.dim.2024.100081
Fan Jin, Pengyi Zhang
Books are an important carrier of culture. Library catalogs across different countries can reflect intercultural communication. This paper aims to explore the situation and differences of China-themed books in three ASEAN countries: Malaysia, the Philippines and Thailand. These countries are representative of the intercultural relations between China and ASEAN members. Descriptive analysis is used to analyze the series of books, focusing on the time and subjects of the book resources. The paper also compares the distribution of books about the three countries in the National Library of China. The results show that Malaysia and Thailand have more comprehensive and in-depth cultural collection of China-themed book resources, while the Philippines includes more books related to China's development and social realities. This study is helpful to resource and collection development related to particular countries and regions and the intercultural communication between countries.
书籍是文化的重要载体。不同国家的图书馆目录可以反映跨文化交流。本文旨在探讨马来西亚、菲律宾和泰国这三个东盟国家中国主题图书的现状和差异。这些国家是中国与东盟国家跨文化关系的代表。描述性分析是对丛书进行分析的方法,侧重于图书资源的时间和主题。本文还比较了中国国家图书馆中三国图书的分布情况。结果表明,马来西亚和泰国对中国主题图书资源的文化收藏更为全面和深入,而菲律宾则包含了更多与中国发展和社会现实相关的图书。这一研究有助于特定国家和地区的资源和馆藏开发以及国家间的跨文化交流。
{"title":"A comparative analysis of China-themed books in three ASEAN countries: Implications for resource development and intercultural communication","authors":"Fan Jin,&nbsp;Pengyi Zhang","doi":"10.1016/j.dim.2024.100081","DOIUrl":"10.1016/j.dim.2024.100081","url":null,"abstract":"<div><div>Books are an important carrier of culture. Library catalogs across different countries can reflect intercultural communication. This paper aims to explore the situation and differences of China-themed books in three ASEAN countries: Malaysia, the Philippines and Thailand. These countries are representative of the intercultural relations between China and ASEAN members. Descriptive analysis is used to analyze the series of books, focusing on the time and subjects of the book resources. The paper also compares the distribution of books about the three countries in the National Library of China. The results show that Malaysia and Thailand have more comprehensive and in-depth cultural collection of China-themed book resources, while the Philippines includes more books related to China's development and social realities. This study is helpful to resource and collection development related to particular countries and regions and the intercultural communication between countries.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 2","pages":"Article 100081"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144204866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smart cities services and solutions: A systematic review 智慧城市服务和解决方案:系统回顾
Pub Date : 2025-06-01 DOI: 10.1016/j.dim.2024.100087
Walid Miloud Dahmane , Samir Ouchani , Hafida Bouarfa
Throughout history, cities have represented enduring symbols of human civilization and progress. Today, we are witnessing a technological revolution fueled by the rapid advancement of Information and Communication Technologies (ICT). This transformation has dramatically improved data analysis capabilities through the integration of the Internet of Things (IoT), Artificial Intelligence (AI), cloud computing, and other cutting-edge innovations. As key contributors to urban development, researchers must adopt effective methodologies to thoroughly explore the concept of smart cities. Furthermore, it is essential to raise awareness among stakeholders about the inevitable adoption of IoT technologies and their associated benefits. This paper aims to review various methodologies used to collect critical data, prioritize key urban challenges, and assess the performance of urban services. By comparing the findings of the surveyed studies, insights are drawn, and potential directions for future research are outlined.
纵观历史,城市一直是人类文明和进步的持久象征。今天,我们正在目睹一场由信息通信技术迅速发展推动的技术革命。这种转变通过物联网(IoT)、人工智能(AI)、云计算和其他前沿创新的融合,极大地提高了数据分析能力。作为城市发展的关键贡献者,研究人员必须采用有效的方法来深入探索智慧城市的概念。此外,必须提高利益相关者对物联网技术不可避免的采用及其相关好处的认识。本文旨在回顾用于收集关键数据、优先考虑关键城市挑战和评估城市服务绩效的各种方法。通过对调查研究结果的比较,得出了见解,并概述了未来研究的潜在方向。
{"title":"Smart cities services and solutions: A systematic review","authors":"Walid Miloud Dahmane ,&nbsp;Samir Ouchani ,&nbsp;Hafida Bouarfa","doi":"10.1016/j.dim.2024.100087","DOIUrl":"10.1016/j.dim.2024.100087","url":null,"abstract":"<div><div>Throughout history, cities have represented enduring symbols of human civilization and progress. Today, we are witnessing a technological revolution fueled by the rapid advancement of Information and Communication Technologies (ICT). This transformation has dramatically improved data analysis capabilities through the integration of the Internet of Things (IoT), Artificial Intelligence (AI), cloud computing, and other cutting-edge innovations. As key contributors to urban development, researchers must adopt effective methodologies to thoroughly explore the concept of smart cities. Furthermore, it is essential to raise awareness among stakeholders about the inevitable adoption of IoT technologies and their associated benefits. This paper aims to review various methodologies used to collect critical data, prioritize key urban challenges, and assess the performance of urban services. By comparing the findings of the surveyed studies, insights are drawn, and potential directions for future research are outlined.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 2","pages":"Article 100087"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144204865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying exaggeration in ESG reports using machine learning techniques 使用机器学习技术识别ESG报告中的夸大
Pub Date : 2025-06-01 DOI: 10.1016/j.dim.2024.100084
Yunfang Luo , Xiling Cui , Qiang Liu , Qiang Zhou , Yingxuan Zhang
Exaggeration is a specific way in which companies potentially overstate certain aspects of their actual environmental performance, strategically disclosing positive information about their environmental performance. This research aims to identify instances of exaggerated information within environmental, social, and governance (ESG) reports by employing machine learning techniques. We crawled 594 ESG reports and employed a variety of machine learning algorithms to identify instances of exaggeration. Through the cross-validation, we found that random forest exhibits the best performance in predicting exaggeration and ridge regression demonstrates superior performance in predicting the exaggeration scores. A significant contribution of our study is the development of an exaggerated thesaurus tailored specifically to this domain. Ultimately, our study lays a foundation for further investigations into addressing the impact of exaggerated information in ESG reporting.
夸大是一种特定的方式,公司可能夸大其实际环境绩效的某些方面,战略性地披露有关其环境绩效的积极信息。本研究旨在通过使用机器学习技术来识别环境、社会和治理(ESG)报告中夸大信息的实例。我们抓取了594份ESG报告,并使用了各种机器学习算法来识别夸大的实例。通过交叉验证,我们发现随机森林在预测夸张分数方面表现出最好的效果,山脊回归在预测夸张分数方面表现出更好的效果。我们研究的一个重要贡献是专门为这个领域量身定做的夸张的同义词典的发展。最终,我们的研究为进一步研究解决ESG报告中夸大信息的影响奠定了基础。
{"title":"Identifying exaggeration in ESG reports using machine learning techniques","authors":"Yunfang Luo ,&nbsp;Xiling Cui ,&nbsp;Qiang Liu ,&nbsp;Qiang Zhou ,&nbsp;Yingxuan Zhang","doi":"10.1016/j.dim.2024.100084","DOIUrl":"10.1016/j.dim.2024.100084","url":null,"abstract":"<div><div>Exaggeration is a specific way in which companies potentially overstate certain aspects of their actual environmental performance, strategically disclosing positive information about their environmental performance. This research aims to identify instances of exaggerated information within environmental, social, and governance (ESG) reports by employing machine learning techniques. We crawled 594 ESG reports and employed a variety of machine learning algorithms to identify instances of exaggeration. Through the cross-validation, we found that random forest exhibits the best performance in predicting exaggeration and ridge regression demonstrates superior performance in predicting the exaggeration scores. A significant contribution of our study is the development of an exaggerated thesaurus tailored specifically to this domain. Ultimately, our study lays a foundation for further investigations into addressing the impact of exaggerated information in ESG reporting.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 2","pages":"Article 100084"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144204864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aspect-based sentiment evolution and its correlation with review rounds in multi-round peer reviews: A deep learning approach 多轮同行评审中基于方面的情感演变及其与评审轮次的相关性:一种深度学习方法
Pub Date : 2025-05-28 DOI: 10.1016/j.dim.2025.100105
Ruxue Han , Haomin Zhou , Jiangtao Zhong , Chengzhi Zhang
Mining sentiment information from the textual content of peer review comments offers valuable insights into the scientific evaluation process. However, previous studies are often constrained by coarse-grained analysis and the lack of differentiation across review rounds. Notably, the dynamic shifts in reviewers' focus and sentiment tendencies throughout multiple review stages remain underexplored. To address this gap, the present study investigates the distribution and evolution of aspect-level sentiments and examines their correlation with the number of review rounds. We begin by segmenting the multi-round review comments of 11,063 accepted papers from Nature Communications and identifying fine-grained review aspect clusters. A manually annotated corpus of approximately 5000 review sentences is then constructed. Using this dataset, we train a series of deep learning-based aspect sentiment classification models. Among them, the LCF-BERT-CDM model achieves the best performance, with a Macro-F1 score of 82.65 %. Subsequent statistical analysis reveals a consistent trend: as the number of review rounds increases, the proportion of positive sentiments rises, while negative sentiments decline. Correlation analysis further indicates that aspect sentiment scores are negatively associated with the total number of review rounds. Key aspects exhibiting stronger correlations include “experiments”, “research significance” and “result analysis”.
从同行评审评论的文本内容中挖掘情感信息,为科学评估过程提供了有价值的见解。然而,以前的研究经常受到粗粒度分析的限制,并且在审查轮之间缺乏区分。值得注意的是,审稿人在多个审稿阶段的关注点和情感倾向的动态变化仍然没有得到充分的研究。为了解决这一差距,本研究调查了方面级情绪的分布和演变,并检查了它们与审查轮数的相关性。我们首先对《自然通讯》11063篇被接受论文的多轮评审意见进行分割,并确定细粒度的评审方面集群。然后构建一个人工注释的大约5000个复习句子的语料库。利用该数据集,我们训练了一系列基于深度学习的面向情感分类模型。其中LCF-BERT-CDM模型表现最好,其Macro-F1得分为82.65%。随后的统计分析显示了一个一致的趋势:随着审查轮次的增加,积极情绪的比例上升,而消极情绪的比例下降。相关分析进一步表明,方面情绪得分与总评审轮数呈负相关。表现出较强相关性的关键方面包括“实验”、“研究意义”和“结果分析”。
{"title":"Aspect-based sentiment evolution and its correlation with review rounds in multi-round peer reviews: A deep learning approach","authors":"Ruxue Han ,&nbsp;Haomin Zhou ,&nbsp;Jiangtao Zhong ,&nbsp;Chengzhi Zhang","doi":"10.1016/j.dim.2025.100105","DOIUrl":"10.1016/j.dim.2025.100105","url":null,"abstract":"<div><div>Mining sentiment information from the textual content of peer review comments offers valuable insights into the scientific evaluation process. However, previous studies are often constrained by coarse-grained analysis and the lack of differentiation across review rounds. Notably, the dynamic shifts in reviewers' focus and sentiment tendencies throughout multiple review stages remain underexplored. To address this gap, the present study investigates the distribution and evolution of aspect-level sentiments and examines their correlation with the number of review rounds. We begin by segmenting the multi-round review comments of 11,063 accepted papers from Nature Communications and identifying fine-grained review aspect clusters. A manually annotated corpus of approximately 5000 review sentences is then constructed. Using this dataset, we train a series of deep learning-based aspect sentiment classification models. Among them, the LCF-BERT-CDM model achieves the best performance, with a Macro-F<sub>1</sub> score of 82.65 %. Subsequent statistical analysis reveals a consistent trend: as the number of review rounds increases, the proportion of positive sentiments rises, while negative sentiments decline. Correlation analysis further indicates that aspect sentiment scores are negatively associated with the total number of review rounds. Key aspects exhibiting stronger correlations include “experiments”, “research significance” and “result analysis”.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"10 1","pages":"Article 100105"},"PeriodicalIF":0.0,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145529209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Irregular sleep pattern identification and analysis from social media dataset using hybrid deep learning based attention mechanism 基于混合深度学习注意机制的社交媒体数据集不规则睡眠模式识别与分析
Pub Date : 2025-05-20 DOI: 10.1016/j.dim.2025.100104
Mohammed Jawwadul Islam , Mohammad Fahad Al Rafi , Pranto Podder , Aysha Siddika , Moumy Kabir , Euna Mehnaz Khan , Najmul Islam , Saddam Mukta
Following an irregular bedtime routine and having different amounts of sleep each night might increase a person's risk of obesity, cardiovascular problems, high blood pressure, insulin levels, and other metabolic problems. Similarly, in recent times, social media platforms have gained popularity among users for sharing their interests, thoughts, and opinions. Through social media activities, researchers have been able to mine the text data generated on these platforms to investigate and understand users' behaviors and habits. In this paper, we examine a total of 2,468,697 tweets to identify users' irregular sleeping patterns (ISP) using psycholinguistic and contextual features from their tweets. We conduct a linguistic analysis to understand the factors influencing users' psychological behavior and word use patterns, and find a correlation with their irregular sleeping patterns. We observe that users who have irregular sleeping patterns use anger, anxiety, death, and future categories of words in their tweets largely. In contrast, users with irregular sleeping patterns tend to use positive emotions, family, and other categories of words in their tweets. Building upon our findings, we develop a hybrid prediction model that predicts users' irregular sleeping patterns from psycholinguistic features with an accuracy of 91%. We examine the application of social media data for the early identification of irregular sleep patterns and their related mental and psychological concerns while investigating design prospects for future health technologies to enhance the monitoring and support of healthy sleep behavior.
不规律的就寝时间和每晚不同的睡眠时间可能会增加一个人患肥胖症、心血管疾病、高血压、胰岛素水平和其他代谢问题的风险。同样,近年来,社交媒体平台在用户中越来越受欢迎,可以分享他们的兴趣、想法和观点。通过社交媒体活动,研究人员已经能够挖掘这些平台上产生的文本数据,以调查和了解用户的行为和习惯。在本文中,我们研究了总共2,468,697条推文,利用推文的心理语言学和上下文特征来识别用户的不规则睡眠模式(ISP)。我们通过语言分析来了解影响用户心理行为和词语使用模式的因素,并发现其与不规律的睡眠模式之间的相关性。我们观察到,睡眠模式不规律的用户在推特上主要使用愤怒、焦虑、死亡和未来类别的词语。相比之下,睡眠模式不规律的用户倾向于在推特上使用积极情绪、家庭和其他类别的词语。基于我们的发现,我们开发了一个混合预测模型,可以根据心理语言特征预测用户的不规则睡眠模式,准确率达到91%。我们研究了社交媒体数据在早期识别不规则睡眠模式及其相关精神和心理问题方面的应用,同时研究了未来健康技术的设计前景,以加强对健康睡眠行为的监测和支持。
{"title":"Irregular sleep pattern identification and analysis from social media dataset using hybrid deep learning based attention mechanism","authors":"Mohammed Jawwadul Islam ,&nbsp;Mohammad Fahad Al Rafi ,&nbsp;Pranto Podder ,&nbsp;Aysha Siddika ,&nbsp;Moumy Kabir ,&nbsp;Euna Mehnaz Khan ,&nbsp;Najmul Islam ,&nbsp;Saddam Mukta","doi":"10.1016/j.dim.2025.100104","DOIUrl":"10.1016/j.dim.2025.100104","url":null,"abstract":"<div><div>Following an irregular bedtime routine and having different amounts of sleep each night might increase a person's risk of obesity, cardiovascular problems, high blood pressure, insulin levels, and other metabolic problems. Similarly, in recent times, social media platforms have gained popularity among users for sharing their interests, thoughts, and opinions. Through social media activities, researchers have been able to mine the text data generated on these platforms to investigate and understand users' behaviors and habits. In this paper, we examine a total of 2,468,697 tweets to identify users' irregular sleeping patterns (ISP) using psycholinguistic and contextual features from their tweets. We conduct a linguistic analysis to understand the factors influencing users' psychological behavior and word use patterns, and find a correlation with their irregular sleeping patterns. We observe that users who have irregular sleeping patterns use anger, anxiety, death, and future categories of words in their tweets largely. In contrast, users with irregular sleeping patterns tend to use positive emotions, family, and other categories of words in their tweets. Building upon our findings, we develop a hybrid prediction model that predicts users' irregular sleeping patterns from psycholinguistic features with an accuracy of 91%. We examine the application of social media data for the early identification of irregular sleep patterns and their related mental and psychological concerns while investigating design prospects for future health technologies to enhance the monitoring and support of healthy sleep behavior.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"10 1","pages":"Article 100104"},"PeriodicalIF":0.0,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145529210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised topic labeling and opportunity model of social media data for enhancing automotive product design processes 无监督主题标签和机会模型的社会媒体数据,以提高汽车产品设计过程
Pub Date : 2025-05-13 DOI: 10.1016/j.dim.2025.100103
Broto Widya Hartanto, Subagyo, I.G.B. Budi Dharma
This study introduces a hybrid method combining topic modeling and an opportunity model to generate novel ideas for automobile product design improvements. Furthermore, it proposes a novel unsupervised topic labeling procedure to address the limitations in current topic modeling interpretations, which are often not fully unsupervised. The procedure comprised automatic generation of labels that directly support opportunity modeling and facilitate product design development. To achieve the stated objectives, data was collected from user comments on YouTube car reviews and analyzed using various algorithms and part-of-speech rules, finding that Non-Negative Matrix Factorization with noun-adjective combinations proved most effective in generating comprehensible topic labels and capturing emotional expressions. The results revealed six underserved labels, one served right, and two overserved categories for new vehicle design improvements, providing valuable insights into user experiences. The insights provided in this context are expected to contribute to the potential improvement of vehicle attribute designs, thereby enhancing the efficiency of the entire design process.
本研究采用主题建模与机会模型相结合的混合方法,为汽车产品设计改进提供新思路。此外,它提出了一种新的无监督主题标记过程,以解决当前主题建模解释中的局限性,这些解释通常不是完全无监督的。该过程包括标签的自动生成,直接支持机会建模和促进产品设计开发。为了实现既定目标,我们从YouTube汽车评论的用户评论中收集数据,并使用各种算法和词性规则进行分析,发现带有名词-形容词组合的非负矩阵分解在生成可理解的主题标签和捕捉情感表达方面最有效。结果显示,在新车设计改进方面,有6个服务不足的类别、1个服务正确的类别和2个服务过度的类别,为用户体验提供了有价值的见解。在此背景下提供的见解有望有助于改进车辆属性设计,从而提高整个设计过程的效率。
{"title":"Unsupervised topic labeling and opportunity model of social media data for enhancing automotive product design processes","authors":"Broto Widya Hartanto,&nbsp;Subagyo,&nbsp;I.G.B. Budi Dharma","doi":"10.1016/j.dim.2025.100103","DOIUrl":"10.1016/j.dim.2025.100103","url":null,"abstract":"<div><div>This study introduces a hybrid method combining topic modeling and an opportunity model to generate novel ideas for automobile product design improvements. Furthermore, it proposes a novel unsupervised topic labeling procedure to address the limitations in current topic modeling interpretations, which are often not fully unsupervised. The procedure comprised automatic generation of labels that directly support opportunity modeling and facilitate product design development. To achieve the stated objectives, data was collected from user comments on YouTube car reviews and analyzed using various algorithms and part-of-speech rules, finding that Non-Negative Matrix Factorization with noun-adjective combinations proved most effective in generating comprehensible topic labels and capturing emotional expressions. The results revealed six underserved labels, one served right, and two overserved categories for new vehicle design improvements, providing valuable insights into user experiences. The insights provided in this context are expected to contribute to the potential improvement of vehicle attribute designs, thereby enhancing the efficiency of the entire design process.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 4","pages":"Article 100103"},"PeriodicalIF":0.0,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reforming the SERVQUAL model for accommodation sharing services: A mixed-method approach 改造住宿共享服务的SERVQUAL模型:一种混合方法
Pub Date : 2025-03-29 DOI: 10.1016/j.dim.2025.100102
Wenlong Liu , Yashuo Yuan , Yi Jiang , Jian Mou
Shared accommodation has seen robust growth with the advancement of the platform economy. With the growing participation of non-professional service deliverers (i.e., property owners) in this field, service quality and customer satisfaction are currently facing challenges. Leveraging the SERVQUAL model, this research investigates the factors that determine customers' continuance intention (CI) to reuse this type of accommodation. A mixed-method approach is used, in which text analysis is first conducted to identify the dimensions constituting the reformed SERVQUAL model in the accommodation sharing context. Thereafter, a survey-based empirical analysis is carried out to examine the impact of the SERVQUAL dimensions. In the text analysis, 29,787 online reviews on accommodation sharing services from Ctrip.com were collected. After word segmenting and high-frequency word coding using the Jieba package of Python and NVivo 12 plus, eight dimensions of SERVQUAL for accommodation sharing services were extracted: necessities, complementarity, reliability, empathy, assurance, responsiveness, authenticity, and similarity. In the empirical study, the results based on 588 valid samples show that all dimensions identified in this research have either direct or indirect impacts on consumers’ CI. The research findings hold great theoretical and practical significance.
随着平台经济的发展,共享住宿出现了强劲的增长。随着非专业服务提供者(即业主)越来越多地参与这一领域,服务质量和客户满意度正面临挑战。利用SERVQUAL模型,本研究调查了决定客户重用这种类型住宿的持续意向(CI)的因素。采用混合方法,首先进行文本分析,以确定在住宿共享上下文中构成改造后的SERVQUAL模型的维度。然后,进行了基于调查的实证分析,以检验SERVQUAL维度的影响。在文本分析中,我们收集了携程网上29787条关于住宿共享服务的在线评论。利用Python的Jieba包和NVivo 12 plus进行分词和高频词编码后,提取出住宿共享服务SERVQUAL的八个维度:必需品、互补性、可靠性、共情性、保证性、响应性、真实性和相似性。在实证研究中,基于588个有效样本的结果表明,本研究确定的所有维度对消费者CI都有直接或间接的影响。研究结果具有重要的理论和现实意义。
{"title":"Reforming the SERVQUAL model for accommodation sharing services: A mixed-method approach","authors":"Wenlong Liu ,&nbsp;Yashuo Yuan ,&nbsp;Yi Jiang ,&nbsp;Jian Mou","doi":"10.1016/j.dim.2025.100102","DOIUrl":"10.1016/j.dim.2025.100102","url":null,"abstract":"<div><div>Shared accommodation has seen robust growth with the advancement of the platform economy. With the growing participation of non-professional service deliverers (i.e., property owners) in this field, service quality and customer satisfaction are currently facing challenges. Leveraging the SERVQUAL model, this research investigates the factors that determine customers' continuance intention (CI) to reuse this type of accommodation. A mixed-method approach is used, in which text analysis is first conducted to identify the dimensions constituting the reformed SERVQUAL model in the accommodation sharing context. Thereafter, a survey-based empirical analysis is carried out to examine the impact of the SERVQUAL dimensions. In the text analysis, 29,787 online reviews on accommodation sharing services from <span><span>Ctrip.com</span><svg><path></path></svg></span> were collected. After word segmenting and high-frequency word coding using the Jieba package of Python and NVivo 12 plus, eight dimensions of SERVQUAL for accommodation sharing services were extracted: necessities, complementarity, reliability, empathy, assurance, responsiveness, authenticity, and similarity. In the empirical study, the results based on 588 valid samples show that all dimensions identified in this research have either direct or indirect impacts on consumers’ CI. The research findings hold great theoretical and practical significance.</div></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"9 4","pages":"Article 100102"},"PeriodicalIF":0.0,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data and information management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1