首页 > 最新文献

Data & Knowledge Engineering最新文献

英文 中文
CALEB: A Conditional Adversarial Learning Framework to enhance bot detection 迦勒:一个条件对抗学习框架,以增强机器人检测
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-14 DOI: 10.1016/j.datak.2023.102245
Ilias Dimitriadis, George Dialektakis, Athena Vakali

The high growth of Online Social Networks (OSNs) over the last few years has allowed automated accounts, known as social bots, to gain ground. As highlighted by other researchers, many of these bots have malicious purposes and tend to mimic human behavior, posing high-level security threats on OSN platforms. Moreover, recent studies have shown that social bots evolve over time by reforming and reinventing unforeseen and sophisticated characteristics, making them capable of evading the current machine learning state-of-the-art bot detection systems. This work is motivated by the critical need to establish adaptive bot detection methods in order to proactively capture unseen evolved bots towards healthier OSNs interactions. In contrast with most earlier supervised ML approaches which are limited by the inability to effectively detect new types of bots, this paper proposes CALEB, a robust end-to-end proactive framework based on the Conditional Generative Adversarial Network (CGAN) and its extension, Auxiliary Classifier GAN (AC-GAN), to simulate bot evolution by creating realistic synthetic instances of different bot types. These simulated evolved bots augment existing bot datasets and therefore enhance the detection of emerging generations of bots before they even appear. Furthermore, we show that our augmentation approach overpasses other earlier augmentation techniques which fail at simulating evolving bots. Extensive experimentation on well established public bot datasets, show that our approach offers a performance boost of up to 10% regarding the detection of new unseen bots. Finally, the use of the AC-GAN Discriminator as a bot detector, has outperformed former ML approaches, showcasing the efficiency of our end to end framework.

过去几年,在线社交网络(Online Social Networks,简称OSNs)的高速增长,使得被称为社交机器人(Social bots)的自动账户获得了发展。正如其他研究人员所强调的那样,许多这些机器人具有恶意目的,倾向于模仿人类行为,对OSN平台构成高级别的安全威胁。此外,最近的研究表明,随着时间的推移,社交机器人通过改革和重塑不可预见的复杂特征而进化,使它们能够逃避目前最先进的机器学习机器人检测系统。这项工作的动机是建立自适应机器人检测方法的迫切需要,以便主动捕获未见过的进化机器人,以实现更健康的osn交互。与大多数早期的监督式机器学习方法相比,这些方法受到无法有效检测新型机器人的限制,本文提出了CALEB,这是一种基于条件生成对抗网络(CGAN)及其扩展,辅助分类器GAN (AC-GAN)的鲁棒端到端主动框架,通过创建不同机器人类型的真实合成实例来模拟机器人进化。这些模拟进化的机器人增强了现有的机器人数据集,因此增强了对新一代机器人的检测,甚至在它们出现之前。此外,我们表明我们的增强方法优于其他早期的增强技术,这些技术在模拟进化机器人方面失败。在完善的公共机器人数据集上进行的大量实验表明,我们的方法在检测新的未见过的机器人方面提供了高达10%的性能提升。最后,使用AC-GAN鉴别器作为机器人检测器,优于以前的机器学习方法,展示了我们的端到端框架的效率。
{"title":"CALEB: A Conditional Adversarial Learning Framework to enhance bot detection","authors":"Ilias Dimitriadis,&nbsp;George Dialektakis,&nbsp;Athena Vakali","doi":"10.1016/j.datak.2023.102245","DOIUrl":"10.1016/j.datak.2023.102245","url":null,"abstract":"<div><p><span>The high growth of Online Social Networks<span> (OSNs) over the last few years has allowed automated accounts, known as social bots, to gain ground. As highlighted by other researchers, many of these bots have malicious purposes and tend to mimic human behavior, posing high-level security threats on OSN platforms. Moreover, recent studies have shown that social bots evolve over time by reforming and reinventing unforeseen and sophisticated characteristics, making them capable of evading the current machine learning<span> state-of-the-art bot detection systems. This work is motivated by the critical need to establish adaptive bot detection methods in order to proactively capture unseen evolved bots towards healthier OSNs interactions. In contrast with most earlier supervised ML approaches which are limited by the inability to effectively detect new types of bots, this paper proposes CALEB, a robust end-to-end proactive framework based on the Conditional </span></span></span>Generative Adversarial Network<span><span> (CGAN) and its extension, Auxiliary Classifier GAN (AC-GAN), to simulate bot evolution by creating realistic synthetic instances of different bot types. These simulated evolved bots augment existing bot datasets and therefore enhance the detection of emerging generations of bots before they even appear. Furthermore, we show that our augmentation approach overpasses other earlier augmentation techniques which fail at simulating evolving bots. Extensive experimentation on well established public bot datasets, show that our approach offers a performance boost of up to 10% regarding the detection of new unseen bots. Finally, the use of the AC-GAN </span>Discriminator as a bot detector, has outperformed former ML approaches, showcasing the efficiency of our end to end framework.</span></p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"149 ","pages":"Article 102245"},"PeriodicalIF":2.5,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135763694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What is the business value of your data? A multi-perspective empirical study on monetary valuation factors and methods for data governance 数据的商业价值是什么?货币估值因素与数据治理方法的多视角实证研究
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-11 DOI: 10.1016/j.datak.2023.102242
Frank Bodendorf, Jörg Franke

Digitalization has greatly increased the importance of data in recent years, making data an indispensable resource for value creation in our time. There is currently still a lack of theories as well as practicable methods and techniques for the monetary valuation of data, and data is therefore not yet sufficiently managed in terms of business management principles. In this context, this research is intended to design theory ingrained principles for a multidimensional conceptual approach to the monetary valuation of data as assets. We draw on the theory of dynamic capabilities as a further development of resource theory as well as value theory. To this end, the research conducts a qualitative field study followed by a quantitative survey study. Literature analysis is used to explain different dimensions in the qualitative field study. Structural equation modeling is used to analyze empirical data collected in the quantitative study. The results show that data value determination is a multidimensional and hierarchical construct consisting of three primary dimensions. These are the benefit-oriented, cost-oriented, and quality-oriented dimensions. The results also confirm that institutional pressures (coercive, normative, mimetic) that influence organizational behaviors lead to a greater intention for organizations to adapt a monetary data value determination.

近年来,数字化大大提高了数据的重要性,使数据成为我们这个时代创造价值不可或缺的资源。目前还缺乏对数据进行货币估值的理论,也缺乏切实可行的方法和技术,因此在商业管理原则方面对数据的管理还不够充分。在这种情况下,本研究的目的是设计理论根深蒂固的原则,多维的概念方法,以数据作为资产的货币估值。我们借鉴了动态能力理论,作为资源理论和价值理论的进一步发展。为此,本研究首先进行定性的实地研究,然后进行定量的调查研究。在定性的实地研究中,文献分析法被用来解释不同的维度。采用结构方程模型对定量研究中收集的经验数据进行分析。结果表明,数据值确定是一个多维层次结构,由三个主要维度组成。这三个维度分别是利益导向、成本导向和质量导向。研究结果还证实,影响组织行为的制度压力(强制性、规范性、模仿性)导致组织更倾向于适应货币数据价值的确定。
{"title":"What is the business value of your data? A multi-perspective empirical study on monetary valuation factors and methods for data governance","authors":"Frank Bodendorf,&nbsp;Jörg Franke","doi":"10.1016/j.datak.2023.102242","DOIUrl":"10.1016/j.datak.2023.102242","url":null,"abstract":"<div><p><span>Digitalization has greatly increased the importance of data in recent years, making data an indispensable resource for value creation in our time. There is currently still a lack of theories as well as practicable methods and techniques for the monetary valuation of data, and data is therefore not yet sufficiently managed in terms of business management principles. In this context, this research is intended to design theory ingrained principles for a multidimensional conceptual approach to the monetary valuation of data as assets. We draw on the theory of dynamic capabilities as a further development of resource theory as well as value theory. To this end, the research conducts a qualitative field study followed by a quantitative survey study. Literature analysis is used to explain different dimensions in the qualitative field study. Structural equation modeling is used to analyze empirical data collected in the quantitative study. The results show that data value determination is a multidimensional and hierarchical construct consisting of three primary dimensions. These are the benefit-oriented, cost-oriented, and quality-oriented dimensions. The results also confirm that institutional pressures (coercive, normative, mimetic) that influence </span>organizational behaviors lead to a greater intention for organizations to adapt a monetary data value determination.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"149 ","pages":"Article 102242"},"PeriodicalIF":2.5,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135714069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A framework for approximate product search using faceted navigation and user preference ranking 一个使用分面导航和用户偏好排序的近似产品搜索框架
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-11 DOI: 10.1016/j.datak.2023.102241
Damir Vandic , Lennart J. Nederstigt , Flavius Frasincar , Uzay Kaymak , Enzo Ido

One of the problems that e-commerce users face is that the desired products are sometimes not available and Web shops fail to provide similar products due to their exclusive reliance on Boolean faceted search. User preferences are also often not taken into account. In order to address these problems, we present a novel framework specifically geared towards approximate faceted search within the product catalog of a Web shop. It is based on adaptations to the p-norm extended Boolean model, to account for the domain-specific characteristics of faceted search in an e-commerce environment. These e-commerce specific characteristics are, for example, the use of quantitative properties and the presence of user preferences. Our approach explores the concept of facet similarity functions in order to better match products to queries. In addition, the user preferences are used to assign importance weights to the query terms. Using a large-scale experimental setup based on real-world data, we conclude that the proposed algorithm outperforms the considered benchmark algorithms. Last, we have performed a user-based study in which we found that users who use our approach find more relevant products with less effort.

电子商务用户面临的一个问题是,有时无法获得所需的产品,而Web商店由于完全依赖布尔面搜索而无法提供类似的产品。用户偏好通常也不会被考虑在内。为了解决这些问题,我们提出了一个新的框架,专门针对Web商店产品目录中的近似分面搜索。它基于对p范数扩展布尔模型的适应,以解释电子商务环境中分面搜索的领域特定特征。这些电子商务的具体特征是,例如,定量特性的使用和用户偏好的存在。我们的方法探索了面相似函数的概念,以便更好地将产品与查询匹配。此外,用户首选项用于为查询项分配重要性权重。使用基于真实世界数据的大规模实验设置,我们得出结论,提出的算法优于考虑的基准算法。最后,我们进行了一项基于用户的研究,我们发现使用我们的方法的用户用更少的努力找到了更多相关的产品。
{"title":"A framework for approximate product search using faceted navigation and user preference ranking","authors":"Damir Vandic ,&nbsp;Lennart J. Nederstigt ,&nbsp;Flavius Frasincar ,&nbsp;Uzay Kaymak ,&nbsp;Enzo Ido","doi":"10.1016/j.datak.2023.102241","DOIUrl":"10.1016/j.datak.2023.102241","url":null,"abstract":"<div><p>One of the problems that e-commerce users face is that the desired products are sometimes not available and Web shops fail to provide similar products due to their exclusive reliance on Boolean faceted search. User preferences are also often not taken into account. In order to address these problems, we present a novel framework specifically geared towards approximate faceted search within the product catalog of a Web shop. It is based on adaptations to the p-norm extended Boolean model, to account for the domain-specific characteristics of faceted search in an e-commerce environment. These e-commerce specific characteristics are, for example, the use of quantitative properties and the presence of user preferences. Our approach explores the concept of facet similarity functions in order to better match products to queries. In addition, the user preferences are used to assign importance weights to the query terms. Using a large-scale experimental setup based on real-world data, we conclude that the proposed algorithm outperforms the considered benchmark algorithms. Last, we have performed a user-based study in which we found that users who use our approach find more relevant products with less effort.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"149 ","pages":"Article 102241"},"PeriodicalIF":2.5,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135664518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering and evaluating organizational knowledge from textual data: Application to crisis management 从文本数据中发现和评估组织知识:在危机管理中的应用
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-01 DOI: 10.1016/j.datak.2023.102237
Dhouha Grissa, Eric Andonoff, Chihab Hanachi

Crisis management effectiveness relies mainly on the quality of the distributed human organization deployed for saving lives, limiting damage and reducing risks. Organizations set up in this context are not always predefined and static; they could evolve and new forms could emerge since actors, such as volunteers or NGO, could join dynamically to collaborate. To improve crisis resolution effectiveness, it is first important to understand, analyze and evaluate such dynamic organizations in order to adjust crisis management plans and ease coordination among actors. Giving a textual experience feedback from past crisis, the objective of this paper is to discover the organizational structure deployed in the considered crisis and then evaluate it according to a set of criteria. For that purpose, we combine in a coherent framework text and association rule mining for pattern discovery and annotation, and multi-agent system models and techniques for formally building and evaluating organizational structures. We present the OSminer algorithm that discovers association rules based on relevant textual patterns and then builds an organizational structure including three main relations between actors: power, control and coordination. A real-life case study, a flood crisis hitting the south west of France, serves as a basis for testing/experimenting our solution. The organizational structure, discovered in this case study, has 24 actors. Its evaluation indicates its efficiency, but shows that it is neither robust nor flexible. Our findings highlight the potential of our approach to discover and evaluate organizational structures from a text recording interactions between stakeholders in a crisis context.

危机管理的有效性主要依赖于为拯救生命、限制损害和降低风险而部署的分布式人员组织的质量。在这方面建立的组织并不总是预先确定的和静态的;它们可以进化,新的形式可以出现,因为参与者,如志愿者或非政府组织,可以动态地加入合作。为了提高危机解决的有效性,首先重要的是了解、分析和评估这些动态组织,以便调整危机管理计划,减轻行动者之间的协调。从过去的危机中获得文本经验反馈,本文的目的是发现在考虑的危机中部署的组织结构,然后根据一套标准对其进行评估。为此,我们在一个连贯的框架中结合了用于模式发现和注释的文本和关联规则挖掘,以及用于正式构建和评估组织结构的多智能体系统模型和技术。我们提出了基于相关文本模式发现关联规则的OSminer算法,然后构建了一个包含行动者之间三种主要关系的组织结构:权力、控制和协调。一个现实生活中的案例研究,一个袭击法国西南部的洪水危机,作为测试/试验我们的解决方案的基础。在本案例研究中发现,组织结构有24个参与者。评价表明该方法是有效的,但鲁棒性和灵活性较差。我们的研究结果强调了我们的方法的潜力,即通过记录危机背景下利益相关者之间的互动来发现和评估组织结构。
{"title":"Discovering and evaluating organizational knowledge from textual data: Application to crisis management","authors":"Dhouha Grissa,&nbsp;Eric Andonoff,&nbsp;Chihab Hanachi","doi":"10.1016/j.datak.2023.102237","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102237","url":null,"abstract":"<div><p><span><span>Crisis management effectiveness relies mainly on the quality of the distributed human organization deployed for saving lives, limiting damage and reducing risks. Organizations set up in this context are not always predefined and static; they could evolve and new forms could emerge since actors, such as volunteers or NGO, could join dynamically to collaborate. To improve crisis resolution effectiveness, it is first important to understand, analyze and evaluate such dynamic organizations in order to adjust crisis management plans and ease coordination among actors. Giving a textual experience feedback from past crisis, the objective of this paper is to discover the organizational structure deployed in the considered crisis and then evaluate it according to a set of criteria. For that purpose, we combine in a coherent framework text and </span>association rule mining for pattern discovery and annotation, and multi-agent system models and techniques for formally building and evaluating organizational structures. We present the </span><em>OSminer</em> algorithm that discovers association rules based on relevant textual patterns and then builds an organizational structure including three main relations between actors: power, control and coordination. A real-life case study, a flood crisis hitting the south west of France, serves as a basis for testing/experimenting our solution. The organizational structure, discovered in this case study, has 24 actors. Its evaluation indicates its efficiency, but shows that it is neither robust nor flexible. Our findings highlight the potential of our approach to discover and evaluate organizational structures from a text recording interactions between stakeholders in a crisis context.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"148 ","pages":"Article 102237"},"PeriodicalIF":2.5,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92046670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User-generated short-text classification using cograph editing-based network clustering with an application in invoice categorization 基于图形编辑的网络聚类用户生成短文本分类及其在发票分类中的应用
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-01 DOI: 10.1016/j.datak.2023.102238
Dewan F. Wahid , Elkafi Hassini

Rapid adaptation of online business platforms in every sector creates an enormous amount of user-generated textual data related to providing product or service descriptions, reviewing, marketing, invoicing and bookkeeping. These data are often short in size, noisy (e.g., misspellings, abbreviations), and do not have accurate classifying labels (line-item categories). Classifying these user-generated short-text data with appropriate line-item categories is crucial for corresponding platforms to understand users’ needs. This paper proposed a framework for user-generated short-text classification based on identified line-item categories. In the line-item identification phase, we used cograph editing (CoE)-based clustering on keywords network, which can be formulated from users’ generated short-texts. We also proposed integer linear programming (ILP) formulations for CoE on weighted networks and designed a heuristic algorithm to identify clusters in large-scale networks. Finally, we outlined an application of this framework to categorize invoices in an empirical setting. Our framework showed promising results in identifying invoice line-item categories for large-scale data.

每个领域的在线商业平台的快速适应创造了大量的用户生成的文本数据,这些数据与提供产品或服务描述、审查、营销、发票和簿记有关。这些数据通常是短的,嘈杂的(例如,拼写错误,缩写),并且没有准确的分类标签(行-项分类)。将这些用户生成的短文本数据分类为适当的行项分类对于相应的平台了解用户需求至关重要。本文提出了一种基于已识别的行项分类的用户生成短文本分类框架。在行项识别阶段,我们使用基于cograph编辑(CoE)的关键词网络聚类,该网络可以从用户生成的短文本中形成。我们还提出了加权网络上的整数线性规划(ILP)公式,并设计了一种启发式算法来识别大规模网络中的聚类。最后,我们概述了该框架在经验设置中对发票进行分类的应用程序。我们的框架在识别大规模数据的发票行-项类别方面显示出有希望的结果。
{"title":"User-generated short-text classification using cograph editing-based network clustering with an application in invoice categorization","authors":"Dewan F. Wahid ,&nbsp;Elkafi Hassini","doi":"10.1016/j.datak.2023.102238","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102238","url":null,"abstract":"<div><p>Rapid adaptation of online business platforms in every sector creates an enormous amount of user-generated textual data related to providing product or service descriptions, reviewing, marketing, invoicing and bookkeeping. These data are often short in size, noisy (e.g., misspellings, abbreviations), and do not have accurate classifying labels (line-item categories). Classifying these user-generated short-text data with appropriate line-item categories is crucial for corresponding platforms to understand users’ needs. This paper proposed a framework for user-generated short-text classification based on identified line-item categories. In the line-item identification phase, we used cograph editing (CoE)-based clustering on keywords network, which can be formulated from users’ generated short-texts. We also proposed integer linear programming (ILP) formulations for CoE on weighted networks and designed a heuristic algorithm to identify clusters in large-scale networks. Finally, we outlined an application of this framework to categorize invoices in an empirical setting. Our framework showed promising results in identifying invoice line-item categories for large-scale data.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"148 ","pages":"Article 102238"},"PeriodicalIF":2.5,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92136240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Challenges of a Data Ecosystem for scientific data 科学数据数据生态系统的挑战
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-01 DOI: 10.1016/j.datak.2023.102236
Edoardo Ramalli, Barbara Pernici

Data Ecosystems (DE) are used across various fields and applications. They facilitate collaboration between organizations, such as companies or research institutions, enabling them to share data and services. A DE can boost research outcomes by managing and extracting value from the increasing volume of generated and shared data in the last decades. However, the adoption of DE solutions for scientific data by R&D departments and scientific communities is still difficult. Scientific data are challenging to manage, and, as a result, a considerable part of this information still needs to be annotated and organized in order to be shared. This work discusses the challenges of employing DE in scientific domains and the corresponding potential mitigations. First, scientific data and their typologies are contextualized, then their unique characteristics are discussed. Typical properties regarding their high heterogeneity and uncertainty make assessing their consistency and accuracy problematic. In addition, this work discusses the specific requirements expressed by the scientific communities when it comes to integrating a DE solution into their workflow. The unique properties of scientific data and domain-specific requirements create a challenging setting for adopting DEs. The challenges are expressed as general research questions, and this work explores the corresponding solutions in terms of data management aspects. Finally, the paper presents a real-world scenario with more technical details.

数据生态系统(DE)用于各种领域和应用程序。它们促进了公司或研究机构等组织之间的协作,使它们能够共享数据和服务。DE可以通过管理和从过去几十年不断增加的生成和共享数据中提取价值来促进研究成果。然而,研发部门和科学界对科学数据采用DE解决方案仍然很困难。科学数据的管理具有挑战性,因此,为了共享,这些信息的相当一部分仍然需要注释和组织。本工作讨论了在科学领域中使用DE的挑战以及相应的潜在缓解措施。首先,对科学数据及其类型学进行了语境化,然后讨论了它们的独特特征。关于它们的高异质性和不确定性的典型属性使得评估它们的一致性和准确性存在问题。此外,本文还讨论了科学界在将DE解决方案集成到他们的工作流程中时所表达的特定需求。科学数据的独特属性和特定领域的需求为采用DEs创造了一个具有挑战性的环境。这些挑战被表达为一般的研究问题,本工作从数据管理方面探索了相应的解决方案。最后,本文给出了一个具有更多技术细节的真实场景。
{"title":"Challenges of a Data Ecosystem for scientific data","authors":"Edoardo Ramalli,&nbsp;Barbara Pernici","doi":"10.1016/j.datak.2023.102236","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102236","url":null,"abstract":"<div><p>Data Ecosystems (DE) are used across various fields and applications. They facilitate collaboration between organizations, such as companies or research institutions, enabling them to share data and services. A DE can boost research outcomes by managing and extracting value from the increasing volume of generated and shared data in the last decades. However, the adoption of DE solutions for scientific data by R&amp;D departments and scientific communities is still difficult. Scientific data are challenging to manage, and, as a result, a considerable part of this information still needs to be annotated and organized in order to be shared. This work discusses the challenges of employing DE in scientific domains and the corresponding potential mitigations. First, scientific data and their typologies are contextualized, then their unique characteristics are discussed. Typical properties regarding their high heterogeneity and uncertainty make assessing their consistency and accuracy problematic. In addition, this work discusses the specific requirements expressed by the scientific communities when it comes to integrating a DE solution into their workflow. The unique properties of scientific data and domain-specific requirements create a challenging setting for adopting DEs. The challenges are expressed as general research questions, and this work explores the corresponding solutions in terms of data management aspects. Finally, the paper presents a real-world scenario with more technical details.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"148 ","pages":"Article 102236"},"PeriodicalIF":2.5,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0169023X23000964/pdfft?md5=98e3f9c9e5690c131b72c032eddd9253&pid=1-s2.0-S0169023X23000964-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92046668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards comparable ratings: Exploring bias in German physician reviews 迈向可比评级:探索德国医师评价的偏倚
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-01 DOI: 10.1016/j.datak.2023.102235
Joschka Kersting, Falk Maoro, Michaela Geierhos

In this study, we evaluate the impact of gender-biased data from German-language physician reviews on the fairness of fine-tuned language models. For two different downstream tasks, we use data reported to be gender biased and aggregate it with annotations. First, we propose a new approach to aspect-based sentiment analysis that allows identifying, extracting, and classifying implicit and explicit aspect phrases and their polarity within a single model. The second task we present is grade prediction, where we predict the overall grade of a review on the basis of the review text. For both tasks, we train numerous transformer models and evaluate their performance. The aggregation of sensitive attributes, such as a physician’s gender and migration background, with individual text reviews allows us to measure the performance of the models with respect to these sensitive groups. These group-wise performance measures act as extrinsic bias measures for our downstream tasks. In addition, we translate several gender-specific templates of the intrinsic bias metrics into the German language and evaluate our fine-tuned models. Based on this set of tasks, fine-tuned models, and intrinsic and extrinsic bias measures, we perform correlation analyses between intrinsic and extrinsic bias measures. In terms of sensitive groups and effect sizes, our bias measure results show different directions. Furthermore, correlations between measures of intrinsic and extrinsic bias can be observed in different directions. This leads us to conclude that gender-biased data does not inherently lead to biased models. Other variables, such as template dependency for intrinsic measures and label distribution in the data, must be taken into account as they strongly influence the metric results. Therefore, we suggest that metrics and templates should be chosen according to the given task and the biases to be assessed.

在这项研究中,我们评估了来自德语医师评论的性别偏见数据对微调语言模型公平性的影响。对于两个不同的下游任务,我们使用报告的有性别偏见的数据,并将其与注释一起汇总。首先,我们提出了一种基于方面的情感分析的新方法,该方法允许在单个模型中识别、提取和分类隐式和显式方面短语及其极性。我们提出的第二个任务是成绩预测,我们根据复习文本预测复习的总体成绩。对于这两项任务,我们训练了许多变压器模型并评估了它们的性能。敏感属性的聚合,例如医生的性别和迁移背景,以及单独的文本审查,使我们能够根据这些敏感组度量模型的性能。这些团队绩效指标作为我们下游任务的外在偏差指标。此外,我们将几个性别特定的内在偏见指标模板翻译成德语,并评估我们的微调模型。基于这组任务、微调模型以及内在和外在偏差测量,我们对内在和外在偏差测量进行了相关性分析。在敏感群体和效应大小方面,我们的偏倚测量结果显示出不同的方向。此外,内在偏差和外在偏差之间的相关性可以在不同的方向上观察到。这使我们得出结论,性别偏见的数据并不必然导致有偏见的模型。其他变量,如固有度量的模板依赖关系和数据中的标签分布,必须考虑在内,因为它们强烈影响度量结果。因此,我们建议指标和模板应该根据给定的任务和要评估的偏差来选择。
{"title":"Towards comparable ratings: Exploring bias in German physician reviews","authors":"Joschka Kersting,&nbsp;Falk Maoro,&nbsp;Michaela Geierhos","doi":"10.1016/j.datak.2023.102235","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102235","url":null,"abstract":"<div><p>In this study, we evaluate the impact of gender-biased data from German-language physician reviews on the fairness of fine-tuned language models. For two different downstream tasks, we use data reported to be gender biased and aggregate it with annotations. First, we propose a new approach to aspect-based sentiment analysis that allows identifying, extracting, and classifying implicit and explicit aspect phrases and their polarity within a single model. The second task we present is grade prediction, where we predict the overall grade of a review on the basis of the review text. For both tasks, we train numerous transformer models and evaluate their performance. The aggregation of sensitive attributes, such as a physician’s gender and migration background, with individual text reviews allows us to measure the performance of the models with respect to these sensitive groups. These group-wise performance measures act as extrinsic bias measures for our downstream tasks. In addition, we translate several gender-specific templates of the intrinsic bias metrics into the German language and evaluate our fine-tuned models. Based on this set of tasks, fine-tuned models, and intrinsic and extrinsic bias measures, we perform correlation analyses between intrinsic and extrinsic bias measures. In terms of sensitive groups and effect sizes, our bias measure results show different directions. Furthermore, correlations between measures of intrinsic and extrinsic bias can be observed in different directions. This leads us to conclude that gender-biased data does not inherently lead to biased models. Other variables, such as template dependency for intrinsic measures and label distribution in the data, must be taken into account as they strongly influence the metric results. Therefore, we suggest that metrics and templates should be chosen according to the given task and the biases to be assessed.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"148 ","pages":"Article 102235"},"PeriodicalIF":2.5,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0169023X23000952/pdfft?md5=035f0e2eec55531089e125433a25b2bc&pid=1-s2.0-S0169023X23000952-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92046250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An efficient and scalable SPARQL query processing framework for big data using MapReduce and hybrid optimum load balancing 使用MapReduce和混合最优负载平衡的高效可扩展的SPARQL大数据查询处理框架
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-01 DOI: 10.1016/j.datak.2023.102239
V. Naveen Kumar , Ashok Kumar P.S.

The increasing RDF (Resource Description Framework) data volume requires a Hadoop platform for processing queries over large datasets. In this work, SPARQL (Simple Protocol and Rdf Query Language) queries are evaluated with Hadoop based on the objective of minimizing the number of joins through data partitioning for performing map/reduce jobs. The query evaluation time and the number of cross node joins are minimized with the proposed partitioning techniques. Extended vertical partitioning is proposed for distributed data stores based on objects’ explicit information for splitting predicates. For accessing the RDF data, hybrid monarch butterfly with beetle swarm load balancing optimization with Map-reduce (Hybrid Optimum Load Balancing) is applied. The proposed SPARQL query processing is evaluated over large RDF datasets. The proposed approach’s evaluation results are analyzed with the existing approaches, indicating the proposed framework’s efficiency. By using the proposed approach, an accuracy of 97 % is obtained.

不断增长的RDF(资源描述框架)数据量需要一个Hadoop平台来处理对大型数据集的查询。在这项工作中,SPARQL(简单协议和Rdf查询语言)查询是基于通过执行map/reduce作业的数据分区最小化连接数量的目标与Hadoop一起评估的。所提出的分区技术最大限度地减少了查询评估时间和交叉节点连接的数量。针对分布式数据存储,提出了基于对象显式信息的扩展垂直分区,用于划分谓词。对于RDF数据的访问,采用Map-reduce (hybrid optimal load balancing,混合最优负载平衡)混合帝王蝶与甲虫群负载平衡优化。建议的SPARQL查询处理是在大型RDF数据集上进行评估的。将该方法的评价结果与现有方法进行了对比分析,表明了该框架的有效性。通过使用该方法,获得了97%的准确率。
{"title":"An efficient and scalable SPARQL query processing framework for big data using MapReduce and hybrid optimum load balancing","authors":"V. Naveen Kumar ,&nbsp;Ashok Kumar P.S.","doi":"10.1016/j.datak.2023.102239","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102239","url":null,"abstract":"<div><p>The increasing RDF (Resource Description Framework) data volume requires a Hadoop<span> platform for processing queries over large datasets. In this work, SPARQL (Simple Protocol and Rdf Query Language) queries are evaluated with Hadoop based on the objective of minimizing the number of joins through data partitioning for performing map/reduce jobs. The query evaluation time and the number of cross node joins are minimized with the proposed partitioning techniques. Extended vertical partitioning is proposed for distributed data stores based on objects’ explicit information for splitting predicates. For accessing the RDF data, hybrid monarch butterfly with beetle swarm load balancing optimization with Map-reduce (Hybrid Optimum Load Balancing) is applied. The proposed SPARQL query processing is evaluated over large RDF datasets. The proposed approach’s evaluation results are analyzed with the existing approaches, indicating the proposed framework’s efficiency. By using the proposed approach, an accuracy of 97 % is obtained.</span></p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"148 ","pages":"Article 102239"},"PeriodicalIF":2.5,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92067826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance 基于可解释和专业化深度强化学习的预测性维护的分层框架
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-31 DOI: 10.1016/j.datak.2023.102240
Ammar N. Abbas , Georgios C. Chasparis , John D. Kelleher

Deep reinforcement learning holds significant potential for application in industrial decision-making, offering a promising alternative to traditional physical models. However, its black-box learning approach presents challenges for real-world and safety-critical systems, as it lacks interpretability and explanations for the derived actions. Moreover, a key research question in deep reinforcement learning is how to focus policy learning on critical decisions within sparse domains. This paper introduces a novel approach that combines probabilistic modeling and reinforcement learning, providing interpretability and addressing these challenges in the context of safety-critical predictive maintenance. The methodology is activated in specific situations identified through the input–output hidden Markov model, such as critical conditions or near-failure scenarios. To mitigate the challenges associated with deep reinforcement learning in safety-critical predictive maintenance, the approach is initialized with a baseline policy using behavioral cloning, requiring minimal interactions with the environment. The effectiveness of this framework is demonstrated through a case study on predictive maintenance for turbofan engines, outperforming previous approaches and baselines, while also providing the added benefit of interpretability. Importantly, while the framework is applied to a specific use case, this paper aims to present a general methodology that can be applied to diverse predictive maintenance applications.

深度强化学习在工业决策中具有巨大的应用潜力,为传统物理模型提供了一个有希望的替代方案。然而,它的黑箱学习方法对现实世界和安全关键系统提出了挑战,因为它缺乏对派生行为的可解释性和解释。此外,深度强化学习的一个关键研究问题是如何将策略学习集中在稀疏域内的关键决策上。本文介绍了一种结合概率建模和强化学习的新方法,提供可解释性,并在安全关键预测性维护的背景下解决这些挑战。该方法通过输入-输出隐马尔可夫模型在特定情况下被激活,例如关键条件或接近故障的场景。为了减轻在安全关键预测性维护中与深度强化学习相关的挑战,该方法使用行为克隆的基线策略初始化,需要与环境进行最小的交互。通过对涡轮风扇发动机预测性维护的案例研究,证明了该框架的有效性,优于以前的方法和基线,同时还提供了可解释性的额外好处。重要的是,虽然该框架应用于特定的用例,但本文旨在提出一种可应用于各种预测性维护应用程序的通用方法。
{"title":"Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance","authors":"Ammar N. Abbas ,&nbsp;Georgios C. Chasparis ,&nbsp;John D. Kelleher","doi":"10.1016/j.datak.2023.102240","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102240","url":null,"abstract":"<div><p>Deep reinforcement learning holds significant potential for application in industrial decision-making, offering a promising alternative to traditional physical models. However, its black-box learning approach presents challenges for real-world and safety-critical systems, as it lacks interpretability and explanations for the derived actions. Moreover, a key research question in deep reinforcement learning is how to focus policy learning on critical decisions within sparse domains. This paper introduces a novel approach that combines probabilistic modeling and reinforcement learning, providing interpretability and addressing these challenges in the context of safety-critical predictive maintenance. The methodology is activated in specific situations identified through the input–output hidden Markov model, such as critical conditions or near-failure scenarios. To mitigate the challenges associated with deep reinforcement learning in safety-critical predictive maintenance, the approach is initialized with a baseline policy using behavioral cloning, requiring minimal interactions with the environment. The effectiveness of this framework is demonstrated through a case study on predictive maintenance for turbofan engines, outperforming previous approaches and baselines, while also providing the added benefit of interpretability. Importantly, while the framework is applied to a specific use case, this paper aims to present a general methodology that can be applied to diverse predictive maintenance applications.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"149 ","pages":"Article 102240"},"PeriodicalIF":2.5,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92059899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SkiQL: A unified schema query language SkiQL:一种统一的模式查询语言
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-11 DOI: 10.1016/j.datak.2023.102234
Carlos J. Fernández Candel, Jesús J. García-Molina, Diego Sevilla Ruiz

Most NoSQL systems are schema-on-read: data can be stored without first having to declare a schema that imposes a structure. This schemaless feature offers flexibility to evolve data-intensive applications when data change frequently. However, freeing from declaring schemas does not mean their absence, but rather that they are implicit in data and code. Therefore, diagramming tools similar to those available for relational systems are also needed to help developers and administrators to design and to understand NoSQL schemas.

Visualizing diagrams is not practical if schemas contain hundreds of database entities, so exploration or query facilities are then needed. In schemaless NoSQL stores, data of the same entity can be stored with different structure (e.g., non-uniform types and optional fields), which can increase the difficulty of having readable diagrams.

NoSQL schema management tools should therefore have three main components: schema extraction, schema visualization, and schema query. As there are four main NoSQL data models, it is convenient for such tools to be built on a generic data model so that they provide platform-independence (of data models and data stores) to query and visualize schemas. With the aim of favoring the creation of generic database tools, the authors of this paper defined the U-Schema unified data model that integrates the four main NoSQL data models as well as the relational model.

This paper is focused on querying NoSQL and relational schemas which are represented as U-Schema models. We present the SkiQL language designed on U-Schema to achieve a platform-independent schema query service. SkiQL provides two constructs: schema-query and relationship-query. The former allows to obtain information of entity or relationship types, and the latter that of the aggregations or references (relations among types). We will show how SkiQL was evaluated by calculating well-known metrics for languages as well as using a survey with developers with experience in NoSQL.

大多数NoSQL系统都是读时模式:可以存储数据,而不必首先声明强加结构的模式。当数据频繁更改时,此无架构功能提供了发展数据密集型应用程序的灵活性。然而,从声明模式中解放出来并不意味着它们不存在,而是意味着它们隐含在数据和代码中。因此,还需要类似于关系系统的图表工具来帮助开发人员和管理员设计和理解NoSQL模式。如果模式包含数百个数据库实体,那么可视化图表是不可行的,因此需要探索或查询功能。在无模式的NoSQL存储中,同一实体的数据可以用不同的结构存储(例如,非统一类型和可选字段),这会增加具有可读图表的难度。因此,NoSQL模式管理工具应该有三个主要组件:模式提取、模式可视化和模式查询。由于有四个主要的NoSQL数据模型,因此这些工具可以方便地构建在通用数据模型上,从而提供(数据模型和数据存储的)平台独立性,以查询和可视化模式。为了有利于通用数据库工具的创建,本文作者定义了U-Schema统一数据模型,该模型集成了四个主要的NoSQL数据模型以及关系模型。本文的重点是查询NoSQL和以U-Schema模型表示的关系模式。我们提出了基于U-Schema设计的SkiQL语言,以实现独立于平台的模式查询服务。SkiQL提供了两种构造:模式查询和关系查询。前者允许获取实体或关系类型的信息,后者允许获取聚合或引用(类型之间的关系)的信息。我们将展示SkiQL是如何通过计算语言的知名指标以及对具有NoSQL经验的开发人员进行调查来评估的。
{"title":"SkiQL: A unified schema query language","authors":"Carlos J. Fernández Candel,&nbsp;Jesús J. García-Molina,&nbsp;Diego Sevilla Ruiz","doi":"10.1016/j.datak.2023.102234","DOIUrl":"https://doi.org/10.1016/j.datak.2023.102234","url":null,"abstract":"<div><p>Most NoSQL systems are schema-on-read: data can be stored without first having to declare a schema that imposes a structure. This schemaless feature offers flexibility to evolve data-intensive applications when data change frequently. However, freeing from declaring schemas does not mean their absence, but rather that they are implicit in data and code. Therefore, diagramming tools similar to those available for relational systems are also needed to help developers and administrators to design and to understand NoSQL schemas.</p><p>Visualizing diagrams is not practical if schemas contain hundreds of database entities, so exploration or query facilities are then needed. In schemaless NoSQL stores, data of the same entity can be stored with different structure (e.g., non-uniform types and optional fields), which can increase the difficulty of having readable diagrams.</p><p>NoSQL schema management tools should therefore have three main components: schema extraction, schema visualization, and schema query. As there are four main NoSQL data models, it is convenient for such tools to be built on a generic data model so that they provide platform-independence (of data models and data stores) to query and visualize schemas. With the aim of favoring the creation of generic database tools, the authors of this paper defined the U-Schema unified data model that integrates the four main NoSQL data models as well as the relational model.</p><p>This paper is focused on querying NoSQL and relational schemas which are represented as U-Schema models. We present the SkiQL language designed on U-Schema to achieve a platform-independent schema query service. SkiQL provides two constructs: schema-query and relationship-query. The former allows to obtain information of entity or relationship types, and the latter that of the aggregations or references (relations among types). We will show how SkiQL was evaluated by calculating well-known metrics for languages as well as using a survey with developers with experience in NoSQL.</p></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"148 ","pages":"Article 102234"},"PeriodicalIF":2.5,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49749816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data & Knowledge Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1