Objects and object complexes in 3D, as well as those in 2D, have many possible representations. Among them, skeletal representations have special advantages along with some limitations. For the special form of skeletal representation called "s-reps," these advantages include strong suitability for representing populations of slabular objects and for statistical analysis of such populations. These statistical applications are best accomplished by recognizing that s-reps live on a curved shape space. Here we lay out the definition of s-reps, their advantages and limitations, their mathematical properties, methods for fitting s-reps to single- and multi-object boundaries, methods for computing statistics of these object and multi-object representations, and examples of applications involving such statistics. While the basic theory, ideas, and programs for the methods are described in this paper, and while many applications with evaluations have been produced, there remain many interesting open opportunities for research on comparisons to other shape representations, on new areas of application, and on further methodological developments, many of which are explicitly discussed here.
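To make the curved-shape-space point concrete: the spoke directions of an s-rep are unit vectors, i.e., points on the sphere S², and a Euclidean average of such directions generally leaves the sphere. The minimal Python sketch below computes a Fréchet (geodesic) mean of directions by tangent-space averaging; it only illustrates why curved-space statistics are needed and is not the statistical machinery developed for s-reps.

```python
import numpy as np

def frechet_mean_sphere(dirs, iters=100, tol=1e-10):
    """Fréchet (geodesic) mean of unit vectors on S^2 via iterative
    log-map / exp-map averaging. `dirs` is an (n, 3) float array of
    unit vectors."""
    mu = dirs.mean(axis=0)
    mu /= np.linalg.norm(mu)           # initial guess on the sphere
    for _ in range(iters):
        # Log map: project each direction into the tangent plane at mu.
        cos_t = np.clip(dirs @ mu, -1.0, 1.0)
        theta = np.arccos(cos_t)                 # geodesic distances to mu
        perp = dirs - cos_t[:, None] * mu        # tangential components
        norms = np.linalg.norm(perp, axis=1)
        safe = norms > 1e-12                     # skip points at/near mu
        tangents = np.zeros_like(dirs)
        tangents[safe] = perp[safe] / norms[safe, None] * theta[safe, None]
        step = tangents.mean(axis=0)             # average in tangent space
        n = np.linalg.norm(step)
        if n < tol:
            break
        # Exp map: walk back onto the sphere along the mean tangent.
        mu = np.cos(n) * mu + np.sin(n) * (step / n)
        mu /= np.linalg.norm(mu)
    return mu
```

Replacing this geodesic mean with a plain `dirs.mean(axis=0)` would pull the result off the sphere toward its interior, which is exactly the distortion that statistics on a curved shape space are designed to avoid.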
Measuring the organisation of the cellular cytoskeleton and the surrounding extracellular matrix (ECM) is currently of wide interest, as changes in both local and global alignment can highlight alterations in cellular functions and in the material properties of the extracellular environment. Different approaches have been developed to quantify these structures, typically based either on fibre segmentation or on matrix representation and transformation of the image, each with its own advantages and disadvantages. Here we present AFT (Alignment by Fourier Transform), a workflow to quantify the alignment of fibrillar features in microscopy images by exploiting 2D Fast Fourier Transforms (FFTs). Using pre-existing datasets of cell and ECM images, we demonstrate our approach and compare this workflow with two other well-known ImageJ algorithms for quantifying image feature alignment. These comparisons reveal that AFT has a number of advantages owing to its grid-based FFT approach. 1) Flexibility in defining the window and neighbourhood sizes allows a parameter search to determine the optimal length scale at which to compute alignment metrics; the approach can thus easily accommodate different image resolutions and biological systems. 2) The length scale of decay in alignment can be extracted by comparing neighbourhood sizes, revealing the overall distance over which features remain anisotropic. 3) The approach is agnostic to the signal source, making it applicable to a wide range of imaging modalities, and it depends on fewer input parameters than segmentation methods. 4) Finally, compared to segmentation methods, the algorithm is computationally inexpensive: high-resolution images can be evaluated in less than a second on a standard desktop computer. This makes it feasible to screen numerous experimental perturbations or to examine large images over long length scales. Implementations are made available in both MATLAB and Python for wider accessibility, with example datasets for single images and batch processing. Additionally, we include an approach to automatically search for optimal window and neighbourhood sizes, as well as to measure the decay in alignment over progressively increasing length scales.
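As a concrete illustration of the grid-based FFT idea (a minimal sketch, not the released AFT implementation), the Python code below estimates a dominant fibre orientation for each image window from the second moments of its 2D power spectrum, then scores alignment as an order parameter, cos 2Δθ, within neighbourhoods of windows. The window, step, and neighbourhood parameters are illustrative assumptions.

```python
import numpy as np

def window_orientation(window):
    """Estimate the dominant fibre orientation of one image window from
    the second moments of its 2D power spectrum."""
    win = window * np.hanning(window.shape[0])[:, None] \
                 * np.hanning(window.shape[1])[None, :]   # reduce edge leakage
    power = np.abs(np.fft.fftshift(np.fft.fft2(win))) ** 2
    h, w = power.shape
    y, x = np.mgrid[-h // 2:h - h // 2, -w // 2:w - w // 2]
    # Principal axis of the spectrum's second moments; the spectrum of
    # elongated fibres is elongated perpendicular to the fibres.
    mxx = (power * x * x).sum()
    myy = (power * y * y).sum()
    mxy = (power * x * y).sum()
    theta_spec = 0.5 * np.arctan2(2 * mxy, mxx - myy)
    return theta_spec + np.pi / 2  # rotate back to the real-space direction

def alignment_score(image, win=64, step=32, neighbourhood=3):
    """Median order parameter cos(2*dtheta) between each window and the
    windows in its (neighbourhood x neighbourhood) block; 1 = aligned,
    0 = isotropic."""
    rows = range(0, image.shape[0] - win + 1, step)
    cols = range(0, image.shape[1] - win + 1, step)
    angles = np.array([[window_orientation(image[r:r + win, c:c + win])
                        for c in cols] for r in rows])
    half = neighbourhood // 2
    scores = []
    for i in range(half, angles.shape[0] - half):
        for j in range(half, angles.shape[1] - half):
            block = angles[i - half:i + half + 1, j - half:j + half + 1]
            d = block - angles[i, j]          # orientation differences (mod pi)
            scores.append(np.mean(np.cos(2 * d)))
    return float(np.median(scores))
```

With a grayscale image `img` as a NumPy array, `alignment_score(img, win=64, step=32, neighbourhood=5)` returns a value near 1 for strongly aligned fibres and near 0 for isotropic ones; sweeping `win` and `neighbourhood` mimics the parameter search and alignment-decay analysis described above.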
Examination of speech datasets for detecting dementia, collected via various speech tasks, has revealed links between speech and cognitive abilities. However, the speech data available for this research are extremely limited, because collecting speech and baseline data from patients with dementia in clinical settings is expensive. In this paper, we study the spontaneous speech dataset from a recent ADReSS challenge, a Cookie Theft Picture (CTP) dataset with groups of participants balanced in age, gender, and cognitive status. We explore state-of-the-art deep transfer learning techniques from the image, audio, speech, and language domains. One envisioned advantage of transfer learning is that it eliminates the design of handcrafted features specific to each task and dataset. Transfer learning further mitigates the problem of limited dementia-relevant speech data by inheriting knowledge from similar but much larger datasets. Specifically, we built a variety of transfer learning models using the commonly employed MobileNet (image), YAMNet (audio), Mockingjay (speech), and BERT (text) models. Results indicated that the transfer learning models built on text data performed significantly better than those built on audio data. The performance gains of the text models may be due to the high similarity between the pre-training text datasets and the CTP transcripts. Our multi-modal transfer learning introduced only a slight improvement in accuracy, suggesting that audio and text data provide limited complementary information. Multi-task transfer learning resulted in limited improvements in classification and a negative impact in regression. By analyzing the meaning behind the AD/non-AD labels and Mini-Mental State Examination (MMSE) scores, we observed that inconsistency between labels and scores can limit the performance of multi-task learning, especially when the outputs of the single-task models are already highly consistent with the corresponding labels/scores. In sum, we conducted a large comparative analysis of varying transfer learning models, focusing less on model customization and more on the pre-trained models and pre-training datasets. We revealed insightful relations among models, data types, and data labels in this research area.
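As an illustration of the strongest branch, the text models, the sketch below fine-tunes a pre-trained BERT classifier on transcripts using the Hugging Face transformers API. The training loop is deliberately minimal (full-batch, no validation split), and the transcript/label handling is an assumption rather than the challenge's official pipeline.

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)  # AD vs. non-AD

def classify(transcripts, labels, epochs=3, lr=2e-5):
    """Fine-tune BERT on a list of transcript strings and 0/1 labels,
    then return predicted labels. Full-batch for brevity; in practice
    one would mini-batch with a DataLoader and hold out a dev set."""
    enc = tokenizer(transcripts, padding=True, truncation=True,
                    max_length=256, return_tensors="pt")
    y = torch.tensor(labels)
    optim = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        optim.zero_grad()
        out = model(**enc, labels=y)   # cross-entropy loss is built in
        out.loss.backward()
        optim.step()
    model.eval()
    with torch.no_grad():
        logits = model(**enc).logits
    return logits.argmax(dim=-1)       # predicted AD/non-AD per transcript
```

The point of the sketch is how little task-specific feature engineering the pre-trained model requires: the transcripts go in essentially raw, which is the advantage of transfer learning highlighted above.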
Open-source software tools are often used for the analysis of scientific image data due to their flexibility and transparency in dealing with rapidly evolving imaging technologies. The complex nature of image analysis problems frequently requires many tools to be used in conjunction, including tools for image processing and analysis, data processing, machine learning and deep learning, statistical analysis of the results, visualization, correlation to heterogeneous but related data, and more. However, the development, and therefore the application, of these computational tools is impeded by a lack of integration across platforms. Integration of tools goes beyond convenience, as it is impractical for one tool to anticipate and accommodate the current and future needs of every user. This problem is especially pronounced in the field of bioimage analysis, where rapidly emerging methods are quickly adopted by researchers. ImageJ is a popular open-source image analysis platform, with contributions from a global community resulting in hundreds of specialized routines for a wide array of scientific tasks. ImageJ's strength lies in its accessibility and extensibility, allowing researchers to easily extend the software to solve their image analysis tasks. However, ImageJ is not designed for the development of complex end-to-end image analysis workflows; scientists are often forced to create highly specialized and hard-to-reproduce scripts to orchestrate individual software fragments and cover the entire life cycle of an image dataset analysis. KNIME Analytics Platform, a user-friendly workflow system for data integration, analysis, and exploration, was designed to handle huge amounts of heterogeneous data in a platform-agnostic computing environment and has successfully met complex end-to-end demands in several communities, such as cheminformatics and mass spectrometry. Similar needs within the bioimage analysis community led to the creation of the KNIME Image Processing extension, which integrates ImageJ into KNIME Analytics Platform and enables researchers to develop reproducible and scalable workflows that integrate a diverse range of analysis tools. Here we present how users and developers alike can leverage the ImageJ ecosystem via the KNIME Image Processing extension to achieve robust and extensible image analysis within KNIME workflows. We illustrate the benefits of this integration with examples, as well as with representative scientific use cases.
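As a sketch of the integration mechanism (with an illustrative Gaussian-blur operation, not an example from the paper), the ImageJ2 script below declares typed inputs and outputs via SciJava `#@` script parameters. Parameterized scripts and commands of this form run inside ImageJ and can be exposed as nodes with typed ports inside KNIME workflows.

```python
#@ OpService ops
#@ Img input
#@ Double (value=2.0) sigma
#@output Img result

# ImageJ2 script (Jython). The "#@" lines are SciJava script parameters:
# "input" and "sigma" become the node's input port and configuration
# setting, and "result" becomes its output port when the script is
# wrapped by the KNIME Image Processing integration. The Gaussian
# filter is an illustrative choice of operation.
result = ops.filter().gauss(input, sigma)
```

Because the parameters are typed, KNIME can type-check the connections between such nodes, which is part of what makes workflows assembled from ImageJ routines reproducible and composable.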