首页 > 最新文献

SoftwareX最新文献

英文 中文
An-augmenter: A unified platform for efficient image annotation and data augmentation 增强器:用于高效图像注释和数据增强的统一平台
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2026-01-22 DOI: 10.1016/j.softx.2026.102516
Samriddha Das, C. Igathinathane, Xin Sun
The growing reliance on AI and deep learning in vision-based applications requires efficient dataset preparation tools, however, existing solutions are often commercially licensed or lack integrated, multi-format workflows. This study presents An-Augmenter, a cross-platform, open-source software that integrates image annotation and augmentation within an offline environment. It supports YOLO, XML, and JSON formats and ensures annotation-consistent augmentation for labeled and unlabeled datasets. Processing 1200 images with all possible augmentation techniques required 50 s on a standard CPU. Validation using YOLO11n object detection model improved [email protected] from 0.905 to 0.941 on a custom egg dataset and from 0.799 to 0.825 on a public apple dataset, demonstrating improved detection performance with augmented data.
基于视觉的应用越来越依赖人工智能和深度学习,这需要高效的数据集准备工具,然而,现有的解决方案通常是商业许可的,或者缺乏集成的多格式工作流程。本研究提出了一种跨平台的开源软件an - augmenter,它集成了离线环境中的图像注释和增强。它支持YOLO、XML和JSON格式,并确保标记和未标记数据集的注释一致的增强。使用所有可能的增强技术处理1200张图像在标准CPU上需要50秒。使用YOLO11n对象检测模型的验证将[email protected]在自定义鸡蛋数据集上从0.905提高到0.941,在公共苹果数据集上从0.799提高到0.825,展示了增强数据的改进检测性能。
{"title":"An-augmenter: A unified platform for efficient image annotation and data augmentation","authors":"Samriddha Das,&nbsp;C. Igathinathane,&nbsp;Xin Sun","doi":"10.1016/j.softx.2026.102516","DOIUrl":"10.1016/j.softx.2026.102516","url":null,"abstract":"<div><div>The growing reliance on AI and deep learning in vision-based applications requires efficient dataset preparation tools, however, existing solutions are often commercially licensed or lack integrated, multi-format workflows. This study presents An-Augmenter, a cross-platform, open-source software that integrates image annotation and augmentation within an offline environment. It supports YOLO, XML, and JSON formats and ensures annotation-consistent augmentation for labeled and unlabeled datasets. Processing 1200 images with all possible augmentation techniques required 50 s on a standard CPU. Validation using YOLO11n object detection model improved [email protected] from 0.905 to 0.941 on a custom egg dataset and from 0.799 to 0.825 on a public apple dataset, demonstrating improved detection performance with augmented data.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102516"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dakar: A CoinJoin forensic software 达喀尔:CoinJoin的取证软件
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2026-01-21 DOI: 10.1016/j.softx.2026.102523
Michael Herbert Ziegler , Mariusz Nowostawski , Basel Katt
The tension between blockchain transparency and user privacy has driven innovation in mixing protocols creating a need for comprehensive analytical frameworks that can rigorously evaluate privacy properties across different implementations. Dakar is an open-source framework that unifies ingestion and provides reproducible classification and analysis of CoinJoin transactions on UTXO blockchains. Its graph database captures the relationships between mixing transactions while a web interface enables experimentation with built-in privacy tools such as CoinJoin transaction heuristics and similarity measures. By enabling researchers to compare and quantify CoinJoin activity across multiple protocols Dakar facilitates studies on privacy-enhancing techniques and supports the discovery and analysis of differences in CoinJoin implementations.
区块链透明度和用户隐私之间的紧张关系推动了混合协议的创新,从而需要能够严格评估不同实现之间隐私属性的综合分析框架。Dakar是一个开源框架,它统一了摄取,并在UTXO区块链上提供可重复的CoinJoin交易分类和分析。它的图形数据库捕获混合交易之间的关系,而web界面可以使用内置的隐私工具(如CoinJoin交易启发式和相似性度量)进行实验。通过使研究人员能够跨多个协议比较和量化CoinJoin活动,Dakar促进了对隐私增强技术的研究,并支持发现和分析CoinJoin实现中的差异。
{"title":"Dakar: A CoinJoin forensic software","authors":"Michael Herbert Ziegler ,&nbsp;Mariusz Nowostawski ,&nbsp;Basel Katt","doi":"10.1016/j.softx.2026.102523","DOIUrl":"10.1016/j.softx.2026.102523","url":null,"abstract":"<div><div>The tension between blockchain transparency and user privacy has driven innovation in mixing protocols creating a need for comprehensive analytical frameworks that can rigorously evaluate privacy properties across different implementations. Dakar is an open-source framework that unifies ingestion and provides reproducible classification and analysis of CoinJoin transactions on UTXO blockchains. Its graph database captures the relationships between mixing transactions while a web interface enables experimentation with built-in privacy tools such as CoinJoin transaction heuristics and similarity measures. By enabling researchers to compare and quantify CoinJoin activity across multiple protocols Dakar facilitates studies on privacy-enhancing techniques and supports the discovery and analysis of differences in CoinJoin implementations.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102523"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Version [1.2]-[AsymIntervals: A Python library for uncertainty modeling with asymmetric interval numbers] Version [1.2]-[AsymIntervals:用于不对称区间数的不确定性建模的Python库]
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2026-01-21 DOI: 10.1016/j.softx.2026.102518
Wojciech Sałabun , Damian Kedziora , Andrii Shekhovtsov
In this paper, we present an extension of the AsymIntervals library, designed to enhance the modelling and processing of uncertainty using Asymmetric Interval Numbers (AINs). In response to the growing demand for expressive and mathematically consistent tools for interval-based uncertainty representation, the library has been extended with a comprehensive set of interval characteristics, logical predicates, relational operators, and mathematical transformations implemented within a unified core class. The extension introduces support for advanced algebraic, trigonometric, as well as exponential and logarithmic operations, flexible construction of AIN objects from multiple input formats, sampling-based data generation, and normalization of AIN collections. Additionally, enhanced export and serialisation mechanisms enable seamless integration with numerical workflows and scientific applications. These improvements substantially broaden the applicability of AsymIntervals in decision analysis, uncertainty modelling, and computational research.
在本文中,我们提出了AsymIntervals库的扩展,旨在增强使用非对称区间数(ain)的不确定性建模和处理。为了响应对基于区间的不确定性表示的表达性和数学上一致的工具的不断增长的需求,该库已经扩展了一组全面的区间特征、逻辑谓词、关系运算符和在统一的核心类中实现的数学转换。该扩展引入了对高级代数、三角函数以及指数和对数运算的支持,从多种输入格式灵活地构建AIN对象,基于采样的数据生成,以及AIN集合的规范化。此外,增强的导出和序列化机制能够与数字工作流程和科学应用程序无缝集成。这些改进极大地扩展了AsymIntervals在决策分析、不确定性建模和计算研究中的适用性。
{"title":"Version [1.2]-[AsymIntervals: A Python library for uncertainty modeling with asymmetric interval numbers]","authors":"Wojciech Sałabun ,&nbsp;Damian Kedziora ,&nbsp;Andrii Shekhovtsov","doi":"10.1016/j.softx.2026.102518","DOIUrl":"10.1016/j.softx.2026.102518","url":null,"abstract":"<div><div>In this paper, we present an extension of the AsymIntervals library, designed to enhance the modelling and processing of uncertainty using Asymmetric Interval Numbers (AINs). In response to the growing demand for expressive and mathematically consistent tools for interval-based uncertainty representation, the library has been extended with a comprehensive set of interval characteristics, logical predicates, relational operators, and mathematical transformations implemented within a unified core class. The extension introduces support for advanced algebraic, trigonometric, as well as exponential and logarithmic operations, flexible construction of AIN objects from multiple input formats, sampling-based data generation, and normalization of AIN collections. Additionally, enhanced export and serialisation mechanisms enable seamless integration with numerical workflows and scientific applications. These improvements substantially broaden the applicability of AsymIntervals in decision analysis, uncertainty modelling, and computational research.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102518"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ExSMuV: [Ex]ploration software for [S]ummarized [Mu]ltimedia [V]ertical search results ExSMuV: [Ex]搜索软件,用于[S]汇总[Mu]多媒体[V]垂直搜索结果
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2026-01-16 DOI: 10.1016/j.softx.2025.102501
Muhammad Wajeeh Uz Zaman , Umer Rashid , Qaisar Abbas , Abdur Rehman Khan
The proliferation of online multimedia content has transformed user information-seeking behavior from lookup to exploratory search. Existing web search engines present search results in disjoint, linearly ranked search result lists called verticals to bridge the information-exploration gap. However, search results presented by vertical search engines require extensive cognitive effort, hindering users’ ability to explore relevant content across verticals. We propose ExSMuV: [Ex]ploration Software for [S]ummarized [Mu]ltimedia [V]ertical Search Results, a framework that aggregates search results across verticals into coherent multimedia documents based on the most prominent topics, using a customized frequent-term scoring algorithm. Based on the identified important topics, a cosine similarity measure is used to aggregate the top-k similar results across verticals into a multimedia document. These documents combine conceptually similar web, image, and video search results into a comprehensive, unified Search User Interface (SUI) to reduce user navigation effort and improve exploration of relevant search results. We conducted a cognitive user study (N=23) comparing ExSMuV with a Bing vertical search baseline. The proposed framework enabled participants to perform exploratory search tasks with +37 % processing speed, +34 % selective attention, and +41 % better working memory compared to the baseline with statistically significant results (p 0.01).
在线多媒体内容的激增已经将用户的信息搜索行为从查找转变为探索性搜索。现有的网络搜索引擎以不相交的、线性排列的搜索结果列表呈现搜索结果,称为垂直搜索,以弥合信息探索的差距。然而,垂直搜索引擎呈现的搜索结果需要大量的认知努力,阻碍了用户在垂直领域探索相关内容的能力。我们提出了ExSMuV: [Ex] explore Software for [S] summarized [Mu]ltimedia [V] vertical Search Results,这是一个框架,可以根据最突出的主题将垂直搜索结果聚合到连贯的多媒体文档中,使用定制的频繁项评分算法。基于确定的重要主题,使用余弦相似性度量将垂直方向上的top-k相似结果聚合到一个多媒体文档中。这些文档将概念上相似的web、图像和视频搜索结果组合成一个全面、统一的搜索用户界面(search User Interface, SUI),以减少用户导航工作并改进对相关搜索结果的探索。我们进行了一项认知用户研究(N=23),将ExSMuV与必应垂直搜索基线进行比较。与基线相比,所提出的框架使参与者能够以+ 37%的处理速度,+ 34%的选择性注意力和+ 41%的工作记忆进行探索性搜索任务,结果具有统计学意义(p≤0.01)。
{"title":"ExSMuV: [Ex]ploration software for [S]ummarized [Mu]ltimedia [V]ertical search results","authors":"Muhammad Wajeeh Uz Zaman ,&nbsp;Umer Rashid ,&nbsp;Qaisar Abbas ,&nbsp;Abdur Rehman Khan","doi":"10.1016/j.softx.2025.102501","DOIUrl":"10.1016/j.softx.2025.102501","url":null,"abstract":"<div><div>The proliferation of online multimedia content has transformed user information-seeking behavior from lookup to exploratory search. Existing web search engines present search results in disjoint, linearly ranked search result lists called verticals to bridge the information-exploration gap. However, search results presented by vertical search engines require extensive cognitive effort, hindering users’ ability to explore relevant content across verticals. We propose ExSMuV: [Ex]ploration Software for [S]ummarized [Mu]ltimedia [V]ertical Search Results, a framework that aggregates search results across verticals into coherent multimedia documents based on the most prominent topics, using a customized frequent-term scoring algorithm. Based on the identified important topics, a cosine similarity measure is used to aggregate the top-k similar results across verticals into a multimedia document. These documents combine conceptually similar web, image, and video search results into a comprehensive, unified Search User Interface (SUI) to reduce user navigation effort and improve exploration of relevant search results. We conducted a cognitive user study (N=23) comparing ExSMuV with a Bing vertical search baseline. The proposed framework enabled participants to perform exploratory search tasks with +37 % processing speed, +34 % selective attention, and +41 % better working memory compared to the baseline with statistically significant results (p <span><math><mo>≤</mo></math></span> 0.01).</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102501"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HyDistFlow.jl: A unified dynamic hybrid AC/DC power flow package for DER-rich distribution systems HyDistFlow。[j]:一种用于富der配电系统的统一动态混合交直流潮流包
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2025-12-17 DOI: 10.1016/j.softx.2025.102486
Sipeng Luo, Tianyang Zhao, Zhaohong Bie
Modern distribution networks are being transformed into hybrid AC/DC active systems through large-scale deployment of converter-interfaced resources (photovoltaic generators, battery energy storage systems). Existing open-source time-series power flow tools lack unified medium/low-voltage DC modeling, multi-mode converter control, and endogenous multi-interval scheduling, which limits hybrid analysis. To address this gap, the open-source HyDistFlow.jl package (Julia) is introduced for accurate and efficient hybrid distribution studies. A unified component model set for AC, DC, and coupling interfaces is provided, where distributed generation and battery storage systems are explicitly represented. Seven consistent voltage source converter control modes are implemented for AC/DC converters. Endogenous scheduling automatically generates storage charge/discharge profiles under network constraints. Switching topology changes in medium/low-voltage distribution networks are accommodated in an engineering-oriented manner. CPU/GPU heterogeneous computation is enabled for scalable computation. Accuracy has been benchmarked against open-source AC solvers and ETAP hybrid AC/DC results, while functional correctness of all control modes and automatic loss-reducing storage dispatch has been demonstrated in designed studies.
通过大规模部署转换器接口资源(光伏发电机、电池储能系统),现代配电网络正在转变为混合交流/直流有源系统。现有的开源时间序列潮流工具缺乏统一的中低压直流建模、多模式变流器控制和内生多区间调度,限制了混合分析。为了解决这个问题,开源HyDistFlow。介绍了jl包(Julia)的准确和有效的混合分布研究。提供了统一的交流、直流和耦合接口组件模型集,其中分布式发电和电池存储系统显式表示。对交流/直流变换器实现了7种一致电压源变换器控制模式。内生调度在网络约束下自动生成存储充放电配置文件。以工程化的方式适应中低压配电网交换拓扑结构的变化。CPU/GPU异构计算,可扩展计算。准确性已与开源AC求解器和ETAP混合AC/DC结果进行了基准测试,而所有控制模式和自动降低损耗的存储调度的功能正确性已在设计研究中得到验证。
{"title":"HyDistFlow.jl: A unified dynamic hybrid AC/DC power flow package for DER-rich distribution systems","authors":"Sipeng Luo,&nbsp;Tianyang Zhao,&nbsp;Zhaohong Bie","doi":"10.1016/j.softx.2025.102486","DOIUrl":"10.1016/j.softx.2025.102486","url":null,"abstract":"<div><div>Modern distribution networks are being transformed into hybrid AC/DC active systems through large-scale deployment of converter-interfaced resources (photovoltaic generators, battery energy storage systems). Existing open-source time-series power flow tools lack unified medium/low-voltage DC modeling, multi-mode converter control, and endogenous multi-interval scheduling, which limits hybrid analysis. To address this gap, the open-source HyDistFlow.jl package (Julia) is introduced for accurate and efficient hybrid distribution studies. A unified component model set for AC, DC, and coupling interfaces is provided, where distributed generation and battery storage systems are explicitly represented. Seven consistent voltage source converter control modes are implemented for AC/DC converters. Endogenous scheduling automatically generates storage charge/discharge profiles under network constraints. Switching topology changes in medium/low-voltage distribution networks are accommodated in an engineering-oriented manner. CPU/GPU heterogeneous computation is enabled for scalable computation. Accuracy has been benchmarked against open-source AC solvers and ETAP hybrid AC/DC results, while functional correctness of all control modes and automatic loss-reducing storage dispatch has been demonstrated in designed studies.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102486"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancing corrosion detection: A fuzzy expert system with desktop and mobile interfaces 先进的腐蚀检测:具有桌面和移动接口的模糊专家系统
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2025-12-13 DOI: 10.1016/j.softx.2025.102488
Mariana Bárcenas Castañeda , Luis Enrique Calatayud Velázquez , Manuel Sabino Lazo Cortes , Mauricio Gabriel Orozco del Castillo , Víctor Augusto Castellanos Escamilla
Corrosion is a complex phenomenon that deteriorates metal surfaces, generating significant economic and operational challenges across industries. Its assessment often requires expert interpretation of macroscopic damage. This work presents the development and validation of two graphical interfaces for SEAViM-CORR, a fuzzy logic-based expert system for corrosion diagnosis using surface images. The desktop version supports laboratory analysis with image editing tools, while the mobile version enables in situ diagnosis in under 200 ms. Using a dual-output model to identify primary and secondary corrosion mechanisms, the interfaces achieved up to 71.4 % efficiency in seven documented case studies, enhancing interpretability, usability for non-expert users, and applicability in both industrial and field environments.
腐蚀是一种复杂的现象,它会使金属表面恶化,给各行业带来重大的经济和运营挑战。其评估通常需要专家对宏观损伤进行解释。这项工作提出了SEAViM-CORR的两个图形界面的开发和验证,SEAViM-CORR是一个基于模糊逻辑的专家系统,用于使用表面图像进行腐蚀诊断。桌面版支持使用图像编辑工具进行实验室分析,而移动版可在200毫秒内进行原位诊断。使用双输出模型来识别初级和次级腐蚀机制,在七个记录的案例研究中,界面的效率高达71.4%,增强了可解释性,非专业用户的可用性,以及在工业和现场环境中的适用性。
{"title":"Advancing corrosion detection: A fuzzy expert system with desktop and mobile interfaces","authors":"Mariana Bárcenas Castañeda ,&nbsp;Luis Enrique Calatayud Velázquez ,&nbsp;Manuel Sabino Lazo Cortes ,&nbsp;Mauricio Gabriel Orozco del Castillo ,&nbsp;Víctor Augusto Castellanos Escamilla","doi":"10.1016/j.softx.2025.102488","DOIUrl":"10.1016/j.softx.2025.102488","url":null,"abstract":"<div><div>Corrosion is a complex phenomenon that deteriorates metal surfaces, generating significant economic and operational challenges across industries. Its assessment often requires expert interpretation of macroscopic damage. This work presents the development and validation of two graphical interfaces for SEAViM-CORR, a fuzzy logic-based expert system for corrosion diagnosis using surface images. The desktop version supports laboratory analysis with image editing tools, while the mobile version enables in situ diagnosis in under 200 ms. Using a dual-output model to identify primary and secondary corrosion mechanisms, the interfaces achieved up to 71.4 % efficiency in seven documented case studies, enhancing interpretability, usability for non-expert users, and applicability in both industrial and field environments.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102488"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CrossCarry: An R package for the analysis of data from a crossover design with GEE 一个R软件包,用于分析来自具有GEE的交叉设计的数据
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2025-12-17 DOI: 10.1016/j.softx.2025.102482
N.A. Cruz , O.O. Melo , C.A. Martinez , R. Alberich
Crossover designs are widely applied in medicine, agriculture, and other biological sciences, yet their analysis remains challenging due to longitudinal observations within each unit and the presence of carry-over effects. Despite their prevalence, there is no comprehensive R package dedicated to the statistical modeling of crossover data. The CrossCarry package addresses this gap by providing a flexible and open-source framework for analyzing any crossover design with response variables from the exponential family, with or without washout periods. It extends the generalized estimating equations (GEE) methodology by incorporating correlation structures specifically tailored to crossover data, capturing both within- and between-period dependencies. Moreover, CrossCarry integrates a parametric component for treatment effects and a nonparametric spline-based component for time and carry-over effects. This combination allows users to model complex correlation patterns and temporal structures with minimal coding effort. By offering a domain-independent implementation of advanced statistical methodology, CrossCarry facilitates reproducible research and promotes the reuse of robust analytical tools across disciplines. Its potential applications span medical trials, agricultural field experiments, and other areas where crossover designs are essential, thus contributing to broader scientific discovery and cross-domain methodological standardization.
交叉设计广泛应用于医学、农业和其他生物科学,但由于每个单元内的纵向观察和结转效应的存在,交叉设计的分析仍然具有挑战性。尽管它们很流行,但没有一个全面的R包专门用于交叉数据的统计建模。CrossCarry包通过提供一个灵活的开源框架来解决这一问题,该框架可用于分析任何具有指数族响应变量的交叉设计,无论是否有冲刷期。它扩展了广义估计方程(GEE)方法,结合了专门为交叉数据定制的相关结构,捕获了周期内和周期之间的依赖关系。此外,CrossCarry集成了用于处理效果的参数组件和用于时间和延续效应的基于非参数样条的组件。这种组合允许用户用最少的编码工作来建模复杂的相关模式和时间结构。通过提供一个独立于领域的高级统计方法实现,CrossCarry促进了可重复的研究,并促进了跨学科的健壮分析工具的重用。它的潜在应用跨越医学试验、农业现场实验和其他交叉设计必不可少的领域,从而有助于更广泛的科学发现和跨领域的方法标准化。
{"title":"CrossCarry: An R package for the analysis of data from a crossover design with GEE","authors":"N.A. Cruz ,&nbsp;O.O. Melo ,&nbsp;C.A. Martinez ,&nbsp;R. Alberich","doi":"10.1016/j.softx.2025.102482","DOIUrl":"10.1016/j.softx.2025.102482","url":null,"abstract":"<div><div>Crossover designs are widely applied in medicine, agriculture, and other biological sciences, yet their analysis remains challenging due to longitudinal observations within each unit and the presence of carry-over effects. Despite their prevalence, there is no comprehensive <span>R</span> package dedicated to the statistical modeling of crossover data. The <span>CrossCarry</span> package addresses this gap by providing a flexible and open-source framework for analyzing any crossover design with response variables from the exponential family, with or without washout periods. It extends the generalized estimating equations (GEE) methodology by incorporating correlation structures specifically tailored to crossover data, capturing both within- and between-period dependencies. Moreover, <span>CrossCarry</span> integrates a parametric component for treatment effects and a nonparametric spline-based component for time and carry-over effects. This combination allows users to model complex correlation patterns and temporal structures with minimal coding effort. By offering a domain-independent implementation of advanced statistical methodology, <span>CrossCarry</span> facilitates reproducible research and promotes the reuse of robust analytical tools across disciplines. Its potential applications span medical trials, agricultural field experiments, and other areas where crossover designs are essential, thus contributing to broader scientific discovery and cross-domain methodological standardization.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102482"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145797871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local software: Integrated design and geo-computing workflows for urban design 本地软件:城市设计的集成设计和地理计算工作流程
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2025-12-03 DOI: 10.1016/j.softx.2025.102457
Carlos Sandoval Olascoaga , Nicholas de Monchaux
While architects and planners routinely rely on Geospatial Information Systems (GIS) and Computer Aided Design (CAD) tools, both types of tools are infrastructurally incompatible leading to cumbersome workarounds, lack of adoption in practice, and missed opportunities to incorporate the large-scale geographic insights of GIS with the building-scale precision of CAD into a seamless design process. Local Software (LS) inquires into how bringing together CAD and GIS tools and workflows can lead to more sustainable urban design proposals. The framework introduces Site Packages (SP), a cross-platform information model based on GeoJSON that enables seamless integration between design and analysis tools and a new design methodology that connects large scale modeling with small scale design decisions. LS provides a web interface and open-source plugins for Grasshopper and QGIS, that allow designers to parametrically generate networked urban interventions while evaluating their ecological and social impacts through GIS. Case studies have demonstrated that proposals created with the LS framework can replace 88–96 % of traditional stormwater systems at 50 % lower cost of underground work, while enhancing urban resilience, reducing heat island effects, and providing community benefits.
虽然建筑师和规划师通常依赖地理空间信息系统(GIS)和计算机辅助设计(CAD)工具,但这两种工具在基础设施上不兼容,导致了繁琐的解决方案,在实践中缺乏采用,并且错过了将GIS的大规模地理见解与CAD的建筑尺度精度结合到无缝设计过程中的机会。本地软件(LS)探讨如何将CAD和GIS工具和工作流程结合起来,从而产生更可持续的城市设计方案。该框架引入了Site Packages (SP),这是一种基于GeoJSON的跨平台信息模型,可以实现设计和分析工具之间的无缝集成,以及一种将大规模建模与小规模设计决策联系起来的新设计方法。LS为Grasshopper和QGIS提供了一个网络界面和开源插件,允许设计师通过GIS参数化地生成网络城市干预,同时评估其生态和社会影响。案例研究表明,利用LS框架创建的建议可以取代88 - 96%的传统雨水系统,而地下工程成本降低50%,同时增强城市弹性,减少热岛效应,并提供社区效益。
{"title":"Local software: Integrated design and geo-computing workflows for urban design","authors":"Carlos Sandoval Olascoaga ,&nbsp;Nicholas de Monchaux","doi":"10.1016/j.softx.2025.102457","DOIUrl":"10.1016/j.softx.2025.102457","url":null,"abstract":"<div><div>While architects and planners routinely rely on Geospatial Information Systems (GIS) and Computer Aided Design (CAD) tools, both types of tools are infrastructurally incompatible leading to cumbersome workarounds, lack of adoption in practice, and missed opportunities to incorporate the large-scale geographic insights of GIS with the building-scale precision of CAD into a seamless design process. Local Software (LS) inquires into how bringing together CAD and GIS tools and workflows can lead to more sustainable urban design proposals. The framework introduces Site Packages (SP), a cross-platform information model based on GeoJSON that enables seamless integration between design and analysis tools and a new design methodology that connects large scale modeling with small scale design decisions. LS provides a web interface and open-source plugins for Grasshopper and QGIS, that allow designers to parametrically generate networked urban interventions while evaluating their ecological and social impacts through GIS. Case studies have demonstrated that proposals created with the LS framework can replace 88–96 % of traditional stormwater systems at 50 % lower cost of underground work, while enhancing urban resilience, reducing heat island effects, and providing community benefits.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102457"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145693270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhishingWebCollector: Async python library for automated phishing feed collection PhishingWebCollector:用于自动网络钓鱼提要收集的异步python库
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2025-12-05 DOI: 10.1016/j.softx.2025.102463
Damian Frąszczak, Edyta Frąszczak
Website phishing represents a significant cyber threat, where attackers create fraudulent websites that imitate legitimate sites to deceive users. Continuous monitoring and detection of malicious websites are crucial for mitigating this threat. This paper introduces PhishingWebCollector, an open-source Python library designed to simplify the collection and integration of phishing feeds. It is an appropriate tool for real-time blacklist updates, creating historical datasets for research, and serving as a foundation for developing AI-based phishing detection systems. Identifying phishing and spoofed websites helps generate high-quality datasets necessary for training models in automated website classification and threat identification. Leveraging Python’s asyncio, it processes multiple feeds concurrently to achieve optimal performance. Available on PyPI with extensive documentation and examples, PhishingWebCollector offers a resource-efficient solution for cybersecurity professionals and researchers.
网站钓鱼是一种重大的网络威胁,攻击者创建假冒合法网站的欺诈性网站来欺骗用户。持续监控和检测恶意网站对于减轻这种威胁至关重要。本文介绍了PhishingWebCollector,这是一个开源Python库,旨在简化网络钓鱼提要的收集和集成。它是实时黑名单更新、创建历史数据集用于研究的合适工具,也是开发基于人工智能的网络钓鱼检测系统的基础。识别网络钓鱼和欺骗网站有助于生成高质量的数据集,这是在自动网站分类和威胁识别中训练模型所必需的。利用Python的asyncio,它可以并发处理多个提要以实现最佳性能。PhishingWebCollector可在PyPI上提供广泛的文档和示例,为网络安全专业人员和研究人员提供资源高效的解决方案。
{"title":"PhishingWebCollector: Async python library for automated phishing feed collection","authors":"Damian Frąszczak,&nbsp;Edyta Frąszczak","doi":"10.1016/j.softx.2025.102463","DOIUrl":"10.1016/j.softx.2025.102463","url":null,"abstract":"<div><div>Website phishing represents a significant cyber threat, where attackers create fraudulent websites that imitate legitimate sites to deceive users. Continuous monitoring and detection of malicious websites are crucial for mitigating this threat. This paper introduces PhishingWebCollector, an open-source Python library designed to simplify the collection and integration of phishing feeds. It is an appropriate tool for real-time blacklist updates, creating historical datasets for research, and serving as a foundation for developing AI-based phishing detection systems. Identifying phishing and spoofed websites helps generate high-quality datasets necessary for training models in automated website classification and threat identification. Leveraging Python’s asyncio, it processes multiple feeds concurrently to achieve optimal performance. Available on PyPI with extensive documentation and examples, PhishingWebCollector offers a resource-efficient solution for cybersecurity professionals and researchers.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102463"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145693271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LabChain: Enabling reproducible and modular scientific experiments in Python LabChain:在Python中实现可重复和模块化的科学实验
IF 2.4 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2026-02-01 Epub Date: 2026-02-05 DOI: 10.1016/j.softx.2026.102543
Manuel Couto , Javier Parapar , David E. Losada
Python’s flexibility accelerates research prototyping but frequently results in unmaintainable code and duplicated computational effort. The absence of software engineering practices in academic development leads to fragile experiments where even minor modifications require rerunning expensive computations from scratch. LabChain addresses this through a pipeline-and-filter architecture with hash-based caching that automatically identifies and reuses intermediate results. When evaluating multiple classifiers on the same embeddings, the framework computes embeddings once—regardless of how many classifiers are tested. This automatic reuse extends across research teams: if another researcher applies different models to the same preprocessed data, LabChain detects existing results and eliminates redundant computation. Beyond efficiency, the framework’s modular structure reduces technical debt that obscures experimental logic. Pipelines serialize to JSON for reproducibility and distributed execution across computational clusters. A mental health detection case study demonstrates dual impact: computational savings exceeding 12 hours per task with reduced CO2 emissions, alongside substantial scientific improvements—performance gains up to 192.3% in some tasks. These improvements emerged from clearer experimental organization that exposed a critical preprocessing bug hidden in the original monolithic implementation. LabChain proves that software engineering discipline amplifies scientific discovery.
Python的灵活性加速了研究原型,但经常导致代码不可维护和重复的计算工作。在学术开发中缺乏软件工程实践导致了脆弱的实验,即使是很小的修改也需要从头开始重新运行昂贵的计算。LabChain通过基于哈希的缓存自动识别和重用中间结果的管道和过滤器架构解决了这个问题。当评估相同嵌入上的多个分类器时,无论测试了多少个分类器,框架都会计算一次嵌入。这种自动重用扩展到整个研究团队:如果另一个研究人员对相同的预处理数据应用不同的模型,LabChain会检测现有的结果并消除冗余计算。除了效率之外,框架的模块化结构减少了模糊实验逻辑的技术债务。管道序列化为JSON,以实现可重复性和跨计算集群的分布式执行。一项心理健康检测案例研究显示了双重影响:每项任务节省了超过12小时的计算时间,减少了二氧化碳排放,同时取得了重大的科学进步——在某些任务中,性能提高了192.3%。这些改进来自于更清晰的实验组织,它暴露了隐藏在原始单片实现中的关键预处理错误。LabChain证明了软件工程学科放大了科学发现。
{"title":"LabChain: Enabling reproducible and modular scientific experiments in Python","authors":"Manuel Couto ,&nbsp;Javier Parapar ,&nbsp;David E. Losada","doi":"10.1016/j.softx.2026.102543","DOIUrl":"10.1016/j.softx.2026.102543","url":null,"abstract":"<div><div>Python’s flexibility accelerates research prototyping but frequently results in unmaintainable code and duplicated computational effort. The absence of software engineering practices in academic development leads to fragile experiments where even minor modifications require rerunning expensive computations from scratch. LabChain addresses this through a pipeline-and-filter architecture with hash-based caching that automatically identifies and reuses intermediate results. When evaluating multiple classifiers on the same embeddings, the framework computes embeddings once—regardless of how many classifiers are tested. This automatic reuse extends across research teams: if another researcher applies different models to the same preprocessed data, LabChain detects existing results and eliminates redundant computation. Beyond efficiency, the framework’s modular structure reduces technical debt that obscures experimental logic. Pipelines serialize to JSON for reproducibility and distributed execution across computational clusters. A mental health detection case study demonstrates dual impact: computational savings exceeding 12 hours per task with reduced CO<sub>2</sub> emissions, alongside substantial scientific improvements—performance gains up to 192.3% in some tasks. These improvements emerged from clearer experimental organization that exposed a critical preprocessing bug hidden in the original monolithic implementation. LabChain proves that software engineering discipline amplifies scientific discovery.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"33 ","pages":"Article 102543"},"PeriodicalIF":2.4,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146187982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SoftwareX
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1