首页 > 最新文献

Software Impacts最新文献

英文 中文
KNNOR-Reg: A python package for oversampling in imbalanced regression 一个python包,用于不平衡回归中的过采样
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2025-01-03 DOI: 10.1016/j.simpa.2024.100740
Samir Brahim Belhaouari , Ashhadul Islam , Khelil Kassoul , Ala Al-Fuqaha , Abdesselam Bouzerdoum
KNNOR-Reg is a Python package designed to address the challenge of imbalanced regression. While popular Python packages exist for tackling imbalanced classification, support for imbalanced regression remains limited. Imbalanced regression involves the underrepresentation of important ranges within a continuous target variable. KNNOR-Reg implements an oversampling technique that generates synthetic samples through interpolation between minority class samples and their nearest neighbors. The labels for synthetic samples are computed based on the inverse distance-weighted average of the nearest neighbors’ labels. KNNOR-Reg offers a user-friendly and extensible Python implementation for oversampling imbalanced regression data, aiming to reduce regressor bias and enhance model outcomes.
knor - reg是一个Python包,旨在解决不平衡回归的挑战。虽然存在用于处理不平衡分类的流行Python包,但对不平衡回归的支持仍然有限。不平衡回归涉及连续目标变量内重要范围的代表性不足。knor - reg实现了一种过采样技术,通过在少数类样本和它们最近的邻居之间插值来生成合成样本。合成样本的标签是基于最近邻居标签的逆距离加权平均值计算的。knor - reg提供了一个用户友好且可扩展的Python实现,用于对不平衡回归数据进行过采样,旨在减少回归偏差并增强模型结果。
{"title":"KNNOR-Reg: A python package for oversampling in imbalanced regression","authors":"Samir Brahim Belhaouari ,&nbsp;Ashhadul Islam ,&nbsp;Khelil Kassoul ,&nbsp;Ala Al-Fuqaha ,&nbsp;Abdesselam Bouzerdoum","doi":"10.1016/j.simpa.2024.100740","DOIUrl":"10.1016/j.simpa.2024.100740","url":null,"abstract":"<div><div>KNNOR-Reg is a Python package designed to address the challenge of imbalanced regression. While popular Python packages exist for tackling imbalanced classification, support for imbalanced regression remains limited. Imbalanced regression involves the underrepresentation of important ranges within a continuous target variable. KNNOR-Reg implements an oversampling technique that generates synthetic samples through interpolation between minority class samples and their nearest neighbors. The labels for synthetic samples are computed based on the inverse distance-weighted average of the nearest neighbors’ labels. KNNOR-Reg offers a user-friendly and extensible Python implementation for oversampling imbalanced regression data, aiming to reduce regressor bias and enhance model outcomes.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100740"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143139899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AbNumPro: A comprehensive offline toolkit for antibody numbering and antigen-binding region prediction AbNumPro:一个全面的离线工具包,用于抗体编号和抗原结合区预测
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-12-27 DOI: 10.1016/j.simpa.2024.100738
Wenzhen Li , Hongyan Lin , Lvxin Peng , Qianhu Jiang , Yushu Gou , Lu Xie , Jian Huang
Identifying complementary-determining regions (CDRs) and antigen-binding regions (ABRs) requires accurate antibody numbering, which is essential for therapeutic antibody development. AbNumPro is a comprehensive offline toolkit developed for antibody numbering and ABRs prediction, addressing the limitations of existing tools, which often lack comprehensiveness and rely solely on online services. By integrating five established numbering schemes—Kabat, Chothia, IMGT, Aho, and Martin—AbNumPro provides precise delineation of CDRs and ABRs, offering both compatibility with diverse research applications and the assurance of data security.
确定互补决定区(cdr)和抗原结合区(abr)需要准确的抗体编号,这对于治疗性抗体的开发至关重要。AbNumPro是一个全面的离线工具包,用于抗体编号和abr预测,解决了现有工具的局限性,这些工具通常缺乏全面性,仅依赖于在线服务。通过集成五种已建立的编号方案- kabat, Chothia, IMGT, Aho和Martin-AbNumPro提供了cdr和abr的精确描述,提供了与各种研究应用的兼容性和数据安全的保证。
{"title":"AbNumPro: A comprehensive offline toolkit for antibody numbering and antigen-binding region prediction","authors":"Wenzhen Li ,&nbsp;Hongyan Lin ,&nbsp;Lvxin Peng ,&nbsp;Qianhu Jiang ,&nbsp;Yushu Gou ,&nbsp;Lu Xie ,&nbsp;Jian Huang","doi":"10.1016/j.simpa.2024.100738","DOIUrl":"10.1016/j.simpa.2024.100738","url":null,"abstract":"<div><div>Identifying complementary-determining regions (CDRs) and antigen-binding regions (ABRs) requires accurate antibody numbering, which is essential for therapeutic antibody development. AbNumPro is a comprehensive offline toolkit developed for antibody numbering and ABRs prediction, addressing the limitations of existing tools, which often lack comprehensiveness and rely solely on online services. By integrating five established numbering schemes—Kabat, Chothia, IMGT, Aho, and Martin—AbNumPro provides precise delineation of CDRs and ABRs, offering both compatibility with diverse research applications and the assurance of data security.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100738"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143139900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthetic dataset generation system for vehicle detection 车辆检测合成数据集生成系统
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-12-28 DOI: 10.1016/j.simpa.2024.100735
Mihaela Orić , Vlatko Galić , Filip Novoselnik
The success of machine learning models for object detection highly depends on the training data size and quality. Generating synthetic data speeds up the data acquisition process by removing the need for human annotation. Moreover, since annotation is done automatically, there is no room for human error. We present a pipeline that automatically generates and annotates aerial images of vehicles on roads. The pipeline is structured to allow easy adding of various new vehicles and is not limited to cars only. The resolution of the generated images and the level of detail can be modified by changing the output settings.
用于目标检测的机器学习模型的成功在很大程度上取决于训练数据的大小和质量。生成合成数据消除了人工注释的需要,从而加快了数据获取过程。此外,由于注释是自动完成的,因此没有人为错误的余地。我们提出了一个自动生成和标注道路上车辆的航拍图像的管道。该管道的结构允许轻松添加各种新车,而不仅仅局限于汽车。可以通过更改输出设置来修改生成图像的分辨率和细节级别。
{"title":"Synthetic dataset generation system for vehicle detection","authors":"Mihaela Orić ,&nbsp;Vlatko Galić ,&nbsp;Filip Novoselnik","doi":"10.1016/j.simpa.2024.100735","DOIUrl":"10.1016/j.simpa.2024.100735","url":null,"abstract":"<div><div>The success of machine learning models for object detection highly depends on the training data size and quality. Generating synthetic data speeds up the data acquisition process by removing the need for human annotation. Moreover, since annotation is done automatically, there is no room for human error. We present a pipeline that automatically generates and annotates aerial images of vehicles on roads. The pipeline is structured to allow easy adding of various new vehicles and is not limited to cars only. The resolution of the generated images and the level of detail can be modified by changing the output settings.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100735"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143140076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plant diseases classification with Spectral Signature Taxonomy & Analysis Software (SSTAS) 基于光谱特征分类分析软件(SSTAS)的植物病害分类
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2025-03-03 DOI: 10.1016/j.simpa.2025.100744
Hardik Jayswal, Hetvi Desai, Hasti Vakani, Mithil Mistry, Nilesh Dubey
This paper investigates a novel approach to plant disease classification, addressing cases where symptoms are not visually apparent. Traditional machine learning methods, reliant on observable symptoms, face challenges such as limited training data, high costs, and low interpretability. To overcome these limitations, a spectroscopy-based classification technique was developed. Experimental data, collected over 15 months at Anand Agriculture University, Gujarat, and Charotar University Space Research Centre, utilized spectral signatures (400–1000 nm) to detect mango diseases. The SSTAS Software, developed with a fine-tuned deep learning model, Deep-Spectro, demonstrated superior accuracy using an 80:20 training-to-testing ratio, surpassing existing models reported in prior research.
本文研究了植物病害分类的一种新方法,解决了症状不明显的情况。传统的机器学习方法依赖于可观察到的症状,面临着训练数据有限、成本高、可解释性低等挑战。为了克服这些限制,开发了一种基于光谱的分类技术。在古吉拉特邦阿南德农业大学和夏洛塔大学空间研究中心收集了15个多月的实验数据,利用光谱特征(400-1000 nm)检测芒果疾病。SSTAS软件采用微调深度学习模型deep - spectro开发,使用80:20的训练与测试比例显示出卓越的准确性,超过了先前研究报告的现有模型。
{"title":"Plant diseases classification with Spectral Signature Taxonomy & Analysis Software (SSTAS)","authors":"Hardik Jayswal,&nbsp;Hetvi Desai,&nbsp;Hasti Vakani,&nbsp;Mithil Mistry,&nbsp;Nilesh Dubey","doi":"10.1016/j.simpa.2025.100744","DOIUrl":"10.1016/j.simpa.2025.100744","url":null,"abstract":"<div><div>This paper investigates a novel approach to plant disease classification, addressing cases where symptoms are not visually apparent. Traditional machine learning methods, reliant on observable symptoms, face challenges such as limited training data, high costs, and low interpretability. To overcome these limitations, a spectroscopy-based classification technique was developed. Experimental data, collected over 15 months at Anand Agriculture University, Gujarat, and Charotar University Space Research Centre, utilized spectral signatures (400–1000 nm) to detect mango diseases. The SSTAS Software, developed with a fine-tuned deep learning model, Deep-Spectro, demonstrated superior accuracy using an 80:20 training-to-testing ratio, surpassing existing models reported in prior research.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100744"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143563740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PEM-SMC: An algorithm for optimizing model parameters PEM-SMC:一种模型参数优化算法
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-12-13 DOI: 10.1016/j.simpa.2024.100728
Gaofeng Zhu , Qiang Chen , Xiangyu Yu , Cong Xu , Kun Zhang , Yunquan Wang , Wei Gong , Tao Che
Bayesian inference is crucial for optimizing parameters in complex models, but often requires sampling due to high-dimensional, intractable posteriors. Beyond Markov-Chain Monte Carlo (MCMC) methods, Sequential Monte Carlo (SMC) algorithms offer an alternative. This paper introduces a Matlab toolbox for the Particle Evolution Metropolis Sequential Monte Carlo (PEM-SMC) algorithm, which combines the strengths of population-based MCMC and SMC. Two case studies – a complex multi-modal probability and a land surface model – demonstrate the toolbox’s capabilities. This tool is valuable for Bayesian inference across fields like statistics, ecology, hydrology, and land surface processes.
贝叶斯推理对于复杂模型的参数优化至关重要,但由于高维、难以处理的后验,通常需要采样。除了马尔可夫链蒙特卡罗(MCMC)方法之外,顺序蒙特卡罗(SMC)算法提供了另一种选择。本文介绍了粒子进化大都市顺序蒙特卡罗(pemsmc)算法的Matlab工具箱,该算法结合了基于种群的MCMC和SMC的优点。两个案例研究——一个复杂的多模态概率和一个陆地表面模型——展示了工具箱的能力。这个工具对于跨领域的贝叶斯推理很有价值,如统计学、生态学、水文学和陆地表面过程。
{"title":"PEM-SMC: An algorithm for optimizing model parameters","authors":"Gaofeng Zhu ,&nbsp;Qiang Chen ,&nbsp;Xiangyu Yu ,&nbsp;Cong Xu ,&nbsp;Kun Zhang ,&nbsp;Yunquan Wang ,&nbsp;Wei Gong ,&nbsp;Tao Che","doi":"10.1016/j.simpa.2024.100728","DOIUrl":"10.1016/j.simpa.2024.100728","url":null,"abstract":"<div><div>Bayesian inference is crucial for optimizing parameters in complex models, but often requires sampling due to high-dimensional, intractable posteriors. Beyond Markov-Chain Monte Carlo (MCMC) methods, Sequential Monte Carlo (SMC) algorithms offer an alternative. This paper introduces a Matlab toolbox for the Particle Evolution Metropolis Sequential Monte Carlo (PEM-SMC) algorithm, which combines the strengths of population-based MCMC and SMC. Two case studies – a complex multi-modal probability and a land surface model – demonstrate the toolbox’s capabilities. This tool is valuable for Bayesian inference across fields like statistics, ecology, hydrology, and land surface processes.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100728"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143139887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MaSchedule. A multi-agent tool for scheduling problems MaSchedule。用于调度问题的多代理工具
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-12-05 DOI: 10.1016/j.simpa.2024.100726
Joel Antonio Trejo-Sánchez , Candelaria E. Sansores , Francisco J. Hernandez-Lopez , Jonás Velasco , Daniel Fajardo Delgado , Jose Luis Lopez-Martinez , Julio Cesar Ramirez-Pacheco
Several scheduling optimization problems belong to the NP-complete class, including, task scheduling, job shop scheduling, and patient admission. These problems commonly require the development of heuristics approaches to find near-optimal solutions within reasonable timeframes. In this work, we present MaSchedule an open-source multi-agent tool for the design of heuristics for scheduling problems.
有几个调度优化问题属于np完全类,包括任务调度、作业车间调度和病人入院。这些问题通常需要开发启发式方法,以便在合理的时间框架内找到接近最优的解决方案。在这项工作中,我们提出了masschedule一个开源的多代理工具,用于设计启发式调度问题。
{"title":"MaSchedule. A multi-agent tool for scheduling problems","authors":"Joel Antonio Trejo-Sánchez ,&nbsp;Candelaria E. Sansores ,&nbsp;Francisco J. Hernandez-Lopez ,&nbsp;Jonás Velasco ,&nbsp;Daniel Fajardo Delgado ,&nbsp;Jose Luis Lopez-Martinez ,&nbsp;Julio Cesar Ramirez-Pacheco","doi":"10.1016/j.simpa.2024.100726","DOIUrl":"10.1016/j.simpa.2024.100726","url":null,"abstract":"<div><div>Several scheduling optimization problems belong to the NP-complete class, including, task scheduling, job shop scheduling, and patient admission. These problems commonly require the development of heuristics approaches to find near-optimal solutions within reasonable timeframes. In this work, we present <span>MaSchedule</span> an open-source multi-agent tool for the design of heuristics for scheduling problems.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100726"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143139893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
mGFD: CloudGenerator mGFD: CloudGenerator
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-11-28 DOI: 10.1016/j.simpa.2024.100721
Gabriela Pedraza-Jiménez, Gerardo Tinoco-Guerrero, Francisco Javier Domínguez-Mota, José Alberto Guzmán-Torres, José Gerardo Tinoco-Ruiz
This work introduces mGFD: Cloud Generator, a web-based software for generating non-structured clouds of points that is useful in numerical analysis, particularly in applying the Meshless Generalized Finite Difference Method (mGFD). mGFD: CloudGenerator allows to manually define external and internal boundary nodes, using an image as a guide, providing precise control over boundary conditions. It supports image uploads (.png, .jpg, .jpeg) to guide node placement and automatically generates the internal cloud of points. The web-based software is open-source and accessible for research and has been used to produce results in some papers, such as the ones mentioned in this paper.
这项工作介绍了mGFD:云生成器,一个基于网络的软件,用于生成在数值分析中有用的非结构化云的点,特别是在应用无网格广义有限差分法(mGFD)。mGFD: CloudGenerator允许手动定义外部和内部边界节点,使用图像作为指南,提供对边界条件的精确控制。它支持图片上传(.png, .jpg, .jpeg)来引导节点放置,并自动生成内部点云。基于网络的软件是开源的,可用于研究,并已被用于在一些论文中产生结果,例如本文中提到的那些。
{"title":"mGFD: CloudGenerator","authors":"Gabriela Pedraza-Jiménez,&nbsp;Gerardo Tinoco-Guerrero,&nbsp;Francisco Javier Domínguez-Mota,&nbsp;José Alberto Guzmán-Torres,&nbsp;José Gerardo Tinoco-Ruiz","doi":"10.1016/j.simpa.2024.100721","DOIUrl":"10.1016/j.simpa.2024.100721","url":null,"abstract":"<div><div>This work introduces mGFD: Cloud Generator, a web-based software for generating non-structured clouds of points that is useful in numerical analysis, particularly in applying the Meshless Generalized Finite Difference Method (mGFD). mGFD: CloudGenerator allows to manually define external and internal boundary nodes, using an image as a guide, providing precise control over boundary conditions. It supports image uploads (.png, .jpg, .jpeg) to guide node placement and automatically generates the internal cloud of points. The web-based software is open-source and accessible for research and has been used to produce results in some papers, such as the ones mentioned in this paper.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100721"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142756998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
hvarma: Autoregressive moving average model of microtremor H/V spectral ratio hvarma:微颤H/V谱比的自回归移动平均模型
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2025-03-04 DOI: 10.1016/j.simpa.2025.100745
Aleix Seguí , Arantza Ugalde , Juan José Egozcue
hvarma is a Python software for estimating the horizontal-to-vertical (H/V) spectral ratio through seismic ambient vibration measurements. It employs a parametric approach to model the H/V transfer function using an AutoRegressive Moving Average (ARMA) filter. Compared to traditional methods, this technique enhances accuracy and reliability in spectral estimates, determining the ground fundamental resonance frequency with high spectral resolution, which is important for engineering geology projects. The program inverts to find optimal filter coefficients and computes coherence between horizontal and vertical components, generating H/V transfer function visualizations across both negative and positive frequencies. Results are saved as image and text files.
hvarma是一个Python软件,用于通过地震环境振动测量估计水平与垂直(H/V)频谱比。它采用参数化方法使用自回归移动平均(ARMA)滤波器对H/V传递函数建模。与传统方法相比,该技术提高了频谱估计的精度和可靠性,以高光谱分辨率确定了地面基共振频率,对工程地质工程具有重要意义。该程序通过反向查找最佳过滤系数,并计算水平和垂直分量之间的相干性,从而在负频率和正频率上生成H/V传递函数可视化。结果保存为图像和文本文件。
{"title":"hvarma: Autoregressive moving average model of microtremor H/V spectral ratio","authors":"Aleix Seguí ,&nbsp;Arantza Ugalde ,&nbsp;Juan José Egozcue","doi":"10.1016/j.simpa.2025.100745","DOIUrl":"10.1016/j.simpa.2025.100745","url":null,"abstract":"<div><div>hvarma is a Python software for estimating the horizontal-to-vertical (<em>H</em>/<em>V</em>) spectral ratio through seismic ambient vibration measurements. It employs a parametric approach to model the <em>H</em>/<em>V</em> transfer function using an AutoRegressive Moving Average (ARMA) filter. Compared to traditional methods, this technique enhances accuracy and reliability in spectral estimates, determining the ground fundamental resonance frequency with high spectral resolution, which is important for engineering geology projects. The program inverts to find optimal filter coefficients and computes coherence between horizontal and vertical components, generating <em>H</em>/<em>V</em> transfer function visualizations across both negative and positive frequencies. Results are saved as image and text files.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100745"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143580377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breadcrumbs for your Deep Learning Model: Following Provenance Traces with DLProv 你的深度学习模型的面包屑:用dlproof跟踪来源痕迹
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-12-13 DOI: 10.1016/j.simpa.2024.100730
Débora Pina , Liliane Kunstmann , Daniel de Oliveira , Marta Mattoso
To train a Deep Learning (DL) model, a workflow must be executed with four well-defined activities: (i) Acquiring data, (ii) Preprocessing, (iii) Splitting and balancing the dataset, and (iv) Building and training the model. After generating several DL models, they undergo a process called model selection. After being selected, the DL model is put into a production environment to make predictions on new data. One of the challenges in supporting these analyses is related to providing relationships between candidate models, their datasets for train, test, and validation, input data, and other derivations paths. These relationships are also essential for trust, reproducibility, and evolution of the selected model. While existing solutions allow monitoring and analyzing the artifacts generated throughout the DL workflow, they often fail to establish relationships for supporting data derivation within the DL workflow. DLProv is a provenance-centric service to support DL workflow analyses and reproducibility. DLProv captures provenance data and exports provenance graphs for DL model reproducibility. DLProv is W3C PROV compliant, ensuring standardized prospective and retrospective provenance, and enables provenance capture in arbitrary execution frameworks.
为了训练深度学习(DL)模型,工作流必须执行四个定义良好的活动:(i)获取数据,(ii)预处理,(iii)拆分和平衡数据集,以及(iv)构建和训练模型。在生成几个深度学习模型后,它们经历一个称为模型选择的过程。选择DL模型后,将其投入到生产环境中,对新数据进行预测。支持这些分析的挑战之一是提供候选模型、用于训练、测试和验证的数据集、输入数据和其他派生路径之间的关系。这些关系对于所选模型的信任、可再现性和进化也是必不可少的。虽然现有的解决方案允许监视和分析整个DL工作流中生成的工件,但它们通常无法建立支持DL工作流中数据派生的关系。DLProv是一个以来源为中心的服务,支持DL工作流分析和再现性。DLProv捕获来源数据并导出来源图,以实现DL模型的再现性。DLProv是W3C PROV兼容的,确保了标准化的前瞻性和回顾性来源,并支持在任意执行框架中获取来源。
{"title":"Breadcrumbs for your Deep Learning Model: Following Provenance Traces with DLProv","authors":"Débora Pina ,&nbsp;Liliane Kunstmann ,&nbsp;Daniel de Oliveira ,&nbsp;Marta Mattoso","doi":"10.1016/j.simpa.2024.100730","DOIUrl":"10.1016/j.simpa.2024.100730","url":null,"abstract":"<div><div>To train a Deep Learning (DL) model, a workflow must be executed with four well-defined activities: (i) Acquiring data, (ii) Preprocessing, (iii) Splitting and balancing the dataset, and (iv) Building and training the model. After generating several DL models, they undergo a process called model selection. After being selected, the DL model is put into a production environment to make predictions on new data. One of the challenges in supporting these analyses is related to providing relationships between candidate models, their datasets for train, test, and validation, input data, and other derivations paths. These relationships are also essential for trust, reproducibility, and evolution of the selected model. While existing solutions allow monitoring and analyzing the artifacts generated throughout the DL workflow, they often fail to establish relationships for supporting data derivation within the DL workflow. DLProv is a provenance-centric service to support DL workflow analyses and reproducibility. DLProv captures provenance data and exports provenance graphs for DL model reproducibility. DLProv is W3C PROV compliant, ensuring standardized prospective and retrospective provenance, and enables provenance capture in arbitrary execution frameworks.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100730"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143139895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SpatialzOSM: A Python package for supporting the explicit spatialization in the population synthesis process SpatialzOSM:一个Python包,用于支持人口综合过程中的显式空间化
IF 1.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2025-03-01 Epub Date: 2024-12-05 DOI: 10.1016/j.simpa.2024.100724
Bladimir Toaza, Domokos Esztergár-Kiss
SpatialzOSM, a package to spatialize aggregated locations into coordinates, thereby supporting population synthesis processes. This paper addresses the need for high-resolution data while ensuring data privacy. SpatialzOSM features include the generation of coordinates using three random distribution techniques: across zones, along road networks, and within buildings for residential locations. For non-residential locations, the package extracts points of interest from open sources. By leveraging open-source data, SpatialzOSM minimizes the risks of reidentification associated with census and survey datasets, ensuring privacy protection. This package is valuable for researchers and modelers engaged in synthetic population generation for models requiring explicit geographic location data.
SpatialzOSM,一个将聚合位置空间化成坐标的包,从而支持人口合成过程。本文在保证数据隐私的同时解决了对高分辨率数据的需求。SpatialzOSM的特点包括使用三种随机分布技术生成坐标:跨区域、沿道路网络和住宅建筑内。对于非住宅地点,该包从开放资源中提取兴趣点。通过利用开源数据,SpatialzOSM将与人口普查和调查数据集相关的重新识别风险降至最低,确保隐私保护。这个包是有价值的研究人员和建模人员从事合成人口生成模型需要明确的地理位置数据。
{"title":"SpatialzOSM: A Python package for supporting the explicit spatialization in the population synthesis process","authors":"Bladimir Toaza,&nbsp;Domokos Esztergár-Kiss","doi":"10.1016/j.simpa.2024.100724","DOIUrl":"10.1016/j.simpa.2024.100724","url":null,"abstract":"<div><div>SpatialzOSM, a package to spatialize aggregated locations into coordinates, thereby supporting population synthesis processes. This paper addresses the need for high-resolution data while ensuring data privacy. SpatialzOSM features include the generation of coordinates using three random distribution techniques: across zones, along road networks, and within buildings for residential locations. For non-residential locations, the package extracts points of interest from open sources. By leveraging open-source data, SpatialzOSM minimizes the risks of reidentification associated with census and survey datasets, ensuring privacy protection. This package is valuable for researchers and modelers engaged in synthetic population generation for models requiring explicit geographic location data.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100724"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143139894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Software Impacts
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1