首页 > 最新文献

Defense + Commercial Sensing最新文献

英文 中文
A systems theoretic perspective on open architectures for learning systems 从系统论角度看学习系统的开放式架构
Pub Date : 2024-06-06 DOI: 10.1117/12.3012391
Tyler Cody, Peter A. Beling
The advancement of open architecture ecosystems is fundamentally dependent on the interoperability, scalability, and adaptability of their constituent elements. As Machine Learning (ML) systems become increasingly integral to these ecosystems, the need for a systematic approach to engineer, deploy, and re-engineer them grows. This paper presents a novel modeling approach based on recently published, formal, systems-theoretic models of learning systems. These models serve dual purposes: first, they give a theoretical grounding to standards that govern the architecture, functionality, and performance criteria for ML systems; second, they allow for requirements to be specified at various levels of abstraction to ensure the systems are intrinsically aligned with the overall objectives of the open architecture ecosystem they belong to. Through the proposed modeling approach, we demonstrate how the adoption of standardized models can significantly enhance interoperability between disparate machine learning systems and other architectural components. Further, we relate our framework to on-going efforts such as Open Neural Network Exchange (ONNX). We identify how our approach can be used to address limitations in government acquisition processes for ML systems. The proposed systems-theoretic framework provides a structured methodology that contributes to the foundational building blocks for open architecture ecosystems for ML systems, thereby advancing the state-of-the-art in complex system integration.
开放式架构生态系统的发展从根本上取决于其组成元素的互操作性、可扩展性和适应性。随着机器学习(ML)系统日益成为这些生态系统的组成部分,人们越来越需要一种系统化的方法来设计、部署和重新设计这些系统。本文介绍了一种新颖的建模方法,该方法基于最近出版的学习系统的正式系统理论模型。这些模型具有双重目的:首先,它们为管理 ML 系统的架构、功能和性能标准的标准提供了理论基础;其次,它们允许在不同的抽象层次上指定需求,以确保系统在本质上与它们所属的开放架构生态系统的总体目标保持一致。通过建议的建模方法,我们展示了采用标准化模型如何显著增强不同机器学习系统和其他架构组件之间的互操作性。此外,我们还将我们的框架与开放神经网络交换(ONNX)等正在进行的工作联系起来。我们确定了如何利用我们的方法来解决政府在获取机器学习系统过程中的局限性。所提出的系统理论框架提供了一种结构化方法,有助于为 ML 系统的开放式架构生态系统提供基础构件,从而推动复杂系统集成领域的最新发展。
{"title":"A systems theoretic perspective on open architectures for learning systems","authors":"Tyler Cody, Peter A. Beling","doi":"10.1117/12.3012391","DOIUrl":"https://doi.org/10.1117/12.3012391","url":null,"abstract":"The advancement of open architecture ecosystems is fundamentally dependent on the interoperability, scalability, and adaptability of their constituent elements. As Machine Learning (ML) systems become increasingly integral to these ecosystems, the need for a systematic approach to engineer, deploy, and re-engineer them grows. This paper presents a novel modeling approach based on recently published, formal, systems-theoretic models of learning systems. These models serve dual purposes: first, they give a theoretical grounding to standards that govern the architecture, functionality, and performance criteria for ML systems; second, they allow for requirements to be specified at various levels of abstraction to ensure the systems are intrinsically aligned with the overall objectives of the open architecture ecosystem they belong to. Through the proposed modeling approach, we demonstrate how the adoption of standardized models can significantly enhance interoperability between disparate machine learning systems and other architectural components. Further, we relate our framework to on-going efforts such as Open Neural Network Exchange (ONNX). We identify how our approach can be used to address limitations in government acquisition processes for ML systems. The proposed systems-theoretic framework provides a structured methodology that contributes to the foundational building blocks for open architecture ecosystems for ML systems, thereby advancing the state-of-the-art in complex system integration.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"62 s284","pages":"1305808 - 1305808-6"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141377378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of uncertainties in above-water radiometric measurements from hyperspectral and polarimetric imaging 通过高光谱和偏振成像估算水上辐射测量的不确定性
Pub Date : 2024-06-06 DOI: 10.1117/12.3014923
M. Malinowski, Eder Herrera-Estrella, Robert Foster, Jacopo Agagliate, Alex Gilerson
Uncertainties in radiance above the ocean surface are mostly determined by the skylight reflected from the air-water interface. Their accurate characterization is important for the accurate measurements of the water-leaving radiance as well as for the estimation of the impact of these uncertainties on the atmospheric correction of satellite and airborne ocean observations. Uncertainties are affected by the state of the ocean surface dependent on the wind speed and the corresponding reflection coefficient, which can be calculated based on Cox-Munk relationships. These uncertainties were estimated in the hyperspectral mode from shipborne measurements by the Hyperspectral Imager ULTRIS X20 (Cubert, Germany), with a 400-1000 nm wavelength range and a 410x410 pixel resolution. Measurements were taken during a VIIRS Cal/Val cruise in Hawaii area in a broad range of wind speeds 0-10 m/s and at viewing angles 20-60 degrees. In addition, airborne measurements from a helicopter at four different altitudes of 60, 150, 450, and 750 meters were carried out in different parts of Chesapeake Bay to establish a relationship between uncertainties and altitude. For these, a Teledyne DALSA M2450 polarized camera with a filter wheel containing several filters at different spectral bands was used together with the imager to characterize wave slope statistics and to determine uncertainties in measurements of the Stokes vector components and the degree of linear polarization (DoLP). Measurement uncertainties are further compared with simulations.
海洋表面上方辐射度的不确定性主要由空气-水界面反射的天光决定。准确描述这些不确定性对于准确测量离水辐射度以及估计这些不确定性对卫星和机载海洋观测的大气校正的影响非常重要。不确定性受海洋表面状态的影响,取决于风速和相应的反射系数,可根据 Cox-Munk 关系计算。这些不确定性是在高光谱模式下通过高光谱成像仪 ULTRIS X20(德国 Cubert 公司)的船载测量进行估算的,其波长范围为 400-1000 nm,分辨率为 410x410 像素。在夏威夷地区的一次 VIIRS Cal/Val 巡航中进行了测量,风速范围为 0-10 米/秒,视角为 20-60 度。此外,还在切萨皮克湾的不同地区,在 60 米、150 米、450 米和 750 米的四个不同高度,用直升机进行了空中测量,以确定不确定性与高度之间的关系。为此,使用了 Teledyne DALSA M2450 偏振相机和成像仪,该相机带有一个滤光轮,内含不同光谱波段的多个滤光片,用于描述波浪斜率统计特征,并确定斯托克斯矢量分量和线性偏振度(DoLP)测量的不确定性。测量的不确定性还与模拟进行了比较。
{"title":"Estimation of uncertainties in above-water radiometric measurements from hyperspectral and polarimetric imaging","authors":"M. Malinowski, Eder Herrera-Estrella, Robert Foster, Jacopo Agagliate, Alex Gilerson","doi":"10.1117/12.3014923","DOIUrl":"https://doi.org/10.1117/12.3014923","url":null,"abstract":"Uncertainties in radiance above the ocean surface are mostly determined by the skylight reflected from the air-water interface. Their accurate characterization is important for the accurate measurements of the water-leaving radiance as well as for the estimation of the impact of these uncertainties on the atmospheric correction of satellite and airborne ocean observations. Uncertainties are affected by the state of the ocean surface dependent on the wind speed and the corresponding reflection coefficient, which can be calculated based on Cox-Munk relationships. These uncertainties were estimated in the hyperspectral mode from shipborne measurements by the Hyperspectral Imager ULTRIS X20 (Cubert, Germany), with a 400-1000 nm wavelength range and a 410x410 pixel resolution. Measurements were taken during a VIIRS Cal/Val cruise in Hawaii area in a broad range of wind speeds 0-10 m/s and at viewing angles 20-60 degrees. In addition, airborne measurements from a helicopter at four different altitudes of 60, 150, 450, and 750 meters were carried out in different parts of Chesapeake Bay to establish a relationship between uncertainties and altitude. For these, a Teledyne DALSA M2450 polarized camera with a filter wheel containing several filters at different spectral bands was used together with the imager to characterize wave slope statistics and to determine uncertainties in measurements of the Stokes vector components and the degree of linear polarization (DoLP). Measurement uncertainties are further compared with simulations.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"37 1","pages":"1306103 - 1306103-13"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141381947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancements in pathogen detection: the integration of SERS and LSPR technologies in handheld clinical diagnostics 病原体检测的进展:将 SERS 和 LSPR 技术整合到手持式临床诊断中
Pub Date : 2024-06-06 DOI: 10.1117/12.3022222
Sebastian Huelck
Amid the SARS-CoV-2 pandemic, traditional virus detection methods like RT-qPCR face limitations in terms of infrastructure and processing time. This has spurred the development of agile diagnostic technologies, emphasizing non-invasive and rapid testing. Surface-Enhanced Raman Scattering (SERS) and Localized Surface Plasmon Resonance (LSPR) have emerged as promising alternatives. SERS, amplifying Raman signals through metal nanostructures, offers high sensitivity, high specificity, rapid response, qualitative and quantitative analysis enhanced by recent innovations like multiwell-array substrates. Integration with machine learning refines SERS's diagnostic capabilities, enabling rapid and accurate identification of SARS-CoV-2. LSPR, leveraging light-metal nanoparticle interactions, revolutionizes rapid viral detection, especially with the development of portable handheld devices. These devices enable real-time, on-site testing, proving crucial in managing infectious disease outbreaks. Their applications extend beyond SARS-CoV-2, holding potential for various pathogens.
在 SARS-CoV-2 大流行期间,RT-qPCR 等传统病毒检测方法面临着基础设施和处理时间方面的限制。这促使人们开发强调无创和快速检测的敏捷诊断技术。表面增强拉曼散射(SERS)和局部表面等离子体共振(LSPR)已成为前景广阔的替代方法。SERS 通过金属纳米结构放大拉曼信号,具有高灵敏度、高特异性、快速反应、定性和定量分析等特点,并通过多孔阵列基底等最新创新技术得到加强。与机器学习的结合完善了 SERS 的诊断能力,使其能够快速准确地识别 SARS-CoV-2 病毒。LSPR 利用光-金属纳米粒子的相互作用,彻底改变了病毒的快速检测,特别是随着便携式手持设备的发展。这些设备可以进行实时现场检测,对管理传染病爆发至关重要。它们的应用范围超出了 SARS-CoV-2 病毒,对各种病原体都有潜在的检测价值。
{"title":"Advancements in pathogen detection: the integration of SERS and LSPR technologies in handheld clinical diagnostics","authors":"Sebastian Huelck","doi":"10.1117/12.3022222","DOIUrl":"https://doi.org/10.1117/12.3022222","url":null,"abstract":"Amid the SARS-CoV-2 pandemic, traditional virus detection methods like RT-qPCR face limitations in terms of infrastructure and processing time. This has spurred the development of agile diagnostic technologies, emphasizing non-invasive and rapid testing. Surface-Enhanced Raman Scattering (SERS) and Localized Surface Plasmon Resonance (LSPR) have emerged as promising alternatives. SERS, amplifying Raman signals through metal nanostructures, offers high sensitivity, high specificity, rapid response, qualitative and quantitative analysis enhanced by recent innovations like multiwell-array substrates. Integration with machine learning refines SERS's diagnostic capabilities, enabling rapid and accurate identification of SARS-CoV-2. LSPR, leveraging light-metal nanoparticle interactions, revolutionizes rapid viral detection, especially with the development of portable handheld devices. These devices enable real-time, on-site testing, proving crucial in managing infectious disease outbreaks. Their applications extend beyond SARS-CoV-2, holding potential for various pathogens.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"22 13","pages":"130590E - 130590E-6"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141378972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bridging the AI/ML gap with explainable symbolic causal models using information theory 利用信息论建立可解释的符号因果模型,弥合人工智能与人工智能之间的差距
Pub Date : 2024-06-06 DOI: 10.1117/12.3014447
Stuart W. Card
We report favorable preliminary findings of work in progress bridging the Artificial Intelligence (AI) gap between bottom-up data-driven Machine Learning (ML) and top-down conceptually driven symbolic reasoning. Our overall goal is automatic generation, maintenance and utilization of explainable, parsimonious, plausibly causal, probably approximately correct, hybrid symbolic/numeric models of the world, the self and other agents, for prediction, what-if (counter-factual) analysis and control. Our old Evolutionary Learning with Information Theoretic Evaluation of Ensembles (ELITE2) techniques quantify strengths of arbitrary multivariate nonlinear statistical dependencies, prior to discovering forms by which observed variables may drive others. We extend these to apply Granger causality, in terms of conditional Mutual Information (MI), to distinguish causal relationships and find their directions. As MI can reflect one observable driving a second directly or via a mediator, two being driven by a common cause, etc., to untangle the causal graph we will apply Pearl causality with its back- and front-door adjustments and criteria. Initial efforts verified that our information theoretic indices detect causality in noise corrupted data despite complex relationships among hidden variables with chaotic dynamics disturbed by process noise, The next step is to apply these information theoretic filters in Genetic Programming (GP) to reduce the population of discovered statistical dependencies to plausibly causal relationships, represented symbolically for use by a reasoning engine in a cognitive architecture. Success could bring broader generalization, using not just learned patterns but learned general principles, enabling AI/ML based systems to autonomously navigate complex unknown environments and handle “black swans”.
我们报告了在自下而上的数据驱动型机器学习(ML)和自上而下的概念驱动型符号推理之间搭建桥梁的人工智能(AI)工作的初步成果。我们的总体目标是自动生成、维护和利用可解释的、解析的、似因似果的、可能近似正确的世界、自我和其他代理的混合符号/数字模型,用于预测、假设(反事实)分析和控制。在发现观察到的变量可能驱动其他变量的形式之前,我们原有的集合信息论评估进化学习(ELITE2)技术可以量化任意多变量非线性统计依赖关系的强度。我们将其扩展到格兰杰因果关系中,以条件互信息(MI)来区分因果关系并找到其方向。由于 MI 可以反映一个观测变量直接或通过中介驱动另一个观测变量,也可以反映两个观测变量被一个共同原因驱动等,因此,为了理清因果关系图,我们将应用珀尔因果关系及其前后门调整和标准。下一步是在遗传编程(GP)中应用这些信息理论过滤器,将已发现的统计依赖关系减少为可信的因果关系,并用符号表示出来,供认知架构中的推理引擎使用。成功可以带来更广泛的通用性,不仅可以使用学习到的模式,还可以使用学习到的一般原则,从而使基于人工智能/人工智能的系统能够自主导航复杂的未知环境,并处理 "黑天鹅"。
{"title":"Bridging the AI/ML gap with explainable symbolic causal models using information theory","authors":"Stuart W. Card","doi":"10.1117/12.3014447","DOIUrl":"https://doi.org/10.1117/12.3014447","url":null,"abstract":"We report favorable preliminary findings of work in progress bridging the Artificial Intelligence (AI) gap between bottom-up data-driven Machine Learning (ML) and top-down conceptually driven symbolic reasoning. Our overall goal is automatic generation, maintenance and utilization of explainable, parsimonious, plausibly causal, probably approximately correct, hybrid symbolic/numeric models of the world, the self and other agents, for prediction, what-if (counter-factual) analysis and control. Our old Evolutionary Learning with Information Theoretic Evaluation of Ensembles (ELITE2) techniques quantify strengths of arbitrary multivariate nonlinear statistical dependencies, prior to discovering forms by which observed variables may drive others. We extend these to apply Granger causality, in terms of conditional Mutual Information (MI), to distinguish causal relationships and find their directions. As MI can reflect one observable driving a second directly or via a mediator, two being driven by a common cause, etc., to untangle the causal graph we will apply Pearl causality with its back- and front-door adjustments and criteria. Initial efforts verified that our information theoretic indices detect causality in noise corrupted data despite complex relationships among hidden variables with chaotic dynamics disturbed by process noise, The next step is to apply these information theoretic filters in Genetic Programming (GP) to reduce the population of discovered statistical dependencies to plausibly causal relationships, represented symbolically for use by a reasoning engine in a cognitive architecture. Success could bring broader generalization, using not just learned patterns but learned general principles, enabling AI/ML based systems to autonomously navigate complex unknown environments and handle “black swans”.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"23 25","pages":"1305802 - 1305802-4"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141379071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed edge computing for cooperative augmented reality: enhancing mobile sensing capabilities 用于合作增强现实的分布式边缘计算:增强移动传感能力
Pub Date : 2024-06-06 DOI: 10.1117/12.3021841
Cheng-Yu Cheng, Qi Zhao, Cheng-Ying Wu, Yuchen Yang, Muhammad Qureshi, Hang Liu, Genshe Chen
Cooperative Augmented Reality (AR) can provide real-time, immersive, and context-aware situational awareness while enhancing mobile sensing capabilities and benefiting various applications. Distributed edge computing has emerged as an essential paradigm to facilitate cooperative AR. We designed and implemented a distributed system to enable fast, reliable, and scalable cooperative AR. In this paper, we present a novel approach and architecture that integrates advanced sensing, communications, and processing techniques to create such a cooperative AR system and demonstrate its capability with HoloLens and edge servers connected over a wireless network. Our research addresses the challenges of implementing a distributed cooperative AR system capable of capturing data from a multitude of sensors on HoloLens, performing fusion and accurate object recognition, and seamlessly projecting the reconstructed 3D model into the wearer’s field of view. The paper delves into the intricate architecture of the proposed cooperative AR system, detailing its distributed sensing and edge computing components, and the Apache Storm-integrated platform. The implementation encompasses data collection, aggregation, analysis, object recognition, and rendering of 3D models on the HoloLens, all in real-time. The proposed system enhances the AR experience while showcasing the vast potential of distributed edge computing. Our findings illustrate the feasibility and advantages of merging distributed cooperative sensing and edge computing to offer dynamic, immersive AR experiences, paving the way for new applications.
合作式增强现实(AR)可提供实时、身临其境和情境感知的态势感知,同时增强移动传感能力并使各种应用受益。分布式边缘计算已成为促进合作式增强现实的重要模式。我们设计并实施了一个分布式系统,以实现快速、可靠和可扩展的协同 AR。在本文中,我们介绍了一种新颖的方法和架构,该方法和架构集成了先进的传感、通信和处理技术,从而创建了这样一个合作式 AR 系统,并利用通过无线网络连接的 HoloLens 和边缘服务器演示了该系统的能力。我们的研究解决了实施分布式合作 AR 系统的挑战,该系统能够从 HoloLens 上的多个传感器捕获数据,执行融合和准确的物体识别,并将重建的 3D 模型无缝投射到佩戴者的视野中。本文深入探讨了拟议的合作式 AR 系统的复杂架构,详细介绍了其分布式传感和边缘计算组件以及集成 Apache Storm 的平台。实施过程包括数据收集、汇总、分析、对象识别以及在 HoloLens 上实时渲染 3D 模型。拟议的系统增强了 AR 体验,同时展示了分布式边缘计算的巨大潜力。我们的研究结果表明,将分布式协同传感与边缘计算结合起来,提供动态、身临其境的 AR 体验是可行的,并具有优势,从而为新的应用铺平了道路。
{"title":"Distributed edge computing for cooperative augmented reality: enhancing mobile sensing capabilities","authors":"Cheng-Yu Cheng, Qi Zhao, Cheng-Ying Wu, Yuchen Yang, Muhammad Qureshi, Hang Liu, Genshe Chen","doi":"10.1117/12.3021841","DOIUrl":"https://doi.org/10.1117/12.3021841","url":null,"abstract":"Cooperative Augmented Reality (AR) can provide real-time, immersive, and context-aware situational awareness while enhancing mobile sensing capabilities and benefiting various applications. Distributed edge computing has emerged as an essential paradigm to facilitate cooperative AR. We designed and implemented a distributed system to enable fast, reliable, and scalable cooperative AR. In this paper, we present a novel approach and architecture that integrates advanced sensing, communications, and processing techniques to create such a cooperative AR system and demonstrate its capability with HoloLens and edge servers connected over a wireless network. Our research addresses the challenges of implementing a distributed cooperative AR system capable of capturing data from a multitude of sensors on HoloLens, performing fusion and accurate object recognition, and seamlessly projecting the reconstructed 3D model into the wearer’s field of view. The paper delves into the intricate architecture of the proposed cooperative AR system, detailing its distributed sensing and edge computing components, and the Apache Storm-integrated platform. The implementation encompasses data collection, aggregation, analysis, object recognition, and rendering of 3D models on the HoloLens, all in real-time. The proposed system enhances the AR experience while showcasing the vast potential of distributed edge computing. Our findings illustrate the feasibility and advantages of merging distributed cooperative sensing and edge computing to offer dynamic, immersive AR experiences, paving the way for new applications.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"55 1","pages":"130620O - 130620O-11"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141381309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantization to accelerate inference in multimodal 3D object detection 量化加速多模态三维物体检测推理
Pub Date : 2024-06-06 DOI: 10.1117/12.3013702
Billy E. Geerhart, Venkateswara Dasari, Brian Rapp, Peng Wang, Ju Wang, Christopher X. Payne
The Label-Diffusion-LIDAR-Segmentation (LDLS) algorithm uses multi-modal data for enhanced inference of environmental categories. The algorithm segments the Red-Green-Blue (RGB) channels and maps the results to the LIDAR point cloud using matrix calculations to reduce noise. Recent research has developed custom optimization techniques using quantization to accelerate the 3D object detection using LDLS in robotic systems. These optimizations achieve a 3x speedup over the original algorithm, making it possible to deploy the algorithm in real-world applications. The optimizations include quantization for the segmentation inference as well as matrix optimizations for the label diffusion. We will present our results, compare them with the baseline, and discuss their significance in achieving real-time object detection in resource-constrained environments.
标签扩散-激光雷达分割(LDLS)算法利用多模态数据加强对环境类别的推断。该算法对红绿蓝(RGB)通道进行分割,并利用矩阵计算将结果映射到激光雷达点云,以减少噪声。最近的研究开发了使用量化的定制优化技术,以加速机器人系统中使用 LDLS 的 3D 物体检测。这些优化技术将原始算法的速度提高了 3 倍,使该算法在实际应用中的部署成为可能。优化包括分割推理的量化以及标签扩散的矩阵优化。我们将介绍我们的成果,将其与基线进行比较,并讨论它们在资源受限环境中实现实时物体检测的意义。
{"title":"Quantization to accelerate inference in multimodal 3D object detection","authors":"Billy E. Geerhart, Venkateswara Dasari, Brian Rapp, Peng Wang, Ju Wang, Christopher X. Payne","doi":"10.1117/12.3013702","DOIUrl":"https://doi.org/10.1117/12.3013702","url":null,"abstract":"The Label-Diffusion-LIDAR-Segmentation (LDLS) algorithm uses multi-modal data for enhanced inference of environmental categories. The algorithm segments the Red-Green-Blue (RGB) channels and maps the results to the LIDAR point cloud using matrix calculations to reduce noise. Recent research has developed custom optimization techniques using quantization to accelerate the 3D object detection using LDLS in robotic systems. These optimizations achieve a 3x speedup over the original algorithm, making it possible to deploy the algorithm in real-world applications. The optimizations include quantization for the segmentation inference as well as matrix optimizations for the label diffusion. We will present our results, compare them with the baseline, and discuss their significance in achieving real-time object detection in resource-constrained environments.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"25 32","pages":"1305807 - 1305807-10"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141378876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight deep learning model for rapid detection of fabricated ENF signals from audio sources 轻量级深度学习模型,用于快速检测音频源中的伪造 ENF 信号
Pub Date : 2024-06-06 DOI: 10.1117/12.3013456
Adilet Pazylkarim, Deeraj Nagothu, Yu Chen
The rapid advancement of multimedia content editing software tools has made it increasingly easy for malicious actors to manipulate real-time multimedia data streams, encompassing audio and video. Among the notorious cybercrimes, replay attacks have gained widespread prevalence, necessitating the development of more efficient authentication methods for detection. A cutting-edge authentication technique leverages Electrical Network Frequency (ENF) signals embedded within multimedia content. ENF signals offer a range of advantageous attributes, including uniqueness, unpredictability, and total randomness, rendering them highly effective for detecting replay attacks. To counter potential attackers who may seek to deceive detection systems by embedding fake ENF signals, this study harnesses the growing accessibility of deep Convolutional Neural Networks (CNNs). These CNNs are not only deployable on platforms with limited computational resources, such as Single-Board Computers (SBCs), but they also exhibit the capacity to swiftly identify interference within a signal by learning distinctive spatio-temporal patterns. In this paper, we explore applying a Computationally Efficient Deep Learning Model (CEDM) as a powerful tool for rapidly detecting potential fabrications within ENF signals originating from diverse audio sources. Our experimental study validates the effectiveness of the proposed method.
随着多媒体内容编辑软件工具的快速发展,恶意行为者越来越容易操纵包括音频和视频在内的实时多媒体数据流。在臭名昭著的网络犯罪中,重放攻击已变得十分普遍,因此有必要开发更有效的验证方法来进行检测。一种先进的认证技术利用了多媒体内容中嵌入的电网络频率(ENF)信号。ENF 信号具有一系列优势属性,包括唯一性、不可预测性和完全随机性,因此在检测重放攻击方面非常有效。潜在的攻击者可能会通过嵌入伪造的 ENF 信号来欺骗检测系统,为了应对这种情况,本研究利用了日益普及的深度卷积神经网络(CNN)。这些 CNN 不仅可以部署在单板计算机 (SBC) 等计算资源有限的平台上,还能通过学习独特的时空模式迅速识别信号中的干扰。在本文中,我们探讨了如何将计算高效深度学习模型(CEDM)作为一种强大的工具,用于快速检测来自不同音频源的 ENF 信号中的潜在伪造信号。我们的实验研究验证了所提方法的有效性。
{"title":"A lightweight deep learning model for rapid detection of fabricated ENF signals from audio sources","authors":"Adilet Pazylkarim, Deeraj Nagothu, Yu Chen","doi":"10.1117/12.3013456","DOIUrl":"https://doi.org/10.1117/12.3013456","url":null,"abstract":"The rapid advancement of multimedia content editing software tools has made it increasingly easy for malicious actors to manipulate real-time multimedia data streams, encompassing audio and video. Among the notorious cybercrimes, replay attacks have gained widespread prevalence, necessitating the development of more efficient authentication methods for detection. A cutting-edge authentication technique leverages Electrical Network Frequency (ENF) signals embedded within multimedia content. ENF signals offer a range of advantageous attributes, including uniqueness, unpredictability, and total randomness, rendering them highly effective for detecting replay attacks. To counter potential attackers who may seek to deceive detection systems by embedding fake ENF signals, this study harnesses the growing accessibility of deep Convolutional Neural Networks (CNNs). These CNNs are not only deployable on platforms with limited computational resources, such as Single-Board Computers (SBCs), but they also exhibit the capacity to swiftly identify interference within a signal by learning distinctive spatio-temporal patterns. In this paper, we explore applying a Computationally Efficient Deep Learning Model (CEDM) as a powerful tool for rapidly detecting potential fabrications within ENF signals originating from diverse audio sources. Our experimental study validates the effectiveness of the proposed method.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"18 23","pages":"1305811 - 1305811-13"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141378890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI agile assistant 生成式人工智能敏捷助手
Pub Date : 2024-06-06 DOI: 10.1117/12.3011173
Darrell L. Young, Perry Boyette, James Moreland, Jason Teske
Large Language Models (LLMs) provide new capabilities to rapidly reform, regroup; and reskill for new missions, opportunities, and respond to an ever-changing operational landscape. Agile contracts can enable larger flow of value in new development contexts. These methods of engagement and partnership enable the establishment of high performing teams through the forming, storming, norming, and performing stages that then inform the best liberating structures that exceed traditional rigid hierarchical models or even established mission engineering methods. Use of Generative AI based on LLMs coupled with modern agile model-based engineering in design allows for automated requirements decomposition trained in the lingua franca of the development team and translation to the dialects of other domain disciplines with the business acumen afforded by proven approaches in industry. Cutting-edge AI automations to track and adapt knowledge, skills, and abilities across ever changing jobs and roles will be illustrated using prevailing architecture frameworks, model-based system engineering, simulation, and decision-making assisted approaches to emergent objectives.
大型语言模型(LLM)提供了新的能力,可针对新任务、新机遇进行快速改革、重组和再培训,并应对不断变化的业务环境。敏捷合同可以在新的发展环境中实现更大的价值流。这些参与和合作方法能够通过组建、风暴、规范和执行阶段建立高绩效团队,然后为最佳解放结构提供信息,这些结构超越了传统的僵化等级模式,甚至超越了既定的任务工程方法。在设计中使用基于 LLMs 的生成式人工智能与基于模型的现代敏捷工程相结合,就能以开发团队的通用语言进行自动需求分解,并将其翻译成其他领域学科的方言,同时还能利用行业中成熟方法所提供的商业敏锐度。我们将利用流行的架构框架、基于模型的系统工程、仿真和决策辅助方法来说明在不断变化的工作和角色中跟踪和调整知识、技能和能力的尖端人工智能自动化。
{"title":"Generative AI agile assistant","authors":"Darrell L. Young, Perry Boyette, James Moreland, Jason Teske","doi":"10.1117/12.3011173","DOIUrl":"https://doi.org/10.1117/12.3011173","url":null,"abstract":"Large Language Models (LLMs) provide new capabilities to rapidly reform, regroup; and reskill for new missions, opportunities, and respond to an ever-changing operational landscape. Agile contracts can enable larger flow of value in new development contexts. These methods of engagement and partnership enable the establishment of high performing teams through the forming, storming, norming, and performing stages that then inform the best liberating structures that exceed traditional rigid hierarchical models or even established mission engineering methods. Use of Generative AI based on LLMs coupled with modern agile model-based engineering in design allows for automated requirements decomposition trained in the lingua franca of the development team and translation to the dialects of other domain disciplines with the business acumen afforded by proven approaches in industry. Cutting-edge AI automations to track and adapt knowledge, skills, and abilities across ever changing jobs and roles will be illustrated using prevailing architecture frameworks, model-based system engineering, simulation, and decision-making assisted approaches to emergent objectives.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"11 1","pages":"1305809 - 1305809-12"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141378795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active detection for fish species recognition in underwater environments 水下环境中鱼类物种识别的主动探测
Pub Date : 2024-06-06 DOI: 10.1117/12.3013344
Chiranjibi Shah, M. M. Nabi, S. Y. Alaba, Ryan Caillouet, Jack H. Prior, Matthew Campbell, Matthew D. Grossi, Farron Wallace, John E. Ball, Robert J. Moorhead
Fish species must be identified for stock assessments, ecosystem monitoring, production management, and the conservation of endangered species. Implementing algorithms for fish species detection in underwater settings like the Gulf of Mexico poses a formidable challenge. Active learning, a method that efficiently identifies informative samples for annotation while staying within a budget, has demonstrated its effectiveness in the context of object detection in recent times. In this study, we present an active detection model designed for fish species recognition in underwater environments. This model can be employed as an object detection system to effectively lower the expense associated with manual annotation. It uses epistemic uncertainty with Evidential Deep Learning (EDL) and proposes a novel module denoted as Model Evidence Head (MEH) for fish species detection in underwater environments. It employs Hierarchical Uncertainty Aggregation (HUA) to obtain the informativeness of an image. We conducted experiments using a fine-grained and extensive dataset of reef fish collected from the Gulf of Mexico, specifically the Southeast Area Monitoring and Assessment Program Dataset 2021 (SEAMAPD21). The experimental results demonstrate that an active detection framework achieves better detection performance on the SEAMAPD21 dataset demonstrating a favorable balance between performance and data efficiency for underwater fish species recognition.
鱼类种群评估、生态系统监测、生产管理和濒危物种保护必须识别鱼类物种。在墨西哥湾等水下环境中实施鱼类物种检测算法是一项艰巨的挑战。主动学习是一种在预算范围内有效识别信息样本进行注释的方法,近来已在物体检测方面显示出其有效性。在本研究中,我们提出了一种主动检测模型,用于识别水下环境中的鱼类物种。该模型可用作物体检测系统,有效降低人工标注的相关费用。该模型利用证据深度学习(EDL)的认识不确定性,并提出了一个用于水下环境鱼类物种检测的新模块,称为 "模型证据头"(MEH)。它采用层次不确定性聚合(HUA)来获取图像的信息量。我们使用从墨西哥湾收集到的大量细粒度珊瑚礁鱼类数据集进行了实验,特别是东南地区监测和评估计划数据集 2021(SEAMAPD21)。实验结果表明,主动检测框架在 SEAMAPD21 数据集上实现了更好的检测性能,在水下鱼类物种识别的性能和数据效率之间实现了良好的平衡。
{"title":"Active detection for fish species recognition in underwater environments","authors":"Chiranjibi Shah, M. M. Nabi, S. Y. Alaba, Ryan Caillouet, Jack H. Prior, Matthew Campbell, Matthew D. Grossi, Farron Wallace, John E. Ball, Robert J. Moorhead","doi":"10.1117/12.3013344","DOIUrl":"https://doi.org/10.1117/12.3013344","url":null,"abstract":"Fish species must be identified for stock assessments, ecosystem monitoring, production management, and the conservation of endangered species. Implementing algorithms for fish species detection in underwater settings like the Gulf of Mexico poses a formidable challenge. Active learning, a method that efficiently identifies informative samples for annotation while staying within a budget, has demonstrated its effectiveness in the context of object detection in recent times. In this study, we present an active detection model designed for fish species recognition in underwater environments. This model can be employed as an object detection system to effectively lower the expense associated with manual annotation. It uses epistemic uncertainty with Evidential Deep Learning (EDL) and proposes a novel module denoted as Model Evidence Head (MEH) for fish species detection in underwater environments. It employs Hierarchical Uncertainty Aggregation (HUA) to obtain the informativeness of an image. We conducted experiments using a fine-grained and extensive dataset of reef fish collected from the Gulf of Mexico, specifically the Southeast Area Monitoring and Assessment Program Dataset 2021 (SEAMAPD21). The experimental results demonstrate that an active detection framework achieves better detection performance on the SEAMAPD21 dataset demonstrating a favorable balance between performance and data efficiency for underwater fish species recognition.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"5 15","pages":"130610D - 130610D-10"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141379707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application specificity of data for pre-training in computer vision 计算机视觉预训练数据的应用特殊性
Pub Date : 2024-06-06 DOI: 10.1117/12.3013088
Gabriel Peters, Scott Couwenhoven, Derek Walvoord, Carl Salvaggio
In an era of immense data generation, unlocking the full potential of Machine Learning (ML) hinges on overcoming the limitations posed by the scarcity of labeled data. In Computer Vision (CV) research, algorithm design must consider this shift and focus instead on the abundance of unlabeled imagery. In recent years, there has been a notable trend within the community toward Self-Supervised Learning (SSL) methods that can leverage this untapped data pool. ML practice promotes self-supervised pre-training for generalized feature extraction on a diverse unlabeled dataset followed by supervised transfer learning on a smaller set of labeled, application-specific images. This shift in learning methods elicits conversation about the importance of pre-training data composition for optimizing downstream performance. We evaluate models with varying measures of similarity between pre-training and transfer learning data compositions. Our findings indicate that front-end embeddings sufficiently generalize learned image features independent of data composition, leaving transfer learning to inject the majority of application-specific understanding into the model. Composition may be irrelevant in self-supervised pre-training, suggesting target data is a primary driver of application specificity. Thus, pre-training deep learning models with application-specific data, which is often difficult to acquire, is not necessary for reaching competitive downstream performance. The capability to pre-train on more accessible datasets invites more flexibility in practical deep learning.
在一个产生大量数据的时代,要充分释放机器学习(ML)的潜力,关键在于克服标记数据稀缺所带来的限制。在计算机视觉(CV)研究中,算法设计必须考虑到这一转变,转而关注大量无标记图像。近年来,业界出现了一种明显的趋势,即采用自监督学习(SSL)方法来利用这一尚未开发的数据池。ML 实践提倡在一个多样化的未标记数据集上进行自我监督预训练,以进行通用特征提取,然后在一个较小的已标记的特定应用图像集上进行监督迁移学习。学习方法的这种转变引起了关于预训练数据组成对优化下游性能的重要性的讨论。我们评估了预训练和迁移学习数据组成之间不同相似度的模型。我们的研究结果表明,前端嵌入能充分泛化所学图像特征,而不受数据组成的影响,因此迁移学习能为模型注入大部分特定应用的理解。在自我监督的预训练中,组成可能无关紧要,这表明目标数据是应用特异性的主要驱动力。因此,用特定应用数据对深度学习模型进行预训练(通常很难获取),对于达到有竞争力的下游性能并非必要。在更容易获取的数据集上进行预训练的能力为实际深度学习带来了更大的灵活性。
{"title":"Application specificity of data for pre-training in computer vision","authors":"Gabriel Peters, Scott Couwenhoven, Derek Walvoord, Carl Salvaggio","doi":"10.1117/12.3013088","DOIUrl":"https://doi.org/10.1117/12.3013088","url":null,"abstract":"In an era of immense data generation, unlocking the full potential of Machine Learning (ML) hinges on overcoming the limitations posed by the scarcity of labeled data. In Computer Vision (CV) research, algorithm design must consider this shift and focus instead on the abundance of unlabeled imagery. In recent years, there has been a notable trend within the community toward Self-Supervised Learning (SSL) methods that can leverage this untapped data pool. ML practice promotes self-supervised pre-training for generalized feature extraction on a diverse unlabeled dataset followed by supervised transfer learning on a smaller set of labeled, application-specific images. This shift in learning methods elicits conversation about the importance of pre-training data composition for optimizing downstream performance. We evaluate models with varying measures of similarity between pre-training and transfer learning data compositions. Our findings indicate that front-end embeddings sufficiently generalize learned image features independent of data composition, leaving transfer learning to inject the majority of application-specific understanding into the model. Composition may be irrelevant in self-supervised pre-training, suggesting target data is a primary driver of application specificity. Thus, pre-training deep learning models with application-specific data, which is often difficult to acquire, is not necessary for reaching competitive downstream performance. The capability to pre-train on more accessible datasets invites more flexibility in practical deep learning.","PeriodicalId":178341,"journal":{"name":"Defense + Commercial Sensing","volume":"3 13","pages":"1305803 - 1305803-14"},"PeriodicalIF":0.0,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141380302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Defense + Commercial Sensing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1