首页 > 最新文献

arXiv - CS - Computational Engineering, Finance, and Science最新文献

英文 中文
Inverse design of Non-parameterized Ventilated Acoustic Resonator via Variational Autoencoder with Acoustic Response-encoded Latent Space 通过声学响应编码潜空间变异自动编码器反向设计非参数化通风声学谐振器
Pub Date : 2024-08-12 DOI: arxiv-2408.05917
Min Woo Cho, Seok Hyeon Hwang, Jun-Young Jang, Jin Yeong Song, Sun-kwang Hwang, Kyoung Je Cha, Dong Yong Park, Kyungjun Song, Sang Min Park
Ventilated acoustic resonator(VAR), a type of acoustic metamaterial, emergeas an alternative for sound attenuation in environments that requireventilation, owing to its excellent low-frequency attenuation performance andflexible shape adaptability. However, due to the non-linear acoustic responsesof VARs, the VAR designs are generally obtained within a limited parametrizeddesign space, and the design relies on the iteration of the numericalsimulation which consumes a considerable amount of computational time andresources. This paper proposes an acoustic response-encoded variationalautoencoder (AR-VAE), a novel variational autoencoder-based generative designmodel for the efficient and accurate inverse design of VAR even withnon-parametrized designs. The AR-VAE matches the high-dimensional acousticresponse with the VAR cross-section image in the dimension-reduced latentspace, which enables the AR-VAE to generate various non-parametrized VARcross-section images with the target acoustic response. AR-VAE generatesnon-parameterized VARs from target acoustic responses, which show a 25-foldreduction in mean squared error compared to conventional deep learning-basedparameter searching methods while exhibiting lower average mean squared errorand peak frequency variance. By combining the inverse-designed VARs by AR-VAE,multi-cavity VAR was devised for broadband and multitarget peak frequencyattenuation. The proposed design method presents a new approach for structuralinverse-design with a high-dimensional non-linear physical response.
通风声共振(VAR)是一种声学超材料,由于其出色的低频衰减性能和灵活的形状适应性,在需要通风的环境中成为一种声音衰减的替代方法。然而,由于 VAR 的非线性声学响应,VAR 的设计通常只能在有限的参数化设计空间内获得,而且设计依赖于数值模拟的迭代,这将消耗大量的计算时间和资源。本文提出了一种声学响应编码变异自动编码器(AR-VAE),这是一种基于变异自动编码器的新型生成式设计模型,即使在非参数化设计的情况下也能高效、准确地进行 VAR 反设计。AR-VAE 将高维声学响应与降维潜在空间中的 VAR 横截面图像相匹配,从而使 AR-VAE 能够生成具有目标声学响应的各种非参数化 VAR 横截面图像。与基于深度学习的传统参数搜索方法相比,AR-VAE 从目标声学响应生成的非参数化 VAR 的均方误差降低了 25 倍,同时平均均方误差和峰值频率方差也更低。通过 AR-VAE 将逆向设计的 VAR 组合在一起,设计出了多腔 VAR,用于宽带和多目标峰值频率衰减。所提出的设计方法为具有高维非线性物理响应的结构逆设计提供了一种新方法。
{"title":"Inverse design of Non-parameterized Ventilated Acoustic Resonator via Variational Autoencoder with Acoustic Response-encoded Latent Space","authors":"Min Woo Cho, Seok Hyeon Hwang, Jun-Young Jang, Jin Yeong Song, Sun-kwang Hwang, Kyoung Je Cha, Dong Yong Park, Kyungjun Song, Sang Min Park","doi":"arxiv-2408.05917","DOIUrl":"https://doi.org/arxiv-2408.05917","url":null,"abstract":"Ventilated acoustic resonator(VAR), a type of acoustic metamaterial, emerge\u0000as an alternative for sound attenuation in environments that require\u0000ventilation, owing to its excellent low-frequency attenuation performance and\u0000flexible shape adaptability. However, due to the non-linear acoustic responses\u0000of VARs, the VAR designs are generally obtained within a limited parametrized\u0000design space, and the design relies on the iteration of the numerical\u0000simulation which consumes a considerable amount of computational time and\u0000resources. This paper proposes an acoustic response-encoded variational\u0000autoencoder (AR-VAE), a novel variational autoencoder-based generative design\u0000model for the efficient and accurate inverse design of VAR even with\u0000non-parametrized designs. The AR-VAE matches the high-dimensional acoustic\u0000response with the VAR cross-section image in the dimension-reduced latent\u0000space, which enables the AR-VAE to generate various non-parametrized VAR\u0000cross-section images with the target acoustic response. AR-VAE generates\u0000non-parameterized VARs from target acoustic responses, which show a 25-fold\u0000reduction in mean squared error compared to conventional deep learning-based\u0000parameter searching methods while exhibiting lower average mean squared error\u0000and peak frequency variance. By combining the inverse-designed VARs by AR-VAE,\u0000multi-cavity VAR was devised for broadband and multitarget peak frequency\u0000attenuation. The proposed design method presents a new approach for structural\u0000inverse-design with a high-dimensional non-linear physical response.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HyperCAN: Hypernetwork-Driven Deep Parameterized Constitutive Models for Metamaterials HyperCAN:超网络驱动的超材料深度参数化构造模型
Pub Date : 2024-08-12 DOI: arxiv-2408.06017
Li Zheng, Dennis M. Kochmann, Siddhant Kumar
We introduce HyperCAN, a machine learning framework that utilizeshypernetworks to construct adaptable constitutive artificial neural networksfor a wide range of beam-based metamaterials exhibiting diverse mechanicalbehavior under finite deformations. HyperCAN integrates an input convex networkthat models the nonlinear stress-strain map of a truss lattice, while ensuringadherence to fundamental mechanics principles, along with a hypernetwork thatdynamically adjusts the parameters of the convex network as a function of thelattice topology and geometry. This unified framework demonstrates robustgeneralization in predicting the mechanical behavior of previously unseenmetamaterial designs and loading scenarios well beyond the training domain. Weshow how HyperCAN can be integrated into multiscale simulations to accuratelycapture the highly nonlinear responses of large-scale truss metamaterials,closely matching fully resolved simulations while significantly reducingcomputational costs. This offers new efficient opportunities for the multiscaledesign and optimization of truss metamaterials.
我们介绍的 HyperCAN 是一种机器学习框架,它利用超网络为各种基于梁的超材料构建可适应的构成性人工神经网络,这些超材料在有限变形条件下表现出不同的力学行为。HyperCAN 集成了一个输入凸网络和一个超网络,前者对桁架晶格的非线性应力应变图进行建模,同时确保符合基本力学原理,后者可根据晶格的拓扑结构和几何形状动态调整凸网络的参数。这个统一的框架在预测以前从未见过的材料设计和加载场景的力学行为方面具有强大的通用性,远远超出了训练领域。我们展示了如何将 HyperCAN 集成到多尺度模拟中,以准确捕捉大规模桁架超材料的高度非线性响应,使其与全解析模拟密切匹配,同时显著降低计算成本。这为桁架超材料的多尺度设计和优化提供了新的高效机会。
{"title":"HyperCAN: Hypernetwork-Driven Deep Parameterized Constitutive Models for Metamaterials","authors":"Li Zheng, Dennis M. Kochmann, Siddhant Kumar","doi":"arxiv-2408.06017","DOIUrl":"https://doi.org/arxiv-2408.06017","url":null,"abstract":"We introduce HyperCAN, a machine learning framework that utilizes\u0000hypernetworks to construct adaptable constitutive artificial neural networks\u0000for a wide range of beam-based metamaterials exhibiting diverse mechanical\u0000behavior under finite deformations. HyperCAN integrates an input convex network\u0000that models the nonlinear stress-strain map of a truss lattice, while ensuring\u0000adherence to fundamental mechanics principles, along with a hypernetwork that\u0000dynamically adjusts the parameters of the convex network as a function of the\u0000lattice topology and geometry. This unified framework demonstrates robust\u0000generalization in predicting the mechanical behavior of previously unseen\u0000metamaterial designs and loading scenarios well beyond the training domain. We\u0000show how HyperCAN can be integrated into multiscale simulations to accurately\u0000capture the highly nonlinear responses of large-scale truss metamaterials,\u0000closely matching fully resolved simulations while significantly reducing\u0000computational costs. This offers new efficient opportunities for the multiscale\u0000design and optimization of truss metamaterials.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An anisotropic, brittle damage model for finite strains with a generic damage tensor regularization 采用通用损伤张量正则化的各向异性有限应变脆性损伤模型
Pub Date : 2024-08-12 DOI: arxiv-2408.06140
Tim van der Velden, Stefanie Reese, Hagen Holthusen, Tim Brepols
This paper establishes a universal framework for the nonlocal modeling ofanisotropic damage at finite strains. By the combination of two recent works,the new framework allows for the flexible incorporation of differentestablished hyperelastic finite strain material formulations into anisotropicdamage whilst ensuring mesh-independent results by employing a generic set ofmicromorphic gradient-extensions. First, the anisotropic damage model,generally satisfying the damage growth criterion, is investigated for thespecific choice of a Neo-Hookean material on a single element. Next, the modelis applied with different gradient-extensions in structural simulations of anasymmetrically notched specimen to identify an efficient choice in the form ofa volumetric-deviatoric regularization. Thereafter, the universal framework,which is without loss of generality here specified for a Neo-Hookean materialwith a volumetric-deviatoric gradient-extension, successfully serves for thecomplex simulation of a pressure loaded rotor blade. After acceptance of the manuscript, we make the codes of the materialsubroutines accessible to the public athttps://doi.org/10.5281/zenodo.11171630.
本文为有限应变下各向异性损伤的非局部建模建立了一个通用框架。新框架结合了两篇最新研究成果,允许将不同的超弹性有限应变材料公式灵活地纳入各向异性损伤模型,同时通过采用一组通用的微形态梯度扩展,确保得到与网格无关的结果。首先,研究了在单个元素上特定选择新胡肯材料的各向异性损伤模型,该模型一般满足损伤增长准则。接着,在对非对称缺口试样进行结构模拟时,将该模型与不同的梯度扩展结合使用,以确定一种有效的体积-偏差正则化形式。此后,在不失一般性的前提下,我们针对具有体积偏差梯度伸长的新胡克式材料建立了通用框架,并成功地应用于压力加载转子叶片的复杂模拟。稿件被接受后,我们将在https://doi.org/10.5281/zenodo.11171630 上公开材料程序代码。
{"title":"An anisotropic, brittle damage model for finite strains with a generic damage tensor regularization","authors":"Tim van der Velden, Stefanie Reese, Hagen Holthusen, Tim Brepols","doi":"arxiv-2408.06140","DOIUrl":"https://doi.org/arxiv-2408.06140","url":null,"abstract":"This paper establishes a universal framework for the nonlocal modeling of\u0000anisotropic damage at finite strains. By the combination of two recent works,\u0000the new framework allows for the flexible incorporation of different\u0000established hyperelastic finite strain material formulations into anisotropic\u0000damage whilst ensuring mesh-independent results by employing a generic set of\u0000micromorphic gradient-extensions. First, the anisotropic damage model,\u0000generally satisfying the damage growth criterion, is investigated for the\u0000specific choice of a Neo-Hookean material on a single element. Next, the model\u0000is applied with different gradient-extensions in structural simulations of an\u0000asymmetrically notched specimen to identify an efficient choice in the form of\u0000a volumetric-deviatoric regularization. Thereafter, the universal framework,\u0000which is without loss of generality here specified for a Neo-Hookean material\u0000with a volumetric-deviatoric gradient-extension, successfully serves for the\u0000complex simulation of a pressure loaded rotor blade. After acceptance of the manuscript, we make the codes of the material\u0000subroutines accessible to the public at\u0000https://doi.org/10.5281/zenodo.11171630.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Heterogeneous Computation Resource Allocation based on Data-driven Method 基于数据驱动方法的异构计算资源分配研究
Pub Date : 2024-08-11 DOI: arxiv-2408.05671
Xirui Tang, Zeyu Wang, Xiaowei Cai, Honghua Su, Changsong Wei
The rapid development of the mobile Internet and the Internet of Things isleading to a diversification of user devices and the emergence of new mobileapplications on a regular basis. Such applications include those that arecomputationally intensive, such as pattern recognition, interactive gaming,virtual reality, and augmented reality. However, the computing and energyresources available on the user's equipment are limited, which presents achallenge in effectively supporting such demanding applications. In this work,we propose a heterogeneous computing resource allocation model based on adata-driven approach. The model first collects and analyzes historical workloaddata at scale, extracts key features, and builds a detailed data set. Then, adata-driven deep neural network is used to predict future resourcerequirements. Based on the prediction results, the model adopts a dynamicadjustment and optimization resource allocation strategy. This strategy notonly fully considers the characteristics of different computing resources, butalso accurately matches the requirements of various tasks, and realizes dynamicand flexible resource allocation, thereby greatly improving the overallperformance and resource utilization of the system. Experimental results showthat the proposed method is significantly better than the traditional resourceallocation method in a variety of scenarios, demonstrating its excellentaccuracy and adaptability.
移动互联网和物联网的快速发展导致用户设备的多样化和新移动应用的不断涌现。这些应用包括模式识别、互动游戏、虚拟现实和增强现实等计算密集型应用。然而,用户设备上可用的计算和能源资源有限,这给有效支持此类高要求应用带来了挑战。在这项工作中,我们提出了一种基于数据驱动方法的异构计算资源分配模型。该模型首先大规模收集和分析历史工作负载数据,提取关键特征,建立详细的数据集。然后,使用数据驱动的深度神经网络预测未来的资源需求。根据预测结果,模型采用动态调整和优化资源分配策略。该策略不仅充分考虑了不同计算资源的特点,而且准确匹配了各种任务的需求,实现了动态灵活的资源分配,从而大大提高了系统的整体性能和资源利用率。实验结果表明,所提出的方法在各种场景下都明显优于传统的资源分配方法,证明了其出色的准确性和适应性。
{"title":"Research on Heterogeneous Computation Resource Allocation based on Data-driven Method","authors":"Xirui Tang, Zeyu Wang, Xiaowei Cai, Honghua Su, Changsong Wei","doi":"arxiv-2408.05671","DOIUrl":"https://doi.org/arxiv-2408.05671","url":null,"abstract":"The rapid development of the mobile Internet and the Internet of Things is\u0000leading to a diversification of user devices and the emergence of new mobile\u0000applications on a regular basis. Such applications include those that are\u0000computationally intensive, such as pattern recognition, interactive gaming,\u0000virtual reality, and augmented reality. However, the computing and energy\u0000resources available on the user's equipment are limited, which presents a\u0000challenge in effectively supporting such demanding applications. In this work,\u0000we propose a heterogeneous computing resource allocation model based on a\u0000data-driven approach. The model first collects and analyzes historical workload\u0000data at scale, extracts key features, and builds a detailed data set. Then, a\u0000data-driven deep neural network is used to predict future resource\u0000requirements. Based on the prediction results, the model adopts a dynamic\u0000adjustment and optimization resource allocation strategy. This strategy not\u0000only fully considers the characteristics of different computing resources, but\u0000also accurately matches the requirements of various tasks, and realizes dynamic\u0000and flexible resource allocation, thereby greatly improving the overall\u0000performance and resource utilization of the system. Experimental results show\u0000that the proposed method is significantly better than the traditional resource\u0000allocation method in a variety of scenarios, demonstrating its excellent\u0000accuracy and adaptability.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A reduced simulation applied to viscoelastic fatigue of polymers using a time multi-scale approach based on Partition of Unity method 使用基于统一分割法的时间多尺度方法,对聚合物的粘弹性疲劳进行简化模拟
Pub Date : 2024-08-09 DOI: arxiv-2408.05143
Sebastian Rodriguez, Angelo Pasquale, Jad Mounayer, Diego Canales, Marianne Beringhier, Chady Ghnatios, Amine Ammar, Francisco Chinesta
The simulation of viscoelastic time-evolution problems described by a largenumber of internal variables and with a large spectrum of relaxation timesrequires high computational resources for their resolution. Furthermore, theinternal variables evolution is described by a set of linear differentialequations which involves many time scales. In this context, the use of aspace-time PGD approximation is proposed here to boost their resolution, wherethe temporal functions are constructed following a multi-scale strategy alongwith the Partition of Unity method, in order to catch each dynamic efficiently.The feasibility and the robustness of the method are discussed in the case of apolymer in a non-equilibrium state under cyclic loading.
粘弹性时间演化问题由大量内部变量和大量弛豫时间谱描述,模拟这些问题需要大量计算资源来解决。此外,内部变量的演变是由一组线性微分方程描述的,涉及许多时间尺度。在这种情况下,本文提出使用空间时间 PGD 近似来提高其分辨率,其中的时间函数是根据多尺度策略和统一分割法构建的,以便有效地捕捉每个动态。
{"title":"A reduced simulation applied to viscoelastic fatigue of polymers using a time multi-scale approach based on Partition of Unity method","authors":"Sebastian Rodriguez, Angelo Pasquale, Jad Mounayer, Diego Canales, Marianne Beringhier, Chady Ghnatios, Amine Ammar, Francisco Chinesta","doi":"arxiv-2408.05143","DOIUrl":"https://doi.org/arxiv-2408.05143","url":null,"abstract":"The simulation of viscoelastic time-evolution problems described by a large\u0000number of internal variables and with a large spectrum of relaxation times\u0000requires high computational resources for their resolution. Furthermore, the\u0000internal variables evolution is described by a set of linear differential\u0000equations which involves many time scales. In this context, the use of a\u0000space-time PGD approximation is proposed here to boost their resolution, where\u0000the temporal functions are constructed following a multi-scale strategy along\u0000with the Partition of Unity method, in order to catch each dynamic efficiently.\u0000The feasibility and the robustness of the method are discussed in the case of a\u0000polymer in a non-equilibrium state under cyclic loading.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FiST-Financial Style Transfer with Hallucination and Creativity Control Framework FiST--金融风格转移与幻觉和创造力控制框架
Pub Date : 2024-08-09 DOI: arxiv-2408.05365
Sohini Roychowdhury, Marko Krema, Brian Moore, Xingjian Lai, Dike Effedua, Bharat Jethwani
Financial report generation using general purpose large language models posetwo major challenges, including the lack of compound sentences andhallucinations. Advanced prompt engineering and retrieval augmented generation(RAG) techniques are incapable of curing the writing style discrepancies. Inthis work we propose a novel two-stage fine-tuning process wherein publicdomain financial reports are processed into prompt-completions and augmentedusing simple LLM prompts to then enable sectional financial report generationusing minimal instructions and tabular data inputs. Our proposed fine-tuningframework results doubles the number of correct questions answers and reduceshallucinations by over 50%. Additionally, the two-stage fine tuned models havelower perplexity, improved ROUGE, TER and BLEU scores, higher creativity andknowledge density with lower uncertainty and cross entropy.
使用通用大型语言模型生成财务报告面临两大挑战,包括缺乏复合句和幻觉。先进的提示工程和检索增强生成(RAG)技术无法解决写作风格差异问题。在这项工作中,我们提出了一种新颖的两阶段微调流程,将公共领域的财务报告处理为提示完成语,并使用简单的 LLM 提示语进行增强,然后使用最少的指令和表格数据输入生成分节财务报告。我们提出的微调框架使问题答案的正确率提高了一倍,减少了 50% 以上的误解。此外,经过两阶段微调的模型具有更低的困惑度,更高的 ROUGE、TER 和 BLEU 分数,更高的创造力和知识密度,以及更低的不确定性和交叉熵。
{"title":"FiST-Financial Style Transfer with Hallucination and Creativity Control Framework","authors":"Sohini Roychowdhury, Marko Krema, Brian Moore, Xingjian Lai, Dike Effedua, Bharat Jethwani","doi":"arxiv-2408.05365","DOIUrl":"https://doi.org/arxiv-2408.05365","url":null,"abstract":"Financial report generation using general purpose large language models pose\u0000two major challenges, including the lack of compound sentences and\u0000hallucinations. Advanced prompt engineering and retrieval augmented generation\u0000(RAG) techniques are incapable of curing the writing style discrepancies. In\u0000this work we propose a novel two-stage fine-tuning process wherein public\u0000domain financial reports are processed into prompt-completions and augmented\u0000using simple LLM prompts to then enable sectional financial report generation\u0000using minimal instructions and tabular data inputs. Our proposed fine-tuning\u0000framework results doubles the number of correct questions answers and reduces\u0000hallucinations by over 50%. Additionally, the two-stage fine tuned models have\u0000lower perplexity, improved ROUGE, TER and BLEU scores, higher creativity and\u0000knowledge density with lower uncertainty and cross entropy.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Audio-visual cross-modality knowledge transfer for machine learning-based in-situ monitoring in laser additive manufacturing 基于机器学习的激光增材制造原位监测视听跨模态知识转移
Pub Date : 2024-08-09 DOI: arxiv-2408.05307
Jiarui Xie, Mutahar Safdar, Lequn Chen, Seung Ki Moon, Yaoyao Fiona Zhao
Various machine learning (ML)-based in-situ monitoring systems have beendeveloped to detect laser additive manufacturing (LAM) process anomalies anddefects. Multimodal fusion can improve in-situ monitoring performance byacquiring and integrating data from multiple modalities, including visual andaudio data. However, multimodal fusion employs multiple sensors of differenttypes, which leads to higher hardware, computational, and operational costs.This paper proposes a cross-modality knowledge transfer (CMKT) methodology thattransfers knowledge from a source to a target modality for LAM in-situmonitoring. CMKT enhances the usefulness of the features extracted from thetarget modality during the training phase and removes the sensors of the sourcemodality during the prediction phase. This paper proposes three CMKT methods:semantic alignment, fully supervised mapping, and semi-supervised mapping.Semantic alignment establishes a shared encoded space between modalities tofacilitate knowledge transfer. It utilizes a semantic alignment loss to alignthe distributions of the same classes (e.g., visual defective and audiodefective classes) and a separation loss to separate the distributions ofdifferent classes (e.g., visual defective and audio defect-free classes). Thetwo mapping methods transfer knowledge by deriving the features of one modalityfrom the other modality using fully supervised and semi-supervised learning.The proposed CMKT methods were implemented and compared with multimodalaudio-visual fusion in an LAM in-situ anomaly detection case study. Thesemantic alignment method achieves a 98.4% accuracy while removing the audiomodality during the prediction phase, which is comparable to the accuracy ofmultimodal fusion (98.2%).
目前已开发出多种基于机器学习(ML)的原位监测系统,用于检测激光增材制造(LAM)过程的异常和缺陷。多模态融合可以通过获取和整合多种模态的数据(包括视觉和音频数据)来提高原位监测性能。本文提出了一种跨模态知识转移(CMKT)方法,将知识从源模态转移到目标模态,用于 LAM 现场监测。在训练阶段,CMKT 增强了从目标模态提取的特征的实用性,并在预测阶段消除了源模态的传感器。本文提出了三种 CMKT 方法:语义对齐、完全监督映射和半监督映射。它利用语义对齐损失来对齐相同类别(如视觉缺陷和听觉缺陷类别)的分布,并利用分离损失来分离不同类别(如视觉缺陷和听觉无缺陷类别)的分布。这两种映射方法通过使用完全监督和半监督学习从一种模态得出另一种模态的特征,从而实现知识转移。在预测阶段去除音频模式的同时,语义对齐方法的准确率达到了 98.4%,与多模式融合的准确率(98.2%)相当。
{"title":"Audio-visual cross-modality knowledge transfer for machine learning-based in-situ monitoring in laser additive manufacturing","authors":"Jiarui Xie, Mutahar Safdar, Lequn Chen, Seung Ki Moon, Yaoyao Fiona Zhao","doi":"arxiv-2408.05307","DOIUrl":"https://doi.org/arxiv-2408.05307","url":null,"abstract":"Various machine learning (ML)-based in-situ monitoring systems have been\u0000developed to detect laser additive manufacturing (LAM) process anomalies and\u0000defects. Multimodal fusion can improve in-situ monitoring performance by\u0000acquiring and integrating data from multiple modalities, including visual and\u0000audio data. However, multimodal fusion employs multiple sensors of different\u0000types, which leads to higher hardware, computational, and operational costs.\u0000This paper proposes a cross-modality knowledge transfer (CMKT) methodology that\u0000transfers knowledge from a source to a target modality for LAM in-situ\u0000monitoring. CMKT enhances the usefulness of the features extracted from the\u0000target modality during the training phase and removes the sensors of the source\u0000modality during the prediction phase. This paper proposes three CMKT methods:\u0000semantic alignment, fully supervised mapping, and semi-supervised mapping.\u0000Semantic alignment establishes a shared encoded space between modalities to\u0000facilitate knowledge transfer. It utilizes a semantic alignment loss to align\u0000the distributions of the same classes (e.g., visual defective and audio\u0000defective classes) and a separation loss to separate the distributions of\u0000different classes (e.g., visual defective and audio defect-free classes). The\u0000two mapping methods transfer knowledge by deriving the features of one modality\u0000from the other modality using fully supervised and semi-supervised learning.\u0000The proposed CMKT methods were implemented and compared with multimodal\u0000audio-visual fusion in an LAM in-situ anomaly detection case study. The\u0000semantic alignment method achieves a 98.4% accuracy while removing the audio\u0000modality during the prediction phase, which is comparable to the accuracy of\u0000multimodal fusion (98.2%).","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The impact of internal variability on benchmarking deep learning climate emulators 内部变异对深度学习气候模拟器基准测试的影响
Pub Date : 2024-08-09 DOI: arxiv-2408.05288
Björn Lütjens, Raffaele Ferrari, Duncan Watson-Parris, Noelle Selin
Full-complexity Earth system models (ESMs) are computationally veryexpensive, limiting their use in exploring the climate outcomes of multipleemission pathways. More efficient emulators that approximate ESMs can directlymap emissions onto climate outcomes, and benchmarks are being used to evaluatetheir accuracy on standardized tasks and datasets. We investigate a popularbenchmark in data-driven climate emulation, ClimateBench, on which deeplearning-based emulators are currently achieving the best performance. Weimplement a linear regression-based emulator, akin to pattern scaling, and findthat it outperforms the incumbent 100M-parameter deep learning foundationmodel, ClimaX, on 3 out of 4 regionally-resolved surface-level climatevariables. While emulating surface temperature is expected to be predominantlylinear, this result is surprising for emulating precipitation. We identify thatthis outcome is a result of high levels of internal variability in thebenchmark targets. To address internal variability, we update the benchmarktargets with ensemble averages from the MPI-ESM1.2-LR model that contain 50instead of 3 climate simulations per emission pathway. Using the new targets,we show that linear pattern scaling continues to be more accurate ontemperature, but can be outperformed by a deep learning-based model foremulating precipitation. We publish our code, data, and an interactive tutorialat github.com/blutjens/climate-emulator.
全复杂地球系统模型(ESM)的计算成本非常高,限制了它们在探索多种排放路径的气候结果方面的应用。近似 ESM 的更高效模拟器可以直接将排放映射到气候结果上,目前正在使用基准来评估它们在标准化任务和数据集上的准确性。我们研究了数据驱动气候模拟的一个流行基准--ClimateBench,基于深度学习的模拟器目前在该基准上取得了最佳性能。我们实施了一个基于线性回归的仿真器,类似于模式缩放,发现它在 4 个区域分辨地表级气候变量中的 3 个上优于现有的 1 亿参数深度学习基础模型 ClimaX。虽然模拟地表温度预计会占主导地位,但这一结果在模拟降水方面却出人意料。我们发现,这一结果是由于基准目标的内部变异水平较高造成的。为了解决内部变异问题,我们用 MPI-ESM1.2-LR 模型的集合平均值更新了基准目标,每个排放途径包含 50 个而不是 3 个气候模拟。使用新目标后,我们发现线性模式缩放对温度的影响仍然更准确,但基于深度学习的降水预测模型的表现可能会更好。我们在 github.com/blutjens/climate-emulator 上发布了我们的代码、数据和互动教程。
{"title":"The impact of internal variability on benchmarking deep learning climate emulators","authors":"Björn Lütjens, Raffaele Ferrari, Duncan Watson-Parris, Noelle Selin","doi":"arxiv-2408.05288","DOIUrl":"https://doi.org/arxiv-2408.05288","url":null,"abstract":"Full-complexity Earth system models (ESMs) are computationally very\u0000expensive, limiting their use in exploring the climate outcomes of multiple\u0000emission pathways. More efficient emulators that approximate ESMs can directly\u0000map emissions onto climate outcomes, and benchmarks are being used to evaluate\u0000their accuracy on standardized tasks and datasets. We investigate a popular\u0000benchmark in data-driven climate emulation, ClimateBench, on which deep\u0000learning-based emulators are currently achieving the best performance. We\u0000implement a linear regression-based emulator, akin to pattern scaling, and find\u0000that it outperforms the incumbent 100M-parameter deep learning foundation\u0000model, ClimaX, on 3 out of 4 regionally-resolved surface-level climate\u0000variables. While emulating surface temperature is expected to be predominantly\u0000linear, this result is surprising for emulating precipitation. We identify that\u0000this outcome is a result of high levels of internal variability in the\u0000benchmark targets. To address internal variability, we update the benchmark\u0000targets with ensemble averages from the MPI-ESM1.2-LR model that contain 50\u0000instead of 3 climate simulations per emission pathway. Using the new targets,\u0000we show that linear pattern scaling continues to be more accurate on\u0000temperature, but can be outperformed by a deep learning-based model for\u0000emulating precipitation. We publish our code, data, and an interactive tutorial\u0000at github.com/blutjens/climate-emulator.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The LATIN-PGD methodology to nonlinear dynamics and quasi-brittle materials for future earthquake engineering applications 针对非线性动力学和准脆性材料的 LATIN-PGD 方法在未来地震工程中的应用
Pub Date : 2024-08-09 DOI: arxiv-2408.05108
Sebastian Rodriguez, Pierre-Etienne Charbonnel, Pierre Ladevèze, David Néron
This paper presents a first implementation of the LArge Time INcrement(LATIN) method along with the model reduction technique called ProperGeneralized Decomposition (PGD) for solving nonlinear low-frequency dynamicsproblems when dealing with a quasi-brittle isotropic damage constitutiverelations. The present paper uses the Time-Discontinuous Galerkin Method (TDGM)for computing the temporal contributions of the space-time separate-variablessolution of the LATIN-PGD approach, which offers several advantages whenconsidering a high number of DOFs in time. The efficiency of the method istested for the case of a 3D bending beam, where results and benchmarkscomparing LATIN-PGD to classical time-incremental Newmark/Quasi-Newtonnonlinear solver are presented. This work represents a first step towardstaking into account uncertainties and carrying out more complex parametricstudies imposed by seismic risk assessment.
本文首次提出了LArge Time INcrement (LATIN)方法与称为ProperGeneralized Decomposition (PGD)的模型缩减技术,用于求解准脆性各向同性损伤构成参数时的非线性低频动力学问题。本文使用时间-非连续伽勒金方法(TDGM)计算 LATIN-PGD 方法的时空分离无变量求解的时间贡献。以三维弯曲梁为例,测试了该方法的效率,并给出了将 LATIN-PGD 与经典的时间递增纽马克/准牛顿非线性求解器进行比较的结果和基准。这项工作是考虑不确定性和开展地震风险评估所要求的更复杂参数研究的第一步。
{"title":"The LATIN-PGD methodology to nonlinear dynamics and quasi-brittle materials for future earthquake engineering applications","authors":"Sebastian Rodriguez, Pierre-Etienne Charbonnel, Pierre Ladevèze, David Néron","doi":"arxiv-2408.05108","DOIUrl":"https://doi.org/arxiv-2408.05108","url":null,"abstract":"This paper presents a first implementation of the LArge Time INcrement\u0000(LATIN) method along with the model reduction technique called Proper\u0000Generalized Decomposition (PGD) for solving nonlinear low-frequency dynamics\u0000problems when dealing with a quasi-brittle isotropic damage constitutive\u0000relations. The present paper uses the Time-Discontinuous Galerkin Method (TDGM)\u0000for computing the temporal contributions of the space-time separate-variables\u0000solution of the LATIN-PGD approach, which offers several advantages when\u0000considering a high number of DOFs in time. The efficiency of the method is\u0000tested for the case of a 3D bending beam, where results and benchmarks\u0000comparing LATIN-PGD to classical time-incremental Newmark/Quasi-Newton\u0000nonlinear solver are presented. This work represents a first step towards\u0000taking into account uncertainties and carrying out more complex parametric\u0000studies imposed by seismic risk assessment.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning from Human Feedback for Lane Changing of Autonomous Vehicles in Mixed Traffic 混合交通中自动驾驶车辆变道的人机反馈强化学习
Pub Date : 2024-08-08 DOI: arxiv-2408.04447
Yuting Wang, Lu Liu, Maonan Wang, Xi Xiong
The burgeoning field of autonomous driving necessitates the seamlessintegration of autonomous vehicles (AVs) with human-driven vehicles, callingfor more predictable AV behavior and enhanced interaction with human drivers.Human-like driving, particularly during lane-changing maneuvers on highways, isa critical area of research due to its significant impact on safety and trafficflow. Traditional rule-based decision-making approaches often fail toencapsulate the nuanced boundaries of human behavior in diverse drivingscenarios, while crafting reward functions for learning-based methodsintroduces its own set of complexities. This study investigates the applicationof Reinforcement Learning from Human Feedback (RLHF) to emulate human-likelane-changing decisions in AVs. An initial RL policy is pre-trained to ensuresafe lane changes. Subsequently, this policy is employed to gather data, whichis then annotated by humans to train a reward model that discerns lane changesaligning with human preferences. This human-informed reward model supersedesthe original, guiding the refinement of the policy to reflect human-likepreferences. The effectiveness of RLHF in producing human-like lane changes isdemonstrated through the development and evaluation of conservative andaggressive lane-changing models within obstacle-rich environments and mixedautonomy traffic scenarios. The experimental outcomes underscore the potentialof RLHF to diversify lane-changing behaviors in AVs, suggesting its viabilityfor enhancing the integration of AVs into the fabric of human-driven traffic.
蓬勃发展的自动驾驶领域要求自动驾驶汽车(AV)与人类驾驶的车辆无缝集成,这就要求自动驾驶汽车的行为更具可预测性,并加强与人类驾驶员的互动。仿人驾驶,尤其是在高速公路上的变道操作过程中,由于对安全和交通流量具有重大影响,因此是一个关键的研究领域。传统的基于规则的决策方法往往无法囊括不同驾驶场景中人类行为的细微界限,而为基于学习的方法设计奖励函数也带来了一系列复杂问题。本研究探讨了应用 "人类反馈强化学习"(RLHF)在自动驾驶汽车中模拟人类可能改变的决策。对初始 RL 策略进行了预训练,以确保安全变道。随后,利用该策略收集数据,再由人类对数据进行注释,从而训练出一个奖励模型,用于识别符合人类偏好的车道变更。这种由人类提供信息的奖励模型取代了原来的奖励模型,指导政策的改进,以反映类似人类的偏好。通过在障碍物丰富的环境和混合自动驾驶交通场景中开发和评估保守型和激进型变道模型,证明了 RLHF 在产生类人变道方面的有效性。实验结果凸显了 RLHF 在使自动驾驶汽车变道行为多样化方面的潜力,表明其在促进自动驾驶汽车融入人类驾驶交通结构方面的可行性。
{"title":"Reinforcement Learning from Human Feedback for Lane Changing of Autonomous Vehicles in Mixed Traffic","authors":"Yuting Wang, Lu Liu, Maonan Wang, Xi Xiong","doi":"arxiv-2408.04447","DOIUrl":"https://doi.org/arxiv-2408.04447","url":null,"abstract":"The burgeoning field of autonomous driving necessitates the seamless\u0000integration of autonomous vehicles (AVs) with human-driven vehicles, calling\u0000for more predictable AV behavior and enhanced interaction with human drivers.\u0000Human-like driving, particularly during lane-changing maneuvers on highways, is\u0000a critical area of research due to its significant impact on safety and traffic\u0000flow. Traditional rule-based decision-making approaches often fail to\u0000encapsulate the nuanced boundaries of human behavior in diverse driving\u0000scenarios, while crafting reward functions for learning-based methods\u0000introduces its own set of complexities. This study investigates the application\u0000of Reinforcement Learning from Human Feedback (RLHF) to emulate human-like\u0000lane-changing decisions in AVs. An initial RL policy is pre-trained to ensure\u0000safe lane changes. Subsequently, this policy is employed to gather data, which\u0000is then annotated by humans to train a reward model that discerns lane changes\u0000aligning with human preferences. This human-informed reward model supersedes\u0000the original, guiding the refinement of the policy to reflect human-like\u0000preferences. The effectiveness of RLHF in producing human-like lane changes is\u0000demonstrated through the development and evaluation of conservative and\u0000aggressive lane-changing models within obstacle-rich environments and mixed\u0000autonomy traffic scenarios. The experimental outcomes underscore the potential\u0000of RLHF to diversify lane-changing behaviors in AVs, suggesting its viability\u0000for enhancing the integration of AVs into the fabric of human-driven traffic.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Computational Engineering, Finance, and Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1