arXiv - CS - Computational Engineering, Finance, and Science最新文献_第10页

Exploring Commercial Vehicle Detouring Patterns through the Application of Probe Trajectory Data 应用探测器轨迹数据探索商用车辆绕行模式

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-24 DOI: arxiv-2407.17319

Mark Franz PhD, Sara Zahedian PhD, Dhairya Parekh, Tahsin Emtenam PhD, Greg Jordan

Understanding motorist detouring behavior is critical for both trafficoperations and planning applications. However, measuring real-world detouringbehavior is challenging due to the need to track the movement of individualvehicles. Recent developments in high-resolution vehicle trajectory data haveenabled transportation professionals to observe real-world detouring behaviorswithout the need to install and maintain hardware such as license plate readingcameras. This paper investigates the feasibility of vehicle probe trajectorydata to capture commercial motor vehicle (CMV) detouring behavior under threeunique case studies. Before doing so, a validation analysis was conducted toinvestigate the ability of CMV probe trajectory data to represent overall CMVvolumes at well-calibrated count stations near virtual weigh stations (VWS) inMaryland. The validation analysis showed strong positive correlations (above0.75) at all VWS stations. Upon validating the data, a methodology was appliedto assess CMV detour behaviors associated with CMV enforcement activities,congestion avoidance, and incident induced temporary road closures.

了解驾驶者的绕行行为对于交通运营和规划应用都至关重要。然而，由于需要跟踪单个车辆的移动，测量真实世界中的绕行行为具有挑战性。高分辨率车辆轨迹数据的最新发展使交通专业人员能够观察真实世界中的绕行行为，而无需安装和维护车牌读取摄像头等硬件。本文研究了车辆探测轨迹数据在三个独特案例研究中捕捉商用机动车（CMV）绕行行为的可行性。在此之前，还进行了验证分析，以调查 CMV 探头轨迹数据在马里兰州虚拟称重站（VWS）附近校准良好的计数站代表 CMV 总流量的能力。验证分析表明，所有虚拟称重站都存在较强的正相关性（高于 0.75）。在验证数据后，应用一种方法来评估与 CMV 执法活动、拥堵规避和事故诱发的临时道路关闭相关的 CMV 绕行行为。

{"title":"Exploring Commercial Vehicle Detouring Patterns through the Application of Probe Trajectory Data","authors":"Mark Franz PhD, Sara Zahedian PhD, Dhairya Parekh, Tahsin Emtenam PhD, Greg Jordan","doi":"arxiv-2407.17319","DOIUrl":"https://doi.org/arxiv-2407.17319","url":null,"abstract":"Understanding motorist detouring behavior is critical for both traffic\u0000operations and planning applications. However, measuring real-world detouring\u0000behavior is challenging due to the need to track the movement of individual\u0000vehicles. Recent developments in high-resolution vehicle trajectory data have\u0000enabled transportation professionals to observe real-world detouring behaviors\u0000without the need to install and maintain hardware such as license plate reading\u0000cameras. This paper investigates the feasibility of vehicle probe trajectory\u0000data to capture commercial motor vehicle (CMV) detouring behavior under three\u0000unique case studies. Before doing so, a validation analysis was conducted to\u0000investigate the ability of CMV probe trajectory data to represent overall CMV\u0000volumes at well-calibrated count stations near virtual weigh stations (VWS) in\u0000Maryland. The validation analysis showed strong positive correlations (above\u00000.75) at all VWS stations. Upon validating the data, a methodology was applied\u0000to assess CMV detour behaviors associated with CMV enforcement activities,\u0000congestion avoidance, and incident induced temporary road closures.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141771818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

KnowTD-An Actionable Knowledge Representation System for Thermodynamics KnowTD--热力学可操作知识表示系统

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-24 DOI: arxiv-2407.17169

Luisa Vollmer, Sophie Fellenz, Fabian Jirasek, Heike Leitte, Hans Hasse

We demonstrate that thermodynamic knowledge acquired by humans can betransferred to computers so that the machine can use it to solve thermodynamicproblems and produce explainable solutions with a guarantee of correctness. Theactionable knowledge representation system that we have created for thispurpose is called KnowTD. It is based on an ontology of thermodynamics thatrepresents knowledge of thermodynamic theory, material properties, andthermodynamic problems. The ontology is coupled with a reasoner that sets upthe problem to be solved based on user input, extracts the correct, pertinentequations from the ontology, solves the resulting mathematical problem, andreturns the solution to the user, together with an explanation of how it wasobtained. KnowTD is presently limited to simple thermodynamic problems, similarto those discussed in an introductory course in Engineering Thermodynamics.This covers the basic theory and working principles of thermodynamics. KnowTDis designed in a modular way and is easily extendable.

我们证明，人类获得的热力学知识可以转移到计算机上，这样机器就可以利用这些知识来解决热力学问题，并在保证正确性的前提下产生可解释的解决方案。我们为此创建的可操作知识表示系统被称为 KnowTD。它以热力学本体为基础，代表了热力学理论、材料特性和热力学问题等方面的知识。本体与推理器相结合，推理器根据用户输入设置要解决的问题，从本体中提取正确的相关方程，解决由此产生的数学问题，并将解决方案返回给用户，同时解释如何获得该解决方案。KnowTD 目前仅限于简单的热力学问题，与工程热力学入门课程中讨论的问题类似。KnowTD 采用模块化设计，易于扩展。

引用次数: 0

COEFF-KANs: A Paradigm to Address the Electrolyte Field with KANs COEFF-KANs：用 KAN 解决电解质场问题的范例

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-24 DOI: arxiv-2407.20265

Xinhe Li, Zhuoying Feng, Yezeng Chen, Weichen Dai, Zixu He, Yi Zhou, Shuhong Jiao

To reduce the experimental validation workload for chemical researchers andaccelerate the design and optimization of high-energy-density lithium metalbatteries, we aim to leverage models to automatically predict CoulombicEfficiency (CE) based on the composition of liquid electrolytes. There aremainly two representative paradigms in existing methods: machine learning anddeep learning. However, the former requires intelligent input feature selectionand reliable computational methods, leading to error propagation from featureestimation to model prediction, while the latter (e.g. MultiModal-MoLFormer)faces challenges of poor predictive performance and overfitting due to limiteddiversity in augmented data. To tackle these issues, we propose a novel methodCOEFF (COlumbic EFficiency prediction via Fine-tuned models), which consists oftwo stages: pre-training a chemical general model and fine-tuning on downstreamdomain data. Firstly, we adopt the publicly available MoLFormer model to obtainfeature vectors for each solvent and salt in the electrolyte. Then, we performa weighted average of embeddings for each token across all molecules, withweights determined by the respective electrolyte component ratios. Finally, weinput the obtained electrolyte features into a Multi-layer Perceptron orKolmogorov-Arnold Network to predict CE. Experimental results on a real-worlddataset demonstrate that our method achieves SOTA for predicting CE compared toall baselines. Data and code used in this work will be made publicly availableafter the paper is published.

为了减少化学研究人员的实验验证工作量，加快高能量密度锂金属电池的设计和优化，我们旨在利用模型来自动预测基于液态电解质组成的库仑效率（CE）。现有方法中主要有两种代表性范式：机器学习和深度学习。然而，前者需要智能的输入特征选择和可靠的计算方法，从而导致从特征估计到模型预测的误差传播，而后者（如多模态-MoLFormer）由于增强数据的多样性有限，面临着预测性能差和过度拟合的挑战。为了解决这些问题，我们提出了一种新方法 COEFF（通过微调模型进行的铅效率预测），它包括两个阶段：预训练化学通用模型和在下游领域数据上进行微调。首先，我们采用公开的 MoLFormer 模型来获取电解质中每种溶剂和盐的特征向量。然后，我们对所有分子中每个标记的嵌入进行加权平均，权重由各自的电解质成分比决定。最后，我们将获得的电解质特征输入多层感知器或科尔莫哥罗夫-阿诺德网络，以预测 CE。在实际数据集上的实验结果表明，与所有基线方法相比，我们的方法在预测 CE 方面达到了 SOTA。这项工作中使用的数据和代码将在论文发表后公开。

{"title":"COEFF-KANs: A Paradigm to Address the Electrolyte Field with KANs","authors":"Xinhe Li, Zhuoying Feng, Yezeng Chen, Weichen Dai, Zixu He, Yi Zhou, Shuhong Jiao","doi":"arxiv-2407.20265","DOIUrl":"https://doi.org/arxiv-2407.20265","url":null,"abstract":"To reduce the experimental validation workload for chemical researchers and\u0000accelerate the design and optimization of high-energy-density lithium metal\u0000batteries, we aim to leverage models to automatically predict Coulombic\u0000Efficiency (CE) based on the composition of liquid electrolytes. There are\u0000mainly two representative paradigms in existing methods: machine learning and\u0000deep learning. However, the former requires intelligent input feature selection\u0000and reliable computational methods, leading to error propagation from feature\u0000estimation to model prediction, while the latter (e.g. MultiModal-MoLFormer)\u0000faces challenges of poor predictive performance and overfitting due to limited\u0000diversity in augmented data. To tackle these issues, we propose a novel method\u0000COEFF (COlumbic EFficiency prediction via Fine-tuned models), which consists of\u0000two stages: pre-training a chemical general model and fine-tuning on downstream\u0000domain data. Firstly, we adopt the publicly available MoLFormer model to obtain\u0000feature vectors for each solvent and salt in the electrolyte. Then, we perform\u0000a weighted average of embeddings for each token across all molecules, with\u0000weights determined by the respective electrolyte component ratios. Finally, we\u0000input the obtained electrolyte features into a Multi-layer Perceptron or\u0000Kolmogorov-Arnold Network to predict CE. Experimental results on a real-world\u0000dataset demonstrate that our method achieves SOTA for predicting CE compared to\u0000all baselines. Data and code used in this work will be made publicly available\u0000after the paper is published.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141863337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fusing LLMs and KGs for Formal Causal Reasoning behind Financial Risk Contagion 融合 LLMs 和 KGs 进行金融风险蔓延背后的形式因果推理

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-24 DOI: arxiv-2407.17190

Guanyuan Yu, Xv Wang, Qing Li, Yu Zhao

Financial risks trend to spread from one entity to another, ultimatelyleading to systemic risks. The key to preventing such risks lies inunderstanding the causal chains behind risk contagion. Despite this, prevailingapproaches primarily emphasize identifying risks, overlooking the underlyingcausal analysis of risk. To address such an issue, we propose a Risk ContagionCausal Reasoning model called RC2R, which uses the logical reasoningcapabilities of large language models (LLMs) to dissect the causal mechanismsof risk contagion grounded in the factual and expert knowledge embedded withinfinancial knowledge graphs (KGs). At the data level, we utilize financial KGsto construct causal instructions, empowering LLMs to perform formal causalreasoning on risk propagation and tackle the "causal parrot" problem of LLMs.In terms of model architecture, we integrate a fusion module that aligns tokensand nodes across various granularities via multi-scale contrastive learning,followed by the amalgamation of textual and graph-structured data through softprompt with cross multi-head attention mechanisms. To quantify risk contagion,we introduce a risk pathway inference module for calculating risk scores foreach node in the graph. Finally, we visualize the risk contagion pathways andtheir intensities using Sankey diagrams, providing detailed causalexplanations. Comprehensive experiments on financial KGs and supply chaindatasets demonstrate that our model outperforms several state-of-the-art modelsin prediction performance and out-of-distribution (OOD) generalizationcapabilities. We will make our dataset and code publicly accessible toencourage further research and development in this field.

金融风险有从一个实体蔓延到另一个实体的趋势，最终导致系统性风险。防范此类风险的关键在于了解风险蔓延背后的因果链。尽管如此，目前流行的方法主要强调识别风险，而忽视了风险背后的因果分析。为了解决这个问题，我们提出了一个名为 RC2R 的风险传染因果推理模型，该模型利用大型语言模型（LLM）的逻辑推理能力来剖析蕴含在金融知识图谱（KG）中的事实知识和专家知识的风险传染因果机制。在数据层面，我们利用金融知识图谱来构建因果指令，从而使 LLMs 能够对风险传播进行正式的因果推理，并解决 LLMs 的 "因果鹦鹉 "问题。在模型架构方面，我们集成了一个融合模块，该模块通过多尺度对比学习来调整不同粒度的标记和节点，然后通过软提示和交叉多头关注机制来合并文本和图结构化数据。为了量化风险传染，我们引入了一个风险路径推断模块，用于计算图中每个节点的风险分数。最后，我们利用桑基图（Sankey diagrams）将风险传染路径及其强度可视化，并提供详细的因果关系解释。在金融 KG 和供应链数据集上进行的综合实验表明，我们的模型在预测性能和分布外泛化能力方面优于多个最先进的模型。我们将公开我们的数据集和代码，以鼓励在这一领域的进一步研究和开发。

{"title":"Fusing LLMs and KGs for Formal Causal Reasoning behind Financial Risk Contagion","authors":"Guanyuan Yu, Xv Wang, Qing Li, Yu Zhao","doi":"arxiv-2407.17190","DOIUrl":"https://doi.org/arxiv-2407.17190","url":null,"abstract":"Financial risks trend to spread from one entity to another, ultimately\u0000leading to systemic risks. The key to preventing such risks lies in\u0000understanding the causal chains behind risk contagion. Despite this, prevailing\u0000approaches primarily emphasize identifying risks, overlooking the underlying\u0000causal analysis of risk. To address such an issue, we propose a Risk Contagion\u0000Causal Reasoning model called RC2R, which uses the logical reasoning\u0000capabilities of large language models (LLMs) to dissect the causal mechanisms\u0000of risk contagion grounded in the factual and expert knowledge embedded within\u0000financial knowledge graphs (KGs). At the data level, we utilize financial KGs\u0000to construct causal instructions, empowering LLMs to perform formal causal\u0000reasoning on risk propagation and tackle the \"causal parrot\" problem of LLMs.\u0000In terms of model architecture, we integrate a fusion module that aligns tokens\u0000and nodes across various granularities via multi-scale contrastive learning,\u0000followed by the amalgamation of textual and graph-structured data through soft\u0000prompt with cross multi-head attention mechanisms. To quantify risk contagion,\u0000we introduce a risk pathway inference module for calculating risk scores for\u0000each node in the graph. Finally, we visualize the risk contagion pathways and\u0000their intensities using Sankey diagrams, providing detailed causal\u0000explanations. Comprehensive experiments on financial KGs and supply chain\u0000datasets demonstrate that our model outperforms several state-of-the-art models\u0000in prediction performance and out-of-distribution (OOD) generalization\u0000capabilities. We will make our dataset and code publicly accessible to\u0000encourage further research and development in this field.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141771820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Reduced Order Model conditioned on monitoring features for estimation and uncertainty quantification in engineered systems 用于工程系统估算和不确定性量化的以监测特征为条件的降序模型

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-24 DOI: arxiv-2407.17139

Konstantinos Vlachas, Thomas Simpson, Anthony Garland, D. Dane Quinn, Charbel Farhat, Eleni Chatzi

Reduced Order Models (ROMs) form essential tools across engineering domainsby virtue of their function as surrogates for computationally intensive digitaltwinning simulators. Although purely data-driven methods are available for ROMconstruction, schemes that allow to retain a portion of the physics tend toenhance the interpretability and generalization of ROMs. However, physics-basedtechniques can adversely scale when dealing with nonlinear systems that featureparametric dependencies. This study introduces a generative physics-based ROMthat is suited for nonlinear systems with parametric dependencies and isadditionally able to quantify the confidence associated with the respectiveestimates. A main contribution of this work is the conditioning of theseparametric ROMs to features that can be derived from monitoring measurements,feasibly in an online fashion. This is contrary to most existing ROM schemes,which remain restricted to the prescription of the physics-based, and usually apriori unknown, system parameters. Our work utilizes conditional VariationalAutoencoders to continuously map the required reduction bases to a featurevector extracted from limited output measurements, while additionally allowingfor a probabilistic assessment of the ROM-estimated Quantities of Interest. Anauxiliary task using a neural network-based parametrization of suitableprobability distributions is introduced to re-establish the link with physicalmodel parameters. We verify the proposed scheme on a series of simulated casestudies incorporating effects of geometric and material nonlinearity underparametric dependencies related to system properties and input loadcharacteristics.

还原阶次模型（ROM）是工程领域的重要工具，因为它可以替代计算密集型数字孪生模拟器。尽管有纯数据驱动的 ROM 构建方法，但允许保留部分物理特性的方案往往会增强 ROM 的可解释性和通用性。然而，在处理具有参数依赖性特征的非线性系统时，基于物理的技术可能会对扩展产生不利影响。本研究介绍了一种基于物理的生成式 ROM，它适用于具有参数依赖性的非线性系统，此外还能量化与各自估计值相关的置信度。这项工作的一个主要贡献是将这些参数 ROM 条件化为可从监测测量中获得的特征，并以可行的在线方式进行。这与大多数现有的 ROM 方案相反，这些方案仍然局限于对基于物理的、通常是先验未知的系统参数进行规定。我们的工作利用条件变异自动编码器将所需的还原基础持续映射到从有限的输出测量中提取的特征向量上，同时还允许对 ROM 估算的相关量进行概率评估。为了重新建立与物理模型参数的联系，我们引入了一项辅助任务，即使用基于神经网络的适当概率分布参数化。我们在一系列模拟案例研究中验证了所提出的方案，其中包括与系统属性和输入负载特征相关的参数依赖性下的几何和材料非线性效应。

{"title":"A Reduced Order Model conditioned on monitoring features for estimation and uncertainty quantification in engineered systems","authors":"Konstantinos Vlachas, Thomas Simpson, Anthony Garland, D. Dane Quinn, Charbel Farhat, Eleni Chatzi","doi":"arxiv-2407.17139","DOIUrl":"https://doi.org/arxiv-2407.17139","url":null,"abstract":"Reduced Order Models (ROMs) form essential tools across engineering domains\u0000by virtue of their function as surrogates for computationally intensive digital\u0000twinning simulators. Although purely data-driven methods are available for ROM\u0000construction, schemes that allow to retain a portion of the physics tend to\u0000enhance the interpretability and generalization of ROMs. However, physics-based\u0000techniques can adversely scale when dealing with nonlinear systems that feature\u0000parametric dependencies. This study introduces a generative physics-based ROM\u0000that is suited for nonlinear systems with parametric dependencies and is\u0000additionally able to quantify the confidence associated with the respective\u0000estimates. A main contribution of this work is the conditioning of these\u0000parametric ROMs to features that can be derived from monitoring measurements,\u0000feasibly in an online fashion. This is contrary to most existing ROM schemes,\u0000which remain restricted to the prescription of the physics-based, and usually a\u0000priori unknown, system parameters. Our work utilizes conditional Variational\u0000Autoencoders to continuously map the required reduction bases to a feature\u0000vector extracted from limited output measurements, while additionally allowing\u0000for a probabilistic assessment of the ROM-estimated Quantities of Interest. An\u0000auxiliary task using a neural network-based parametrization of suitable\u0000probability distributions is introduced to re-establish the link with physical\u0000model parameters. We verify the proposed scheme on a series of simulated case\u0000studies incorporating effects of geometric and material nonlinearity under\u0000parametric dependencies related to system properties and input load\u0000characteristics.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141771901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A spatiotemporal deep learning framework for prediction of crack dynamics in heterogeneous solids: efficient mapping of concrete microstructures to its fracture properties 用于预测异质固体裂缝动力学的时空深度学习框架：混凝土微结构与其断裂特性的高效映射

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-22 DOI: arxiv-2407.15665

Rasoul Najafi Koopas, Shahed Rezaei, Natalie Rauter, Richard Ostwald, Rolf Lammering

A spatiotemporal deep learning framework is proposed that is capable of 2Dfull-field prediction of fracture in concrete mesostructures. This frameworknot only predicts fractures but also captures the entire history of thefracture process, from the crack initiation in the interfacial transition zoneto the subsequent propagation of the cracks in the mortar matrix. In addition,a convolutional neural network is developed which can predict the averagedstress-strain curve of the mesostructures. The UNet modeling framework, whichcomprises an encoder-decoder section with skip connections, is used as the deeplearning surrogate model. Training and test data are generated fromhigh-fidelity fracture simulations of randomly generated concretemesostructures. These mesostructures include geometric variabilities such asdifferent aggregate particle geometrical features, spatial distribution, andthe total volume fraction of aggregates. The fracture simulations are carriedout in Abaqus, utilizing the cohesive phase-field fracture modeling techniqueas the fracture modeling approach. In this work, to reduce the number oftraining datasets, the spatial distribution of three sets of materialproperties for three-phase concrete mesostructures, along with the spatialphase-field damage index, are fed to the UNet to predict the correspondingstress and spatial damage index at the subsequent step. It is shown that afterthe training process using this methodology, the UNet model is capable ofaccurately predicting damage on the unseen test dataset by using 470 datasets.Moreover, another novel aspect of this work is the conversion of irregularfinite element data into regular grids using a developed pipeline. Thisapproach allows for the implementation of less complex UNet architecture andfacilitates the integration of phase-field fracture equations into surrogatemodels for future developments.

本文提出了一种时空深度学习框架，能够对混凝土中间结构的断裂进行二维全场预测。该框架不仅能预测断裂，还能捕捉断裂过程的整个历史，包括从界面过渡带的裂缝起始到随后砂浆基体中裂缝的扩展。此外，还开发了一个卷积神经网络，可以预测中间结构的平均应力应变曲线。UNet 建模框架包括一个具有跳接连接的编码器-解码器部分，被用作深度学习代用模型。训练和测试数据来自随机生成的混凝土中间结构的高保真断裂模拟。这些中间结构包括几何变量，如不同的集料颗粒几何特征、空间分布和集料的总体积分数。断裂模拟在 Abaqus 中进行，采用内聚相场断裂建模技术作为断裂建模方法。在这项工作中，为了减少训练数据集的数量，将三相混凝土中间结构的三组材料属性的空间分布以及空间相场损伤指数输入 UNet，以预测后续步骤中相应的应力和空间损伤指数。结果表明，在使用这种方法进行训练后，UNet 模型能够通过使用 470 个数据集准确预测未见测试数据集上的损伤。此外，这项工作的另一个新颖之处在于使用开发的管道将不规则有限元数据转换为规则网格。这种方法允许实施不太复杂的 UNet 体系结构，并有助于将相场断裂方程集成到未来开发的代用模型中。

{"title":"A spatiotemporal deep learning framework for prediction of crack dynamics in heterogeneous solids: efficient mapping of concrete microstructures to its fracture properties","authors":"Rasoul Najafi Koopas, Shahed Rezaei, Natalie Rauter, Richard Ostwald, Rolf Lammering","doi":"arxiv-2407.15665","DOIUrl":"https://doi.org/arxiv-2407.15665","url":null,"abstract":"A spatiotemporal deep learning framework is proposed that is capable of 2D\u0000full-field prediction of fracture in concrete mesostructures. This framework\u0000not only predicts fractures but also captures the entire history of the\u0000fracture process, from the crack initiation in the interfacial transition zone\u0000to the subsequent propagation of the cracks in the mortar matrix. In addition,\u0000a convolutional neural network is developed which can predict the averaged\u0000stress-strain curve of the mesostructures. The UNet modeling framework, which\u0000comprises an encoder-decoder section with skip connections, is used as the deep\u0000learning surrogate model. Training and test data are generated from\u0000high-fidelity fracture simulations of randomly generated concrete\u0000mesostructures. These mesostructures include geometric variabilities such as\u0000different aggregate particle geometrical features, spatial distribution, and\u0000the total volume fraction of aggregates. The fracture simulations are carried\u0000out in Abaqus, utilizing the cohesive phase-field fracture modeling technique\u0000as the fracture modeling approach. In this work, to reduce the number of\u0000training datasets, the spatial distribution of three sets of material\u0000properties for three-phase concrete mesostructures, along with the spatial\u0000phase-field damage index, are fed to the UNet to predict the corresponding\u0000stress and spatial damage index at the subsequent step. It is shown that after\u0000the training process using this methodology, the UNet model is capable of\u0000accurately predicting damage on the unseen test dataset by using 470 datasets.\u0000Moreover, another novel aspect of this work is the conversion of irregular\u0000finite element data into regular grids using a developed pipeline. This\u0000approach allows for the implementation of less complex UNet architecture and\u0000facilitates the integration of phase-field fracture equations into surrogate\u0000models for future developments.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141771826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Spatio-Temporal Approach with Self-Corrective Causal Inference for Flight Delay Prediction 利用自校正因果推理进行航班延误预测的时空方法

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-21 DOI: arxiv-2407.15185

Qihui Zhu, Shenwen Chen, Tong Guo, Yisheng Lv, Wenbo Du

Accurate flight delay prediction is crucial for the secure and effectiveoperation of the air traffic system. Recent advances in modeling inter-airportrelationships present a promising approach for investigating flight delayprediction from the multi-airport scenario. However, the previous predictionworks only accounted for the simplistic relationships such as traffic flow orgeographical distance, overlooking the intricate interactions among airportsand thus proving inadequate. In this paper, we leverage causal inference toprecisely model inter-airport relationships and propose a self-correctivespatio-temporal graph neural network (named CausalNet) for flight delayprediction. Specifically, Granger causality inference coupled with aself-correction module is designed to construct causality graphs among airportsand dynamically modify them based on the current airport's delays.Additionally, the features of the causality graphs are adaptively extracted andutilized to address the heterogeneity of airports. Extensive experiments areconducted on the real data of top-74 busiest airports in China. The resultsshow that CausalNet is superior to baselines. Ablation studies emphasize thepower of the proposed self-correction causality graph and the graph featureextraction module. All of these prove the effectiveness of the proposedmethodology.

准确的航班延误预测对于空中交通系统的安全有效运行至关重要。机场间关系建模的最新进展为研究多机场情况下的航班延误预测提供了一种很有前景的方法。然而，以往的预测工作只考虑了交通流量或地理距离等简单的关系，忽略了机场间错综复杂的相互作用，因此证明是不够的。在本文中，我们利用因果推理对机场间关系进行精确建模，并提出了一种用于航班延误预测的自校正时空图神经网络（命名为 CausalNet）。具体来说，格兰杰因果推理与自校正模块相结合，用于构建机场间的因果关系图，并根据当前机场的航班延误情况动态修改因果关系图；此外，自适应地提取和利用因果关系图的特征，以解决机场的异质性问题。在中国前 74 个最繁忙机场的真实数据上进行了广泛的实验。结果表明 CausalNet 优于基线。消融研究强调了所提出的自校正因果关系图和图形特征提取模块的能力。所有这些都证明了所提出方法的有效性。

{"title":"A Spatio-Temporal Approach with Self-Corrective Causal Inference for Flight Delay Prediction","authors":"Qihui Zhu, Shenwen Chen, Tong Guo, Yisheng Lv, Wenbo Du","doi":"arxiv-2407.15185","DOIUrl":"https://doi.org/arxiv-2407.15185","url":null,"abstract":"Accurate flight delay prediction is crucial for the secure and effective\u0000operation of the air traffic system. Recent advances in modeling inter-airport\u0000relationships present a promising approach for investigating flight delay\u0000prediction from the multi-airport scenario. However, the previous prediction\u0000works only accounted for the simplistic relationships such as traffic flow or\u0000geographical distance, overlooking the intricate interactions among airports\u0000and thus proving inadequate. In this paper, we leverage causal inference to\u0000precisely model inter-airport relationships and propose a self-corrective\u0000spatio-temporal graph neural network (named CausalNet) for flight delay\u0000prediction. Specifically, Granger causality inference coupled with a\u0000self-correction module is designed to construct causality graphs among airports\u0000and dynamically modify them based on the current airport's delays.\u0000Additionally, the features of the causality graphs are adaptively extracted and\u0000utilized to address the heterogeneity of airports. Extensive experiments are\u0000conducted on the real data of top-74 busiest airports in China. The results\u0000show that CausalNet is superior to baselines. Ablation studies emphasize the\u0000power of the proposed self-correction causality graph and the graph feature\u0000extraction module. All of these prove the effectiveness of the proposed\u0000methodology.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Economy Watchers Survey provides Datasets and Tasks for Japanese Financial Domain 经济观察者调查为日本金融领域提供数据集和任务

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-20 DOI: arxiv-2407.14727

Masahiro Suzuki, Hiroki Sakaji

Many natural language processing (NLP) tasks in English or general domainsare widely available and are often used to evaluate pre-trained languagemodels. In contrast, there are fewer tasks available for languages other thanEnglish and for the financial domain. In particular, tasks in Japanese and thefinancial domain are limited. We construct two large datasets using materialspublished by a Japanese central government agency. The datasets provide threeJapanese financial NLP tasks, which include a 3-class and 12-classclassification for categorizing sentences, as well as a 5-class classificationtask for sentiment analysis. Our datasets are designed to be comprehensive andup-to-date, leveraging an automatic update framework that ensures the latesttask datasets are publicly available anytime.

英语或一般领域中的许多自然语言处理（NLP）任务广泛存在，通常用于评估预训练的语言模型。相比之下，英语以外的语言和金融领域的任务较少。尤其是日语和金融领域的任务非常有限。我们利用日本中央政府机构发布的资料构建了两个大型数据集。这些数据集提供了三个日语金融 NLP 任务，其中包括用于句子分类的 3 级和 12 级分类，以及用于情感分析的 5 级分类任务。我们的数据集设计全面且最新，利用自动更新框架确保随时公开最新任务数据集。

引用次数: 0

Incorporating lane-change prediction into energy-efficient speed control of connected autonomous vehicles at intersections 将变道预测纳入互联自动驾驶车辆在交叉路口的节能速度控制中

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-20 DOI: arxiv-2407.15004

Maziar Zamanpour, Suiyi He, Michael W. Levin, Zongxuan Sun

Connected and autonomous vehicles (CAVs) possess the capability of perceptionand information broadcasting with other CAVs and connected intersections.Additionally, they exhibit computational abilities and can be controlledstrategically, offering energy benefits. One potential control strategy isreal-time speed control, which adjusts the vehicle speed by taking advantage ofbroadcasted traffic information, such as signal timings. However, the optimalcontrol is likely to increase the gap in front of the controlled CAV, whichinduces lane changing by other drivers. This study proposes a modified trafficflow model that aims to predict lane-changing occurrences and assess the impactof lane changes on future traffic states. The primary objective is to improveenergy efficiency. The prediction model is based on a cell division platformand is derived considering the additional flow during lane changing. An optimalcontrol strategy is then developed, subject to the predicted trajectorygenerated for the preceding vehicle. Lane change prediction estimates futurespeed and gap of vehicles, based on predicted traffic states. The proposedframework outperforms the non-lane change traffic model, resulting in up to 13%energy savings when lane changing is predicted 4-6 seconds in advance.

互联自动驾驶车辆（CAV）具有感知能力，并能与其他自动驾驶车辆和互联交叉路口进行信息广播。此外，它们还具有计算能力，可以进行策略控制，从而带来能源效益。一种潜在的控制策略是实时车速控制，即利用广播的交通信息（如信号灯时间）调整车速。然而，最佳控制很可能会增加受控 CAV 前方的空隙，从而导致其他驾驶员变道。本研究提出了一种改进的交通流模型，旨在预测变道发生率并评估变道对未来交通状态的影响。其主要目的是提高能源效率。该预测模型基于细胞分裂平台，并考虑了变道时的额外流量。然后，根据为前一辆车生成的预测轨迹，制定最佳控制策略。车道变更预测根据预测的交通状态估算车辆的未来速度和间隙。所提出的框架优于非变道交通模型，当提前 4-6 秒预测变道时，可节省多达 13% 的能源。

{"title":"Incorporating lane-change prediction into energy-efficient speed control of connected autonomous vehicles at intersections","authors":"Maziar Zamanpour, Suiyi He, Michael W. Levin, Zongxuan Sun","doi":"arxiv-2407.15004","DOIUrl":"https://doi.org/arxiv-2407.15004","url":null,"abstract":"Connected and autonomous vehicles (CAVs) possess the capability of perception\u0000and information broadcasting with other CAVs and connected intersections.\u0000Additionally, they exhibit computational abilities and can be controlled\u0000strategically, offering energy benefits. One potential control strategy is\u0000real-time speed control, which adjusts the vehicle speed by taking advantage of\u0000broadcasted traffic information, such as signal timings. However, the optimal\u0000control is likely to increase the gap in front of the controlled CAV, which\u0000induces lane changing by other drivers. This study proposes a modified traffic\u0000flow model that aims to predict lane-changing occurrences and assess the impact\u0000of lane changes on future traffic states. The primary objective is to improve\u0000energy efficiency. The prediction model is based on a cell division platform\u0000and is derived considering the additional flow during lane changing. An optimal\u0000control strategy is then developed, subject to the predicted trajectory\u0000generated for the preceding vehicle. Lane change prediction estimates future\u0000speed and gap of vehicles, based on predicted traffic states. The proposed\u0000framework outperforms the non-lane change traffic model, resulting in up to 13%\u0000energy savings when lane changing is predicted 4-6 seconds in advance.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141771821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Projection-based model-order reduction for unstructured meshes with graph autoencoders 利用图自编码器对非结构网格进行基于投影的模型阶次缩减

arXiv - CS - Computational Engineering, Finance, and Science

Pub Date : 2024-07-18 DOI: arxiv-2407.13669

Liam K. MagargalDepartment of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, PA, United States, Parisa KhodabakhshiDepartment of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, PA, United States, Steven N. RodriguezComputational Multiphysics Systems Laboratory, United States Naval Research Laboratory, Washington, DC, United States, Justin W. JaworskiKevin T. Crofton Department of Aerospace and Ocean Engineering, Virginia Tech, Blacksburg, VA, United States, John G. MichopoulosComputational Multiphysics Systems Laboratory, United States Naval Research Laboratory, Washington, DC, United States

This paper presents a graph autoencoder architecture capable of performingprojection-based model-order reduction (PMOR) on advection-dominated flowsmodeled by unstructured meshes. The autoencoder is coupled with the timeintegration scheme from a traditional deep least-squares Petrov-Galerkinprojection and provides the first deployment of a graph autoencoder into a PMORframework. The presented graph autoencoder is constructed with a two-partprocess that consists of (1) generating a hierarchy of reduced graphs toemulate the compressive abilities of convolutional neural networks (CNNs) and(2) training a message passing operation at each step in the hierarchy ofreduced graphs to emulate the filtering process of a CNN. The resultingframework provides improved flexibility over traditional CNN-based autoencodersbecause it is extendable to unstructured meshes. To highlight the capabilitiesof the proposed framework, which is named geometric deep least-squaresPetrov-Galerkin (GD-LSPG), we benchmark the method on a one-dimensionalBurgers' equation problem with a structured mesh and demonstrate theflexibility of GD-LSPG by deploying it to a two-dimensional Euler equationsmodel that uses an unstructured mesh. The proposed framework providesconsiderable improvement in accuracy for very low-dimensional latent spaces incomparison with traditional affine projections.

本文提出了一种图自动编码器架构，能够对非结构网格建模的平流主导流进行基于投影的模型阶次缩减（PMOR）。该自动编码器与传统深最小二乘 Petrov-Galerkin 投影的时间积分方案相结合，首次将图自动编码器应用到 PMOR 框架中。所介绍的图自动编码器由两部分过程构建而成，包括：(1) 生成还原图层次结构，以模拟卷积神经网络（CNN）的压缩能力；(2) 在还原图层次结构的每一步训练消息传递操作，以模拟 CNN 的过滤过程。与传统的基于 CNN 的自动编码器相比，该框架具有更高的灵活性，因为它可以扩展到非结构化网格。为了突出所提框架（命名为几何深最小二乘Petrov-Galerkin（GD-LSPG））的能力，我们在使用结构网格的一维伯格斯方程问题上对该方法进行了基准测试，并通过将其部署到使用非结构网格的二维欧拉方程模型上，展示了GD-LSPG的灵活性。与传统的仿射投影相比，所提出的框架大大提高了超低维潜在空间的精度。

{"title":"Projection-based model-order reduction for unstructured meshes with graph autoencoders","authors":"Liam K. MagargalDepartment of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, PA, United States, Parisa KhodabakhshiDepartment of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, PA, United States, Steven N. RodriguezComputational Multiphysics Systems Laboratory, United States Naval Research Laboratory, Washington, DC, United States, Justin W. JaworskiKevin T. Crofton Department of Aerospace and Ocean Engineering, Virginia Tech, Blacksburg, VA, United States, John G. MichopoulosComputational Multiphysics Systems Laboratory, United States Naval Research Laboratory, Washington, DC, United States","doi":"arxiv-2407.13669","DOIUrl":"https://doi.org/arxiv-2407.13669","url":null,"abstract":"This paper presents a graph autoencoder architecture capable of performing\u0000projection-based model-order reduction (PMOR) on advection-dominated flows\u0000modeled by unstructured meshes. The autoencoder is coupled with the time\u0000integration scheme from a traditional deep least-squares Petrov-Galerkin\u0000projection and provides the first deployment of a graph autoencoder into a PMOR\u0000framework. The presented graph autoencoder is constructed with a two-part\u0000process that consists of (1) generating a hierarchy of reduced graphs to\u0000emulate the compressive abilities of convolutional neural networks (CNNs) and\u0000(2) training a message passing operation at each step in the hierarchy of\u0000reduced graphs to emulate the filtering process of a CNN. The resulting\u0000framework provides improved flexibility over traditional CNN-based autoencoders\u0000because it is extendable to unstructured meshes. To highlight the capabilities\u0000of the proposed framework, which is named geometric deep least-squares\u0000Petrov-Galerkin (GD-LSPG), we benchmark the method on a one-dimensional\u0000Burgers' equation problem with a structured mesh and demonstrate the\u0000flexibility of GD-LSPG by deploying it to a two-dimensional Euler equations\u0000model that uses an unstructured mesh. The proposed framework provides\u0000considerable improvement in accuracy for very low-dimensional latent spaces in\u0000comparison with traditional affine projections.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0