Latest publications in Graphical Models

Modeling multi-style portrait relief from a single photograph
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-12-01 · Epub Date: 2023-11-28 · DOI: 10.1016/j.gmod.2023.101210
Yu-Wei Zhang, Hongguang Yang, Ping Luo, Zhi Li, Hui Liu, Zhongping Ji, Caiming Zhang

This paper extends the method of Zhang et al. (2023) to produce not only portrait bas-reliefs from single photographs, but also high-depth reliefs with reasonable depth ordering. We cast this task as a problem of style-aware photo-to-depth translation, where the input is a photograph conditioned by a style vector and the output is a portrait relief with the desired depth style. To construct ground-truth data for network training, we first propose an optimization-based method to synthesize high-depth reliefs from 3D portraits. Then, we train a normal-to-depth network to learn the mapping from normal maps to relief depths. After that, we use the trained network to generate high-depth relief samples from the normal maps provided by Zhang et al. (2023). As each normal map is paired with a pixel-wise aligned photograph, we are able to establish correspondences between photographs and high-depth reliefs. By taking the bas-reliefs of Zhang et al. (2023), the new high-depth reliefs, and their mixtures as target ground truths, we finally train an encoder-to-decoder network to achieve style-aware relief modeling. Specifically, the network is based on a U-shaped architecture consisting of Swin Transformer blocks that process hierarchical deep features. Extensive experiments demonstrate the effectiveness of the proposed method, and comparisons with previous works verify its flexibility and state-of-the-art performance.
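For a concrete picture of the photo-to-depth formulation, here is a minimal PyTorch sketch of style-vector conditioning in an encoder-decoder; the tiny convolutional stack merely stands in for the paper's U-shaped Swin Transformer architecture, and all layer sizes and names are illustrative assumptions.

```python
# Hypothetical sketch: style-conditioned photo-to-depth translation.
# The real network uses Swin Transformer blocks in a U-shape; this
# small conv encoder-decoder only illustrates the conditioning idea.
import torch
import torch.nn as nn

class StyleConditionedDepthNet(nn.Module):
    def __init__(self, style_dim=3, width=32):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, width, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(width, width * 2, 3, stride=2, padding=1), nn.ReLU(),
        )
        # The style vector (e.g., bas-relief vs. high-depth, or a mixture)
        # is broadcast and concatenated with the bottleneck features.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(width * 2 + style_dim, width, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(width, 1, 4, stride=2, padding=1),
        )

    def forward(self, photo, style):
        feat = self.encoder(photo)                       # B x C x H/4 x W/4
        b, _, h, w = feat.shape
        style_map = style[:, :, None, None].expand(b, -1, h, w)
        return self.decoder(torch.cat([feat, style_map], dim=1))  # relief depth

photo = torch.rand(1, 3, 128, 128)
style = torch.tensor([[1.0, 0.0, 0.0]])  # e.g., a "bas-relief" one-hot code
depth = StyleConditionedDepthNet()(photo, style)
```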

Citations: 0
A systematic approach for enhancement of homogeneous background images using structural information
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-12-01 · Epub Date: 2023-10-25 · DOI: 10.1016/j.gmod.2023.101206
D. Vijayalakshmi, Malaya Kumar Nath

Image enhancement is an indispensable pre-processing step for several image processing applications. Histogram equalization, one of the most widespread techniques, improves image quality by expanding pixel values to fill the entire dynamic grayscale range. However, it can introduce visual artifacts, lose structural information near edges (a consequence of its many-to-one mapping), and shift the average luminance to a higher value. This paper proposes an enhancement algorithm based on structural information for homogeneous background images. The intensities are divided into two segments at the median value to preserve the average luminance. Unlike traditional techniques, this algorithm incorporates spatial locations in the equalization process instead of just the number of occurrences of each intensity value. The occurrences of each intensity, together with their spatial locations, are combined using Rényi entropy to enumerate a discrete function. An adaptive clipping limit is applied to the discrete function to control the enhancement rate. Histogram equalization is then performed on each segment separately, and the equalized segments are integrated to produce an enhanced image. The algorithm's effectiveness is validated on the CEED, CSIQ, LOL, and TID2013 databases. Experimental results reveal that the proposed method improves contrast while preserving structural information, detail information, and average luminance. Compared with methods available in the literature, including deep learning architectures, these gains are quantified by higher contrast improvement index, structural similarity index, and discrete entropy values, and by lower average mean brightness error.
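To make the pipeline concrete, the following NumPy sketch splits the histogram at the median, clips each sub-histogram to limit the enhancement rate, and equalizes the segments independently. The paper's spatial-location weighting via Rényi entropy is replaced here by plain occurrence counts, so this is an assumption-laden illustration rather than the authors' exact method.

```python
# Minimal sketch: median-split histogram equalization with clipping.
import numpy as np

def median_split_equalize(img, clip_ratio=2.0):
    med = int(np.median(img))
    out = np.empty_like(img)
    for lo, hi, mask in [(0, med, img <= med), (med + 1, 255, img > med)]:
        if not mask.any():
            continue
        hist, _ = np.histogram(img[mask], bins=hi - lo + 1, range=(lo, hi + 1))
        limit = clip_ratio * hist.mean()
        excess = np.clip(hist - limit, 0, None).sum()
        hist = np.minimum(hist, limit) + excess / hist.size  # redistribute excess
        cdf = np.cumsum(hist) / hist.sum()
        # Map each segment onto its own range, preserving the median split.
        out[mask] = lo + np.round(cdf[img[mask] - lo] * (hi - lo)).astype(img.dtype)
    return out

img = np.random.randint(0, 256, (64, 64), dtype=np.uint8)
enhanced = median_split_equalize(img)
```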

Citations: 0
High-performance Ellipsoidal Clipmaps
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-12-01 · Epub Date: 2023-11-30 · DOI: 10.1016/j.gmod.2023.101209
Aleksandar Dimitrijević, Dejan Rančić

This paper presents performance improvements for Ellipsoid Clipmaps, an out-of-core, planet-sized, geodetically accurate terrain rendering algorithm. The improvements were achieved by eliminating unnecessarily dense levels, more accurate block culling in the geographic coordinate system, and more efficient rendering methods. The elimination of unnecessarily dense levels results from analyzing and determining the optimal relative height of the viewer with respect to the most detailed level, which yields the most consistent triangle size across all visible levels. The proposed method for estimating block visibility based on view orientation allows rapid block-level view frustum culling in data space, before visualization and spatial transformation of blocks. Using a modern geometry pipeline through task and mesh shaders forced the handling of extremely fine-grained blocks, but also shifted a significant part of the block culling process from the CPU to the GPU. The approach achieves high throughput and enables geodetically accurate, real-time rendering of terrain based on the WGS 84 reference ellipsoid at very high resolution, with tens of millions of triangles averaging about 0.5 pix² on a 1080p screen on mid-range graphics cards.
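As an illustration of culling in geographic coordinates, the hypothetical sketch below back-face-culls a terrain block on the WGS 84 ellipsoid directly from its geodetic coordinates; the simple normal-versus-view dot-product test and all names are assumptions, not the paper's culling procedure.

```python
# Hypothetical sketch: view-orientation block culling on the WGS 84 ellipsoid.
import numpy as np

A, B = 6378137.0, 6356752.314245  # WGS 84 semi-axes (meters)

def geodetic_to_ecef(lat, lon, h=0.0):
    e2 = 1.0 - (B / A) ** 2
    n = A / np.sqrt(1.0 - e2 * np.sin(lat) ** 2)  # prime vertical radius
    return np.array([(n + h) * np.cos(lat) * np.cos(lon),
                     (n + h) * np.cos(lat) * np.sin(lon),
                     (n * (1 - e2) + h) * np.sin(lat)])

def block_visible(block_lat, block_lon, eye_ecef):
    # Ellipsoid surface normal at geodetic (lat, lon).
    normal = np.array([np.cos(block_lat) * np.cos(block_lon),
                       np.cos(block_lat) * np.sin(block_lon),
                       np.sin(block_lat)])
    to_eye = eye_ecef - geodetic_to_ecef(block_lat, block_lon)
    return np.dot(normal, to_eye) > 0.0  # keep front-facing blocks only

eye = geodetic_to_ecef(np.radians(45.0), np.radians(20.0), h=500e3)
print(block_visible(np.radians(44.0), np.radians(19.0), eye))
```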

Citations: 0
PU-GAT: Point cloud upsampling with graph attention network
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-12-01 · Epub Date: 2023-09-25 · DOI: 10.1016/j.gmod.2023.101201
Xuan Deng, Cheng Zhang, Jian Shi, Zizhao Wu

Point cloud upsampling has been studied extensively; however, existing approaches lose structural information because they neglect spatial dependencies between points. In this work, we propose PU-GAT, a novel 3D point cloud upsampling method that leverages graph attention networks to learn structural information beyond the baselines. Specifically, we first design a local-global feature extraction unit that combines spatial information and position encoding to mine the local spatial inter-dependencies across point features. Then, we construct an up-down-up feature expansion unit, which uses graph attention and GCNs to enhance the ability to capture local structure information. Extensive experiments on synthetic and real data show that our method achieves superior quantitative and qualitative performance compared with previous methods.
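The kind of building block such a network composes can be sketched compactly: graph attention over k-nearest-neighbor point features. Layer sizes, the dot-product attention form, and the kNN construction below are illustrative assumptions.

```python
# Minimal sketch: attention-weighted aggregation over a kNN point graph.
import torch
import torch.nn.functional as F

def knn_graph_attention(xyz, feats, k=8):
    # xyz: N x 3 point positions, feats: N x C per-point features.
    dist = torch.cdist(xyz, xyz)                          # N x N distances
    idx = dist.topk(k + 1, largest=False).indices[:, 1:]  # drop self-neighbor
    neigh = feats[idx]                                    # N x k x C
    # Attention logits from (center, neighbor) feature pairs.
    center = feats.unsqueeze(1).expand_as(neigh)
    logits = (center * neigh).sum(-1) / feats.shape[-1] ** 0.5  # N x k
    w = F.softmax(logits, dim=-1)
    return (w.unsqueeze(-1) * neigh).sum(1)               # N x C aggregated

pts = torch.rand(256, 3)
f = torch.rand(256, 32)
out = knn_graph_attention(pts, f)
```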

Citations: 0
Fast progressive polygonal approximations for online strokes
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-10-01 · Epub Date: 2023-09-04 · DOI: 10.1016/j.gmod.2023.101200
Mohammad Tanvir Parvez

This paper presents a fast, progressive polygonal approximation algorithm for online strokes. A stroke is defined as a sequence of points between a pen-down and a pen-up. The proposed method generates polygonal approximations progressively as the user inputs the stroke, making it suitable for real-time shape modeling and retrieval. The number of operations used in the algorithm is bounded by O(n), where n is the number of points in a stroke. Detailed experimental results show that the proposed method is not only fast but also sufficiently accurate compared with other reported algorithms.
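The progressive idea lends itself to a streaming formulation. The sketch below is a minimal illustration, assuming a simple rule (commit a vertex when the previous sample deviates too far from the chord to the newest point); it shows how constant work per point gives an O(n) bound, but it is not the paper's specific algorithm.

```python
# Minimal sketch: streaming polygonal approximation, O(1) work per point.
import math

class ProgressiveApprox:
    def __init__(self, tol=0.5):
        self.tol = tol
        self.verts = []    # committed polygon vertices
        self.prev = None   # most recent raw point, not yet committed

    def add_point(self, p):
        if not self.verts:
            self.verts.append(p)
            self.prev = p
            return
        a, q = self.verts[-1], self.prev
        # Perpendicular distance of the previous sample q from the chord
        # running from the last vertex a to the new point p.
        chord = math.hypot(p[0] - a[0], p[1] - a[1]) or 1e-12
        dev = abs((p[0] - a[0]) * (a[1] - q[1])
                  - (a[0] - q[0]) * (p[1] - a[1])) / chord
        if dev > self.tol:          # the stroke bends: commit a vertex
            self.verts.append(q)
        self.prev = p

    def finish(self):
        if self.prev is not None and self.prev != self.verts[-1]:
            self.verts.append(self.prev)
        return self.verts

ap = ProgressiveApprox(tol=0.5)
for pt in [(0, 0), (1, 0.1), (2, 0), (3, 2), (4, 4)]:
    ap.add_point(pt)
print(ap.finish())   # the corner at (2, 0) is kept as a vertex
```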

Citations: 1
MixNet: Mix different networks for learning 3D implicit representations
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-10-01 · Epub Date: 2023-07-25 · DOI: 10.1016/j.gmod.2023.101190
Bowen Lyu, Li-Yong Shen, Chun-Ming Yuan

We introduce a neural network, MixNet, for learning implicit representations of 3D subtle models with large smooth areas and exact shape details, in the form of an interpolation of two different implicit functions. Our network takes a point cloud as input and uses a conventional MLP network and a SIREN network to predict different implicit fields. We use a learnable interpolation function to combine the implicit values of these two networks and obtain the respective advantages of each. The network is self-supervised with only a reconstruction loss, leading to faithful 3D reconstructions with smooth planes, correct details, and plausible spatial partition, without any ground-truth segmentation. We evaluate our method on ABC, the largest and most diverse CAD dataset, and on several typical shapes, testing geometric correctness and surface smoothness to demonstrate superiority over current alternatives for shape reconstruction.
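The core idea, two implicit networks blended by a learnable interpolation weight, can be sketched as follows; layer sizes, the SIREN frequency, and the sigmoid blending head are illustrative assumptions rather than the paper's configuration.

```python
# Minimal sketch: learnable interpolation of a ReLU MLP (smooth regions)
# and a sine-activated SIREN (fine detail) predicting one implicit field.
import torch
import torch.nn as nn

class Sine(nn.Module):
    def forward(self, x):
        return torch.sin(30.0 * x)   # SIREN's frequency scaling

def make_net(act):
    return nn.Sequential(nn.Linear(3, 64), act,
                         nn.Linear(64, 64), act,
                         nn.Linear(64, 1))

class MixNetSketch(nn.Module):
    def __init__(self):
        super().__init__()
        self.mlp = make_net(nn.ReLU())
        self.siren = make_net(Sine())
        self.alpha = make_net(nn.ReLU())  # per-point interpolation weight

    def forward(self, pts):
        a = torch.sigmoid(self.alpha(pts))
        return a * self.mlp(pts) + (1.0 - a) * self.siren(pts)

sdf = MixNetSketch()(torch.rand(1024, 3))
```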

Citations: 0
RFMNet: Robust Deep Functional Maps for unsupervised non-rigid shape correspondence
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-10-01 · Epub Date: 2023-07-29 · DOI: 10.1016/j.gmod.2023.101189
Ling Hu, Qinsong Li, Shengjun Liu, Dong-Ming Yan, Haojun Xu, Xinru Liu

In traditional deep functional maps for non-rigid shape correspondence, estimating a functional map that includes high-frequency information requires either enough linearly independent features for the least-squares solver, a requirement prone to be violated in practice (especially at an early stage of training), or costly post-processing, e.g., ZoomOut. In this paper, we propose a novel method called RFMNet (Robust Deep Functional Map Networks), which jointly considers training stability and more geometric shape features than previous works. We first produce a pointwise map directly by resorting to optimal transport and then convert it to an initial functional map. Such a mechanism relaxes the requirements on the descriptors and avoids the training instabilities caused by the least-squares solver. Benefiting from this strategy, we successfully integrate a state-of-the-art geometric regularization that further optimizes the functional map, substantially filtering the initial estimate. We show that our functional map computation module brings more stable training, even when the functional map encodes high-frequency information, as well as faster convergence. Considering both the pointwise and the functional maps, an unsupervised loss is presented that penalizes the correspondence distortion of Delta functions between shapes. To capture discretization-resistant and orientation-aware shape features, we use DiffusionNet as the feature extractor. Experimental results demonstrate a clear superiority in correspondence quality and generalization across various shape discretizations and different datasets compared with state-of-the-art learning methods.
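The conversion from a pointwise map to an initial functional map is standard and worth making explicit: given truncated Laplace-Beltrami bases, it reduces to a least-squares projection. The sketch below assumes illustrative basis and mesh sizes.

```python
# Minimal sketch: convert a pointwise map T (target vertex per source
# vertex) into a functional map C via least squares, C = pinv(Phi_X) Pi Phi_Y.
import numpy as np

def pointwise_to_functional(T, phi_x, phi_y):
    # T: n_x integer array; phi_x: n_x x k; phi_y: n_y x k.
    # phi_y[T] pulls the target basis functions back onto X through T.
    return np.linalg.pinv(phi_x) @ phi_y[T]   # k x k functional map

n_x, n_y, k = 200, 220, 20
phi_x = np.linalg.qr(np.random.randn(n_x, k))[0]   # stand-in orthonormal bases
phi_y = np.linalg.qr(np.random.randn(n_y, k))[0]
T = np.random.randint(0, n_y, size=n_x)
C = pointwise_to_functional(T, phi_x, phi_y)
```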

Citations: 0
Joint data and feature augmentation for self-supervised representation learning on point clouds
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-10-01 · Epub Date: 2023-07-28 · DOI: 10.1016/j.gmod.2023.101188
Zhuheng Lu, Yuewei Dai, Weiqing Li, Zhiyong Su

To cope with exhausting annotation effort, self-supervised representation learning from unlabeled point clouds has drawn much attention, especially augmentation-based contrastive methods. However, specific augmentations rarely transfer well to high-level tasks on different datasets, and augmentations on point clouds may also change the underlying semantics. To address these issues, we propose a simple but efficient augmentation-fusion contrastive learning framework that combines data augmentations in Euclidean space with feature augmentations in feature space. In particular, we propose a data augmentation method based on sampling and graph generation, and we design a data augmentation network that enables a correspondence of representations by maximizing consistency between augmented graph pairs. We further design a feature augmentation network that encourages the model to learn representations invariant to perturbations by perturbing the encoder. We conduct extensive object classification and object part segmentation experiments to validate the transferability of the proposed framework. Experimental results demonstrate that the framework effectively learns point cloud representations in a self-supervised manner and yields state-of-the-art results. The source code is publicly available at: https://github.com/VCG-NJUST/AFSRL.
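Frameworks of this kind typically maximize agreement between two augmented views with a contrastive objective; a common choice is the NT-Xent loss sketched below, which is assumed here rather than taken from the paper.

```python
# Minimal sketch: NT-Xent contrastive loss between two augmented views.
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, tau=0.1):
    # z1, z2: B x D embeddings of two augmentations of the same batch.
    z = F.normalize(torch.cat([z1, z2]), dim=1)   # 2B x D, unit norm
    sim = z @ z.t() / tau                         # cosine similarities
    sim.fill_diagonal_(float('-inf'))             # exclude self-pairs
    b = z1.shape[0]
    # Positive for row i is the other view of the same sample.
    targets = torch.cat([torch.arange(b, 2 * b), torch.arange(0, b)])
    return F.cross_entropy(sim, targets)

loss = nt_xent(torch.randn(8, 128), torch.randn(8, 128))
```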

Citations: 1
Unsupervised learning of style-aware facial animation from real acting performances
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-10-01 · Epub Date: 2023-09-08 · DOI: 10.1016/j.gmod.2023.101199
Wolfgang Paier, Anna Hilsmann, Peter Eisert

This paper presents a novel approach for text/speech-driven animation of a photo-realistic head model based on blend-shape geometry, dynamic textures, and neural rendering. Training a VAE for geometry and texture yields a parametric model for accurately capturing and realistically synthesizing facial expressions from a latent feature vector. Our animation method is based on a conditional CNN that transforms text or speech into a sequence of animation parameters. In contrast to previous approaches, our animation model learns to disentangle and synthesize different acting styles in an unsupervised manner, requiring only phonetic labels that describe the content of the training sequences. For realistic real-time rendering, we train a U-Net that refines rasterization-based renderings by computing improved pixel colors and a foreground matte. We compare our framework qualitatively and quantitatively against recent methods for head modeling and facial animation, and we evaluate the perceived rendering/animation quality in a user study, which indicates large improvements over state-of-the-art approaches.
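The shape of such an animation model can be sketched as a conditional temporal CNN mapping a phoneme-label sequence plus a style code to a sequence of latent animation parameters; vocabulary size, style handling, and all dimensions below are illustrative assumptions.

```python
# Hypothetical sketch: phoneme sequence + style code -> animation parameters.
import torch
import torch.nn as nn

class Text2AnimSketch(nn.Module):
    def __init__(self, n_phonemes=40, style_dim=8, latent_dim=128):
        super().__init__()
        self.embed = nn.Embedding(n_phonemes, 64)
        self.net = nn.Sequential(
            nn.Conv1d(64 + style_dim, 128, 5, padding=2), nn.ReLU(),
            nn.Conv1d(128, latent_dim, 5, padding=2),
        )

    def forward(self, phonemes, style):
        x = self.embed(phonemes).transpose(1, 2)           # B x 64 x T
        s = style[:, :, None].expand(-1, -1, x.shape[-1])  # broadcast style
        return self.net(torch.cat([x, s], dim=1))          # B x latent x T

params = Text2AnimSketch()(torch.randint(0, 40, (1, 50)), torch.randn(1, 8))
```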

Citations: 0
Unified shape and appearance reconstruction with joint camera parameter refinement
IF 1.7 · CAS Tier 4, Computer Science · Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING · Pub Date: 2023-10-01 · Epub Date: 2023-08-13 · DOI: 10.1016/j.gmod.2023.101193
Julian Kaltheuner, Patrick Stotko, Reinhard Klein

In this paper, we present an inverse rendering method for the simple reconstruction of the shape and appearance of real-world objects from only roughly calibrated RGB images captured under collocated point-light illumination. To this end, we gradually reconstruct the lower-frequency geometry using automatically generated occupancy mask images, a visual hull initialization of the mesh to infer the object topology, and a smoothness-preconditioned optimization. By combining this geometry estimation with learning-based SVBRDF parameter inference and intrinsic and extrinsic camera parameter refinement in a joint, unified formulation, our novel method is able to reconstruct shape and an isotropic SVBRDF from fewer input images than previous methods. Unlike other works, we also estimate normal maps as part of the SVBRDF to capture and represent higher-frequency geometric details in a compact way. Furthermore, by regularizing the appearance estimation with a GAN-based SVBRDF generator, we are able to meaningfully limit the solution space. In summary, this leads to a robust automatic reconstruction algorithm for shape and appearance. We evaluate our algorithm on synthetic as well as real-world data and demonstrate that it reconstructs complex objects with high-fidelity reflection properties in a robust way, even in the presence of imperfect camera parameter data.
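The collocated point-light setup is what makes this kind of capture tractable: camera and light share a position, so view and light directions coincide. The sketch below illustrates this with a Lambertian-plus-Blinn-Phong stand-in, which is a common simplification and not the paper's SVBRDF model.

```python
# Hypothetical sketch: shading one surface point under a collocated light.
import numpy as np

def shade_collocated(normal, albedo, spec, gloss, to_cam, dist):
    n = normal / np.linalg.norm(normal)
    w = to_cam / np.linalg.norm(to_cam)  # view direction == light direction
    ndotw = max(np.dot(n, w), 0.0)
    # Under collocation the half vector of (view, light) is w itself,
    # so the specular lobe also depends only on n.w.
    return (albedo / np.pi * ndotw + spec * ndotw ** gloss) / dist ** 2

rgb = shade_collocated(np.array([0.0, 0.0, 1.0]),      # surface normal
                       np.array([0.8, 0.5, 0.4]),      # diffuse albedo
                       0.3, 32.0,                      # specular, glossiness
                       np.array([0.2, 0.1, 1.0]), 1.5) # to-camera dir, distance
```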

Citations: 0