首页 > 最新文献

Computers & Graphics-Uk最新文献

英文 中文
From coin to 3D face sculpture portraits in the round of Roman emperors 从硬币到 3D 面雕,罗马皇帝的圆形肖像
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-14 DOI: 10.1016/j.cag.2024.103999
Umberto Castellani , Riccardo Bartolomioli , Giacomo Marchioro , Dario Calomino

Representing historical figures on visual media has always been a crucial aspect of political communication in the ancient world, as it is in modern society. A great example comes from ancient Rome, when the emperor’s portraits were serially replicated on visual media to disseminate his image across the countries ruled by the Romans and to assert the power and authority that he embodied by making him universally recognizable. In particular, one of the most common media through which ancient Romans spread the imperial image was coinage, which showed a bi-dimensional projection of his portrait on the very low relief produced by the impression of the coin-die. In this work, we propose a new method that uses a multi-modal 2D and 3D approach to reconstruct the full portrait in the round of Roman emperors from their images adopted on ancient coins. A well-defined pipeline is introduced from the digitization of coins using 3D scanning techniques to the estimation of the 3D model of the portrait represented by a polygonal mesh. A morphable model trained on real 3D faces is exploited to infer the morphological (i.e., geometric) characteristics of the Roman emperor from the contours extracted from a coin portrait using a model fitting procedure. We present examples of face reconstruction of different emperors from coins produced in Rome as well as in the imperial provinces, which sometimes showed local variations of the official portraits centrally designed.

在视觉媒体上表现历史人物一直是古代世界政治传播的一个重要方面,现代社会也是如此。古罗马就是一个很好的例子,当时皇帝的肖像被连续复制到视觉媒体上,以便在罗马人统治的国家中传播他的形象,并通过让人们普遍认识他来维护他所代表的权力和权威。特别是,古罗马人传播帝王形象最常用的媒介之一是钱币,钱币在钱币压模产生的非常低的浮雕上展示了帝王肖像的二维投影。在这项工作中,我们提出了一种新方法,利用多模态二维和三维方法,从古钱币上采用的罗马皇帝图像中重建圆形的完整肖像。从使用三维扫描技术对钱币进行数字化,到估算以多边形网格为代表的肖像三维模型,我们引入了一个定义明确的流水线。我们利用在真实三维人脸上训练的可变形模型,通过模型拟合程序,从钱币肖像提取的轮廓中推断出罗马皇帝的形态(即几何)特征。我们展示了从罗马和帝国行省生产的钱币中重建不同皇帝脸部的实例,这些钱币有时会显示出中央设计的官方肖像的局部变化。
{"title":"From coin to 3D face sculpture portraits in the round of Roman emperors","authors":"Umberto Castellani ,&nbsp;Riccardo Bartolomioli ,&nbsp;Giacomo Marchioro ,&nbsp;Dario Calomino","doi":"10.1016/j.cag.2024.103999","DOIUrl":"10.1016/j.cag.2024.103999","url":null,"abstract":"<div><p>Representing historical figures on visual media has always been a crucial aspect of political communication in the ancient world, as it is in modern society. A great example comes from ancient Rome, when the emperor’s portraits were serially replicated on visual media to disseminate his image across the countries ruled by the Romans and to assert the power and authority that he embodied by making him universally recognizable. In particular, one of the most common media through which ancient Romans spread the imperial image was coinage, which showed a bi-dimensional projection of his portrait on the very low relief produced by the impression of the coin-die. In this work, we propose a new method that uses a multi-modal 2D and 3D approach to reconstruct the full portrait in the round of Roman emperors from their images adopted on ancient coins. A well-defined pipeline is introduced from the digitization of coins using 3D scanning techniques to the estimation of the 3D model of the portrait represented by a polygonal mesh. A morphable model trained on real 3D faces is exploited to infer the morphological (i.e., geometric) characteristics of the Roman emperor from the contours extracted from a coin portrait using a model fitting procedure. We present examples of face reconstruction of different emperors from coins produced in Rome as well as in the imperial provinces, which sometimes showed local variations of the official portraits centrally designed.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 103999"},"PeriodicalIF":2.5,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0097849324001341/pdfft?md5=8a2768f574543214216168dfcdcc1d4c&pid=1-s2.0-S0097849324001341-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141703436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From past to present: A tertiary investigation of twenty-four years of image inpainting 从过去到现在:对二十四年图像内画的第三次调查
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-14 DOI: 10.1016/j.cag.2024.104010
Iany Macedo Barcelos, Taís Bruno Rabelo, Flavia Bernardini, Rodrigo Salvador Monteiro, Leandro Augusto Frata Fernandes

Inpainting techniques, rooted in ancient art restoration practices, have become essential tools for digital image editing in modern contexts. Despite their widespread applications across diverse domains, the rapid advance of inpainting methodologies has highlighted the need for comprehensive reviews to document progress and identify areas for deeper investigation. Although there are many works in literature describing the state of the art regarding inpainting methods, algorithms, and technologies, many of them are presented lacking methodological rigor, which compromises the reliability and validity of their conclusions. In light of the wide literature about inpainting, this tertiary review aims to systematically identify their main techniques, recurring challenges, and applications through the perspective of secondary studies, providing a helpful background for new researchers. Our findings are based on an analysis of 45 reviews, where one of the major issues observed was the lack of standardization in the classification of methods, and to address this, we provide a concise and clear classification. Furthermore, we present a summary of the most commonly used metrics and a discussion of the main shortcomings and applications, which extend beyond digital image restoration to include medical imaging, three-dimensional restoration, cultural heritage preservation, and more. While inpainting poses challenges, this review aims to inspire further exploration and advancement in the field by providing a comprehensive overview of inpainting research.

植根于古代艺术修复实践的内绘技术已成为现代数字图像编辑的重要工具。尽管它们广泛应用于各个领域,但内绘方法的快速发展凸显了对全面审查的需求,以记录进展并确定需要深入研究的领域。尽管有许多文献描述了有关内绘方法、算法和技术的最新进展,但其中许多都缺乏严谨的方法论,这就影响了其结论的可靠性和有效性。鉴于有关着色的文献众多,这篇三级综述旨在通过二次研究的视角,系统地确定其主要技术、反复出现的挑战和应用,为新的研究人员提供有用的背景资料。我们的研究结果基于对 45 篇综述的分析,其中发现的一个主要问题是方法分类缺乏标准化,为了解决这个问题,我们提供了一个简洁明了的分类。此外,我们还总结了最常用的度量方法,并讨论了主要缺点和应用领域,这些领域不仅包括数字图像修复,还包括医疗成像、三维修复、文化遗产保护等。虽然涂色带来了挑战,但本综述旨在通过对涂色研究的全面概述,激励人们进一步探索和推进该领域的研究。
{"title":"From past to present: A tertiary investigation of twenty-four years of image inpainting","authors":"Iany Macedo Barcelos,&nbsp;Taís Bruno Rabelo,&nbsp;Flavia Bernardini,&nbsp;Rodrigo Salvador Monteiro,&nbsp;Leandro Augusto Frata Fernandes","doi":"10.1016/j.cag.2024.104010","DOIUrl":"10.1016/j.cag.2024.104010","url":null,"abstract":"<div><p>Inpainting techniques, rooted in ancient art restoration practices, have become essential tools for digital image editing in modern contexts. Despite their widespread applications across diverse domains, the rapid advance of inpainting methodologies has highlighted the need for comprehensive reviews to document progress and identify areas for deeper investigation. Although there are many works in literature describing the state of the art regarding inpainting methods, algorithms, and technologies, many of them are presented lacking methodological rigor, which compromises the reliability and validity of their conclusions. In light of the wide literature about inpainting, this tertiary review aims to systematically identify their main techniques, recurring challenges, and applications through the perspective of secondary studies, providing a helpful background for new researchers. Our findings are based on an analysis of 45 reviews, where one of the major issues observed was the lack of standardization in the classification of methods, and to address this, we provide a concise and clear classification. Furthermore, we present a summary of the most commonly used metrics and a discussion of the main shortcomings and applications, which extend beyond digital image restoration to include medical imaging, three-dimensional restoration, cultural heritage preservation, and more. While inpainting poses challenges, this review aims to inspire further exploration and advancement in the field by providing a comprehensive overview of inpainting research.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 104010"},"PeriodicalIF":2.5,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141698158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational design of custom-fit PAP masks 量身定制 PAP 面罩的计算设计
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-09 DOI: 10.1016/j.cag.2024.103998
Yukun Lu , Yuhang Wang , Peng Song , Hang Siang Wong , Yingjuan Mok , Ligang Liu

Positive airway pressure (PAP) therapy refers to sleep disordered breathing treatment that uses a stream of compressed air to support the airway during sleep. Even though the use of PAP therapy has been shown to be effective in improving the symptoms and quality of life, many patients are intolerant of the treatment due to poor mask fit. In this paper, our goal is to develop a computational approach for designing custom-fit PAP masks such that they can achieve better mask fit performance in terms of mask leakage and comfort. Our key observation is that a custom-fit PAP mask should fit a patient’s face in its deformed state instead of in its rest state since the PAP mask cushion undergoes notable deformation before reaching an equilibrium state during PAP therapy. To this end, we compute the equilibrium state of a mask cushion using the finite element method, and quantitatively measure the leakage and comfort of the mask cushion in this state. We further optimize the mask cushion geometry to minimize the two measures while ensuring that the cushion can be easily fabricated with molding. We demonstrate the effectiveness of our computational approach on a variety of face models and different types of PAP masks. Experimental results on real subjects show that our designed custom-fit PAP masks are able to achieve better mask fit performance than a generic PAP mask and custom-fit PAP masks designed by a state-of-the-art approach.

气道正压疗法(PAP)是指在睡眠过程中使用压缩气流支撑气道的睡眠呼吸紊乱治疗方法。尽管气道正压疗法已被证明能有效改善症状和提高生活质量,但许多患者因面罩不合适而无法忍受这种治疗。在本文中,我们的目标是开发一种计算方法,用于设计量身定制的呼吸机面罩,使其在面罩泄漏和舒适度方面达到更好的面罩适配性能。我们的主要观点是,定制合适的通气面罩应在变形状态而非静止状态下与患者的脸部贴合,因为在通气治疗过程中,通气面罩垫在达到平衡状态之前会发生明显的变形。为此,我们使用有限元法计算了面罩垫的平衡状态,并定量测量了面罩垫在此状态下的泄漏和舒适度。我们进一步优化了面罩垫的几何形状,使这两项测量结果最小化,同时确保面罩垫易于成型。我们在各种面部模型和不同类型的 PAP 面罩上演示了计算方法的有效性。真实受试者的实验结果表明,与普通 PAP 面具和最先进方法设计的定制 PAP 面具相比,我们设计的定制贴合 PAP 面具能够实现更好的面罩贴合性能。
{"title":"Computational design of custom-fit PAP masks","authors":"Yukun Lu ,&nbsp;Yuhang Wang ,&nbsp;Peng Song ,&nbsp;Hang Siang Wong ,&nbsp;Yingjuan Mok ,&nbsp;Ligang Liu","doi":"10.1016/j.cag.2024.103998","DOIUrl":"https://doi.org/10.1016/j.cag.2024.103998","url":null,"abstract":"<div><p>Positive airway pressure (PAP) therapy refers to sleep disordered breathing treatment that uses a stream of compressed air to support the airway during sleep. Even though the use of PAP therapy has been shown to be effective in improving the symptoms and quality of life, many patients are intolerant of the treatment due to poor mask fit. In this paper, our goal is to develop a computational approach for designing custom-fit PAP masks such that they can achieve better mask fit performance in terms of mask leakage and comfort. Our key observation is that a custom-fit PAP mask should fit a patient’s face in its deformed state instead of in its rest state since the PAP mask cushion undergoes notable deformation before reaching an equilibrium state during PAP therapy. To this end, we compute the equilibrium state of a mask cushion using the finite element method, and quantitatively measure the leakage and comfort of the mask cushion in this state. We further optimize the mask cushion geometry to minimize the two measures while ensuring that the cushion can be easily fabricated with molding. We demonstrate the effectiveness of our computational approach on a variety of face models and different types of PAP masks. Experimental results on real subjects show that our designed custom-fit PAP masks are able to achieve better mask fit performance than a generic PAP mask and custom-fit PAP masks designed by a state-of-the-art approach.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"122 ","pages":"Article 103998"},"PeriodicalIF":2.5,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141607447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SketchCleanGAN: A generative network to enhance and correct query sketches for improving 3D CAD model retrieval systems SketchCleanGAN:为改进 3D CAD 模型检索系统而增强和纠正查询草图的生成网络
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-09 DOI: 10.1016/j.cag.2024.104000
Kamalesh Kumar Kosalaraman, Prasad Pralhad Kendre, Raghwani Dhaval Manilal, Ramanathan Muthuganapathy

Given an input query, a search and retrieval system fetches relevant information from a dataset. In the Engineering domain, such a system is beneficial for tasks such as design reuse. A two-dimensional (2D) sketch is more conducive for an end user to give as a query than a three-dimensional (3D) object. Such query sketches, nevertheless, will inevitably contain defects like incomplete lines, mesh lines, overdrawn areas, missing areas, etc. Since a retrieval system’s results are only as good as the query, it is necessary to improve the query sketches.

In this paper, the problem of transforming a defective CAD sketch into a defect-free sketch is addressed using Generative Adversarial Networks (GANs), which, to the best of our knowledge, has not been investigated before. We first create a dataset of 534 hand-drawn sketches by tracing the boundaries of images of CAD models. We then pair the corrected sketches with their corresponding defective sketches and use them for training a C-WGAN (Conditional Wasserstein Generative Adversarial Network), called SketchCleanGAN. We model the transformation from defective to defect-free sketch as a factorization of the defective input sketch and then translate it to the space of defect-free sketch. We propose a three-branch strategy to this problem. Ablation studies and comparisons with other state-of-the-art techniques demonstrate the efficacy of the proposed technique. Additionally, we also contribute to a dataset of around 58000 improved sketches using the proposed framework.

输入查询后,搜索和检索系统会从数据集中获取相关信息。在工程领域,这样的系统有利于设计再利用等任务。与三维(3D)对象相比,二维(2D)草图更有利于终端用户进行查询。然而,这种查询草图不可避免地会包含一些缺陷,如线条不完整、网状线条、多画区域、缺失区域等。本文利用生成对抗网络(GANs)来解决将有缺陷的 CAD 草图转化为无缺陷草图的问题。我们首先通过追踪 CAD 模型图像的边界创建了一个包含 534 幅手绘草图的数据集。然后,我们将修正后的草图与相应的缺陷草图配对,并用它们来训练一个名为 SketchCleanGAN 的 C-WGAN(条件 Wasserstein 生成对抗网络)。我们将从有缺陷草图到无缺陷草图的转换建模为有缺陷输入草图的因子化,然后将其转换到无缺陷草图空间。我们针对这一问题提出了三分支策略。消融研究以及与其他最先进技术的比较证明了所提技术的有效性。此外,我们还利用所提出的框架建立了一个包含约 58000 个改进草图的数据集。
{"title":"SketchCleanGAN: A generative network to enhance and correct query sketches for improving 3D CAD model retrieval systems","authors":"Kamalesh Kumar Kosalaraman,&nbsp;Prasad Pralhad Kendre,&nbsp;Raghwani Dhaval Manilal,&nbsp;Ramanathan Muthuganapathy","doi":"10.1016/j.cag.2024.104000","DOIUrl":"10.1016/j.cag.2024.104000","url":null,"abstract":"<div><p>Given an input query, a search and retrieval system fetches relevant information from a dataset. In the Engineering domain, such a system is beneficial for tasks such as design reuse. A two-dimensional (2D) sketch is more conducive for an end user to give as a query than a three-dimensional (3D) object. Such query sketches, nevertheless, will inevitably contain defects like incomplete lines, mesh lines, overdrawn areas, missing areas, etc. Since a retrieval system’s results are only as good as the query, it is necessary to improve the query sketches.</p><p>In this paper, the problem of transforming a defective CAD sketch into a defect-free sketch is addressed using Generative Adversarial Networks (GANs), which, to the best of our knowledge, has not been investigated before. We first create a dataset of 534 hand-drawn sketches by tracing the boundaries of images of CAD models. We then pair the corrected sketches with their corresponding defective sketches and use them for training a C-WGAN (Conditional Wasserstein Generative Adversarial Network), called SketchCleanGAN. We model the transformation from defective to defect-free sketch as a factorization of the defective input sketch and then translate it to the space of defect-free sketch. We propose a three-branch strategy to this problem. Ablation studies and comparisons with other state-of-the-art techniques demonstrate the efficacy of the proposed technique. Additionally, we also contribute to a dataset of around 58000 improved sketches using the proposed framework.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 104000"},"PeriodicalIF":2.5,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141694929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quadratic-attraction subdivision with contraction-ratio λ=12 收缩比 λ=1 的二次吸引细分</mml:mro
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-09 DOI: 10.1016/j.cag.2024.104001
Kȩstutis Karčiauskas , Jörg Peters

Classic generalized subdivision, such as Catmull–Clark subdivision, as well as recent subdivision algorithms for high-quality surfaces, rely on slower convergence towards extraordinary points for mesh nodes surrounded by n>4 quadrilaterals. Slow convergence corresponds to a contraction-ratio of λ>0.5. To improve shape, prevent parameterization discordant with surface growth, or to improve convergence in isogeometric analysis near extraordinary points, a number of algorithms explicitly adjust λ by altering refinement rules. However, such tuning of λ has so far led to poorer surface quality, visible as uneven distribution or oscillation of highlight lines. The recent Quadratic-Attraction Subdivision (QAS) generates high-quality, bounded curvature surfaces based on a careful choice of quadratic expansion at the central point and, just like Catmull–Clark subdivision, creates the control points of the next subdivision ring by matrix multiplication. But QAS shares the contraction-ratio λCC>1/2 of Catmull–Clark subdivision when n>4. For n=5,,10, QAS+ improves the convergence to the uniform λ=12 of binary domain refinement and without sacrificing surface quality compared to QAS.

经典的广义细分(如 Catmull-Clark 细分)以及最新的高质量曲面细分算法,都依赖于由 n>4 个四边形包围的网格节点向非常点的缓慢收敛。缓慢收敛相当于 λ>0.5 的收缩比。为了改善形状,防止参数化与表面生长不协调,或改善非常点附近等几何分析的收敛性,许多算法通过改变细化规则来明确调整 λ。然而,迄今为止,这种对 λ 的调整导致了较差的表面质量,表现为高亮线的不均匀分布或摆动。最近的二次吸引细分法(QAS)基于对中心点二次展开的谨慎选择,生成高质量、有约束曲率的曲面,并且与 Catmull-Clark 细分法一样,通过矩阵乘法创建下一个细分环的控制点。但当 n>4 时,QAS 与 Catmull-Clark 细分的收缩率 λCC>1/2 相同。当 n=5,...,10 时,与 QAS 相比,QAS+ 提高了对二元域细化的统一 λ=12 的收敛性,且不影响曲面质量。
{"title":"Quadratic-attraction subdivision with contraction-ratio λ=12","authors":"Kȩstutis Karčiauskas ,&nbsp;Jörg Peters","doi":"10.1016/j.cag.2024.104001","DOIUrl":"10.1016/j.cag.2024.104001","url":null,"abstract":"<div><p>Classic generalized subdivision, such as Catmull–Clark subdivision, as well as recent subdivision algorithms for high-quality surfaces, rely on slower convergence towards extraordinary points for mesh nodes surrounded by <span><math><mrow><mi>n</mi><mo>&gt;</mo><mn>4</mn></mrow></math></span> quadrilaterals. Slow convergence corresponds to a contraction-ratio of <span><math><mrow><mi>λ</mi><mo>&gt;</mo><mn>0</mn><mo>.</mo><mn>5</mn></mrow></math></span>. To improve shape, prevent parameterization discordant with surface growth, or to improve convergence in isogeometric analysis near extraordinary points, a number of algorithms explicitly adjust <span><math><mi>λ</mi></math></span> by altering refinement rules. However, such tuning of <span><math><mi>λ</mi></math></span> has so far led to poorer surface quality, visible as uneven distribution or oscillation of highlight lines. The recent Quadratic-Attraction Subdivision (QAS) generates high-quality, bounded curvature surfaces based on a careful choice of quadratic expansion at the central point and, just like Catmull–Clark subdivision, creates the control points of the next subdivision ring by matrix multiplication. But QAS shares the contraction-ratio <span><math><mrow><msub><mrow><mi>λ</mi></mrow><mrow><mi>C</mi><mi>C</mi></mrow></msub><mo>&gt;</mo><mn>1</mn><mo>/</mo><mn>2</mn></mrow></math></span> of Catmull–Clark subdivision when <span><math><mrow><mi>n</mi><mo>&gt;</mo><mn>4</mn></mrow></math></span>. For <span><math><mrow><mi>n</mi><mo>=</mo><mn>5</mn><mo>,</mo><mo>…</mo><mo>,</mo><mn>10</mn></mrow></math></span>, QAS<span><math><msub><mrow></mrow><mrow><mo>+</mo></mrow></msub></math></span> improves the convergence to the uniform <span><math><mrow><mi>λ</mi><mo>=</mo><mfrac><mrow><mn>1</mn></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></math></span> of binary domain refinement and without sacrificing surface quality compared to QAS.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 104001"},"PeriodicalIF":2.5,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141690207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural inpainting of folded fabrics with interactive editing 通过交互式编辑对折叠织物进行神经绘制
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-08 DOI: 10.1016/j.cag.2024.103997
Guillaume Gisbert, Raphaëlle Chaine, David Coeurjolly

We propose a deep learning approach for inpainting holes in digital models of fabric surfaces. Leveraging the developable nature of fabric surfaces, we flatten the area surrounding the holes with minor distortion and regularly sample it to obtain a discrete 2D map of the 3D embedding, with an indicator mask outlining holes locations. This enables the use of a standard 2D convolutional neural network to inpaint holes given the 3D positioning of the surface. The provided neural architecture includes an attention mechanism to capture long-range relationships on the surface. Finally, we provide ScarfFolds, a database of folded fabrics patches with varying complexity, which is used to train our convolutional network in a supervised manner. We successfully tested our approach on various examples and illustrated that previous 3D deep learning approaches suffer from several issues when applied to fabrics. Also, our method allows the users to interact with the construction of the inpainted surface. The editing is interactive and supports many tools like vertex grabbing, drape twisting or pinching.

我们提出了一种深度学习方法,用于对织物表面数字模型中的孔洞进行内绘。利用织物表面的可开发性,我们对孔洞周围的区域进行了轻微变形,并对其进行定期采样,以获得三维嵌入的离散二维地图,并用指示掩膜勾勒出孔洞位置。这样就可以使用标准的二维卷积神经网络,根据表面的三维定位来绘制孔洞。所提供的神经架构包括一种注意力机制,用于捕捉表面上的长程关系。最后,我们提供了 ScarfFolds 数据库,这是一个具有不同复杂度的折叠织物补丁数据库,用于以监督方式训练我们的卷积网络。我们成功地在各种示例上测试了我们的方法,并说明了之前的三维深度学习方法在应用于织物时存在一些问题。此外,我们的方法还允许用户与上色表面的构造进行交互。编辑是交互式的,支持许多工具,如抓取顶点、扭曲或捏合。
{"title":"Neural inpainting of folded fabrics with interactive editing","authors":"Guillaume Gisbert,&nbsp;Raphaëlle Chaine,&nbsp;David Coeurjolly","doi":"10.1016/j.cag.2024.103997","DOIUrl":"10.1016/j.cag.2024.103997","url":null,"abstract":"<div><p>We propose a deep learning approach for inpainting holes in digital models of fabric surfaces. Leveraging the developable nature of fabric surfaces, we flatten the area surrounding the holes with minor distortion and regularly sample it to obtain a discrete 2D map of the 3D embedding, with an indicator mask outlining holes locations. This enables the use of a standard 2D convolutional neural network to inpaint holes given the 3D positioning of the surface. The provided neural architecture includes an attention mechanism to capture long-range relationships on the surface. Finally, we provide <em>ScarfFolds</em>, a database of folded fabrics patches with varying complexity, which is used to train our convolutional network in a supervised manner. We successfully tested our approach on various examples and illustrated that previous 3D deep learning approaches suffer from several issues when applied to fabrics. Also, our method allows the users to interact with the construction of the inpainted surface. The editing is interactive and supports many tools like vertex grabbing, drape twisting or pinching.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"122 ","pages":"Article 103997"},"PeriodicalIF":2.5,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SOA: Seed point offset attention for indoor 3D object detection in point clouds SOA:用于点云室内 3D 物体检测的种子点偏移注意事项
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-06 DOI: 10.1016/j.cag.2024.103992
Jun Shu , Shiqi Yu , Xinyi Shu , Jiewen Hu

Three-dimensional object detection plays a pivotal role in scene understanding and holds significant importance in various indoor perception applications. Traditional methods based on Hough voting are susceptible to interference from background points or neighboring objects when casting votes for the target’s center from each seed point. Moreover, fixed-size set abstraction modules may result in the loss of structural information for large objects. To address these challenges, this paper proposes a three-dimensional object detection model based on seed point offset attention. The objective of this model is to enhance the model’s resilience to voting noise interference and alleviate feature loss for large-scale objects. Specifically, a seed point offset tensor is first defined, and then the offset tensor self-attention network is employed to learn the weights between votes, thereby establishing a correlation between the voting semantic features and the object structural information. Furthermore, an object surface perception module is introduced, which incorporates detailed features of local object surfaces into global feature representations through vote backtracking and surface mapping. Experimental results indicate that the model achieved excellent performance on the ScanNet-V2 ([email protected], 60.3%) and SUN RGB-D ([email protected], 64.0%) datasets, respectively improving by 2.6% ([email protected]) and 5.4% ([email protected]) compared to VoteNet.

三维物体检测在场景理解中起着关键作用,在各种室内感知应用中具有重要意义。传统的基于 Hough 投票的方法在对每个种子点的目标中心进行投票时,容易受到背景点或邻近物体的干扰。此外,固定大小的集合抽象模块可能会导致大型物体结构信息的丢失。为了应对这些挑战,本文提出了一种基于种子点偏移注意力的三维物体检测模型。该模型的目标是增强模型对投票噪声干扰的抗干扰能力,并减少大型物体的特征丢失。具体来说,首先定义种子点偏移张量,然后利用偏移张量自注意力网络学习投票之间的权重,从而建立投票语义特征与物体结构信息之间的相关性。此外,还引入了物体表面感知模块,通过投票回溯和表面映射,将局部物体表面的细节特征纳入全局特征表征。实验结果表明,该模型在 ScanNet-V2 数据集([email protected],60.3%)和 SUN RGB-D 数据集([email protected],64.0%)上取得了优异的性能,与 VoteNet 相比,分别提高了 2.6% ([email protected])和 5.4% ([email protected])。
{"title":"SOA: Seed point offset attention for indoor 3D object detection in point clouds","authors":"Jun Shu ,&nbsp;Shiqi Yu ,&nbsp;Xinyi Shu ,&nbsp;Jiewen Hu","doi":"10.1016/j.cag.2024.103992","DOIUrl":"10.1016/j.cag.2024.103992","url":null,"abstract":"<div><p>Three-dimensional object detection plays a pivotal role in scene understanding and holds significant importance in various indoor perception applications. Traditional methods based on Hough voting are susceptible to interference from background points or neighboring objects when casting votes for the target’s center from each seed point. Moreover, fixed-size set abstraction modules may result in the loss of structural information for large objects. To address these challenges, this paper proposes a three-dimensional object detection model based on seed point offset attention. The objective of this model is to enhance the model’s resilience to voting noise interference and alleviate feature loss for large-scale objects. Specifically, a seed point offset tensor is first defined, and then the offset tensor self-attention network is employed to learn the weights between votes, thereby establishing a correlation between the voting semantic features and the object structural information. Furthermore, an object surface perception module is introduced, which incorporates detailed features of local object surfaces into global feature representations through vote backtracking and surface mapping. Experimental results indicate that the model achieved excellent performance on the ScanNet-V2 ([email protected], 60.3%) and SUN RGB-D ([email protected], 64.0%) datasets, respectively improving by 2.6% ([email protected]) and 5.4% ([email protected]) compared to VoteNet.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 103992"},"PeriodicalIF":2.5,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141708916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Foreword to the special section on Spanish Computer Graphics Conference 2024 2024 年西班牙计算机图形大会特别部分前言
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-06 DOI: 10.1016/j.cag.2024.103995
Ana Serrano, Gustavo Patow, Julio Marco
{"title":"Foreword to the special section on Spanish Computer Graphics Conference 2024","authors":"Ana Serrano,&nbsp;Gustavo Patow,&nbsp;Julio Marco","doi":"10.1016/j.cag.2024.103995","DOIUrl":"10.1016/j.cag.2024.103995","url":null,"abstract":"","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 103995"},"PeriodicalIF":2.5,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141710884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamics simulation-based packing of irregular 3D objects 基于动力学模拟的不规则三维物体包装
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-06 DOI: 10.1016/j.cag.2024.103996
Qiubing Zhuang , Zhonggui Chen , Keyu He , Juan Cao , Wenping Wang

The 3D packing problem has a wide range of applications. However, the complex geometry of irregular objects leads to a sharp increase in the number of placement combinations, making it a challenging problem. In this paper, we propose a packing pipeline based on rigid body dynamics simulation to deal with two types of 3D packing problems. One is the variant bin packing problem, which involves placing more objects into a container of given dimensions to maximize space utilization. The other is the open dimension problem, where the goal is to minimize the container that can accommodate all objects. We first use heuristic placement strategies and a fast collision detection algorithm to efficiently obtain initial packing results. Then, we simulate the shaking of the container according to the dynamic principle. Combined with the vacant space filling operation, shaking the container drives the movement of objects in the container to make the arrangement of objects more compact. For the open dimension packing, the container height is optimized by adjusting the constraints of simulation in the basic pipeline. Experimental results show that our method has advantages over existing methods in both speed and packing density.

三维包装问题应用广泛。然而,不规则物体的复杂几何形状导致摆放组合的数量急剧增加,使其成为一个具有挑战性的问题。在本文中,我们提出了一种基于刚体动力学模拟的包装流水线,用于处理两类三维包装问题。一种是变体箱包装问题,即在给定尺寸的容器中放置更多物品,以最大限度地利用空间。另一种是开放尺寸问题,目标是最小化可容纳所有物体的容器。我们首先使用启发式摆放策略和快速碰撞检测算法,高效地获得初始包装结果。然后,我们根据动态原理模拟容器的晃动。结合空位填充操作,容器的晃动会带动容器内物体的移动,从而使物体的排列更加紧凑。对于开放尺寸的包装,通过调整基本流水线中的模拟约束条件来优化容器高度。实验结果表明,与现有方法相比,我们的方法在速度和包装密度方面都具有优势。
{"title":"Dynamics simulation-based packing of irregular 3D objects","authors":"Qiubing Zhuang ,&nbsp;Zhonggui Chen ,&nbsp;Keyu He ,&nbsp;Juan Cao ,&nbsp;Wenping Wang","doi":"10.1016/j.cag.2024.103996","DOIUrl":"10.1016/j.cag.2024.103996","url":null,"abstract":"<div><p>The 3D packing problem has a wide range of applications. However, the complex geometry of irregular objects leads to a sharp increase in the number of placement combinations, making it a challenging problem. In this paper, we propose a packing pipeline based on rigid body dynamics simulation to deal with two types of 3D packing problems. One is the variant bin packing problem, which involves placing more objects into a container of given dimensions to maximize space utilization. The other is the open dimension problem, where the goal is to minimize the container that can accommodate all objects. We first use heuristic placement strategies and a fast collision detection algorithm to efficiently obtain initial packing results. Then, we simulate the shaking of the container according to the dynamic principle. Combined with the vacant space filling operation, shaking the container drives the movement of objects in the container to make the arrangement of objects more compact. For the open dimension packing, the container height is optimized by adjusting the constraints of simulation in the basic pipeline. Experimental results show that our method has advantages over existing methods in both speed and packing density.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"123 ","pages":"Article 103996"},"PeriodicalIF":2.5,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141709975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fine-tuning 3D foundation models for geometric object retrieval 微调用于几何物体检索的 3D 基础模型
IF 2.5 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING Pub Date : 2024-07-03 DOI: 10.1016/j.cag.2024.103993
Jarne Van den Herrewegen , Tom Tourwé , Maks Ovsjanikov , Francis wyffels

Foundation models, such as ULIP-2 (Xue et al., 2023) recently projected forward the field of 3D deep learning. These models are trained with significantly more data and show superior representation learning capacity in many downstream tasks like 3D shape classification and few-shot part segmentation.

A particular characteristic of the recent 3D foundation models is that they are typically multi-modal, and involve image (2D) as well as caption (text) branches. This leads to an intricate interplay that benefits all modalities. At the same time, the nature of the 3D encoders alone, involved in these foundation models is not well-understood. Specifically, there is little analysis on the utility of both pre-trained 3D features provided by these models, or their capacity to adapt to new downstream 3D data. Furthermore, existing studies typically focus on label-oriented downstream tasks, such as shape classification, and ignore other critical applications, such as 3D content-based object retrieval.

In this paper, we fill this gap and show, for the first time, how 3D foundation models can be leveraged for strong 3D-to-3D retrieval performance on seven different datasets, on par with state-of-the-art view-based architectures. We evaluate both the pre-trained foundation models, as well as their fine-tuned versions using downstream data. We compare supervised fine-tuning using classification labels against two self-supervised label-free fine-tuning methods. Importantly, we introduce and describe a methodology for fine-tuning, as we found this to be crucial to make transfer learning from 3D foundation models work in a stable manner.

最近,ULIP-2(Xue 等人,2023 年)等基础模型推动了三维深度学习领域的发展。这些模型在训练时使用了大量数据,并在许多下游任务(如三维形状分类和少镜头部件分割)中表现出卓越的表征学习能力。最近的三维基础模型的一个特点是它们通常是多模态的,涉及图像(二维)和标题(文本)分支。这就导致了错综复杂的相互作用,使所有模式都能从中受益。与此同时,人们对这些基础模型所涉及的三维编码器本身的性质还不甚了解。具体来说,对于这些模型所提供的预训练三维特征的效用,或其适应新的下游三维数据的能力,几乎没有分析。在本文中,我们填补了这一空白,并首次展示了如何利用三维基础模型在七个不同的数据集上实现强大的三维到三维检索性能,与最先进的基于视图的架构不相上下。我们使用下游数据对预训练基础模型及其微调版本进行了评估。我们将使用分类标签的监督微调与两种自监督无标签微调方法进行了比较。重要的是,我们引入并描述了一种微调方法,因为我们发现这对于从三维基础模型中稳定地进行迁移学习至关重要。
{"title":"Fine-tuning 3D foundation models for geometric object retrieval","authors":"Jarne Van den Herrewegen ,&nbsp;Tom Tourwé ,&nbsp;Maks Ovsjanikov ,&nbsp;Francis wyffels","doi":"10.1016/j.cag.2024.103993","DOIUrl":"https://doi.org/10.1016/j.cag.2024.103993","url":null,"abstract":"<div><p>Foundation models, such as ULIP-2 (Xue et al., 2023) recently projected forward the field of 3D deep learning. These models are trained with significantly more data and show superior representation learning capacity in many downstream tasks like 3D shape classification and few-shot part segmentation.</p><p>A particular characteristic of the recent 3D foundation models is that they are typically <em>multi-modal</em>, and involve image (2D) as well as caption (text) branches. This leads to an intricate interplay that benefits all modalities. At the same time, the nature of the <em>3D</em> encoders alone, involved in these foundation models is not well-understood. Specifically, there is little analysis on the utility of both pre-trained 3D features provided by these models, or their capacity to adapt to new downstream 3D data. Furthermore, existing studies typically focus on label-oriented downstream tasks, such as shape classification, and ignore other critical applications, such as 3D content-based object retrieval.</p><p>In this paper, we fill this gap and show, for the first time, how 3D foundation models can be leveraged for strong 3D-to-3D retrieval performance on seven different datasets, on par with state-of-the-art view-based architectures. We evaluate both the pre-trained foundation models, as well as their fine-tuned versions using downstream data. We compare supervised fine-tuning using classification labels against two self-supervised label-free fine-tuning methods. Importantly, we introduce and describe a methodology for fine-tuning, as we found this to be crucial to make transfer learning from 3D foundation models work in a stable manner.</p></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"122 ","pages":"Article 103993"},"PeriodicalIF":2.5,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0097849324001286/pdfft?md5=9cb01c40df89ca64e783dcd0f63e3f33&pid=1-s2.0-S0097849324001286-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141592876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computers & Graphics-Uk
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1