首页 > 最新文献

Journal of perceptual imaging最新文献

英文 中文
The Impact of Adaptation Time in High Dynamic Range Luminance Transitions 高动态范围亮度转换中适应时间的影响
Pub Date : 2024-03-01 DOI: 10.2352/j.percept.imaging.2024.7.000401
Jake Zuena, Jaclyn Pytlarz
. Modern production and distribution workflows have allowed for high dynamic range (HDR) imagery to become widespread. It has made a positive impact in the creative industry and improved image quality on consumer devices. Akin to the dynamics of loudness in audio, it is predicted that the increased luminance range allowed by HDR ecosystems could introduce unintended, high-magnitude changes. These luminance changes could occur at program transitions, advertisement insertions, and channel change operations. In this article, we present findings from a psychophysical experiment conducted to evaluate three components of HDR luminance changes: the magnitude of the change, the direction of the change (darker or brighter), and the adaptation time. Results confirm that all three components exert significant influence. We find that increasing either the magnitude of the luminance or the adaptation time results in more discomfort at the unintended transition. We find that transitioning from brighter to darker stimuli has a non-linear relationship with adaptation time, falling off steeply with very short durations.
.现代化的制作和发行流程使高动态范围 (HDR) 图像得以普及。它对创意产业产生了积极影响,并改善了消费设备的图像质量。与音频中响度的动态变化类似,HDR 生态系统允许增加的亮度范围可能会带来意想不到的高幅度变化。这些亮度变化可能发生在节目转换、广告插入和频道转换操作中。在本文中,我们介绍了一项心理物理实验的结果,该实验评估了 HDR 亮度变化的三个组成部分:变化幅度、变化方向(更暗或更亮)以及适应时间。结果表明,这三个因素都产生了显著的影响。我们发现,增加亮度或适应时间都会导致在非预期过渡时产生更多不适。我们发现,从较亮刺激过渡到较暗刺激与适应时间呈非线性关系,适应时间越短,不适感越强。
{"title":"The Impact of Adaptation Time in High Dynamic Range Luminance Transitions","authors":"Jake Zuena, Jaclyn Pytlarz","doi":"10.2352/j.percept.imaging.2024.7.000401","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2024.7.000401","url":null,"abstract":". Modern production and distribution workflows have allowed for high dynamic range (HDR) imagery to become widespread. It has made a positive impact in the creative industry and improved image quality on consumer devices. Akin to the dynamics of loudness in audio, it is predicted that the increased luminance range allowed by HDR ecosystems could introduce unintended, high-magnitude changes. These luminance changes could occur at program transitions, advertisement insertions, and channel change operations. In this article, we present findings from a psychophysical experiment conducted to evaluate three components of HDR luminance changes: the magnitude of the change, the direction of the change (darker or brighter), and the adaptation time. Results confirm that all three components exert significant influence. We find that increasing either the magnitude of the luminance or the adaptation time results in more discomfort at the unintended transition. We find that transitioning from brighter to darker stimuli has a non-linear relationship with adaptation time, falling off steeply with very short durations.","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"688 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140273230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Coding in Human Vision as a Useful Bias in Computer Vision and Machine Learning 人类视觉中的高效编码在计算机视觉和机器学习中的应用
Pub Date : 2023-09-01 DOI: 10.2352/j.percept.imaging.2023.6.000402
Philipp Grüning, Erhardt Barth
Interdisciplinary research in human vision has greatly contributed to the current state-of-the-art in computer vision and machine learning starting with low-level topics such as image compression and image quality assessment up to complex neural networks for object recognition. Representations similar to those in the primary visual cortex are frequently employed, e.g., linear filters in image compression and deep neural networks. Here, we first review particular nonlinear visual representations that can be used to better understand human vision and provide efficient representations for computer vision including deep neural networks. We then focus on i2D representations that are related to end-stopped neurons. The resulting E-nets are deep convolutional networks, which outperform some state-of-the-art deep networks. Finally, we show that the performance of E-nets can be further improved by using genetic algorithms to optimize the architecture of the network.
人类视觉的跨学科研究极大地促进了当前计算机视觉和机器学习的发展,从图像压缩和图像质量评估等低级主题开始,一直到用于对象识别的复杂神经网络。与初级视觉皮层类似的表征经常被使用,例如,图像压缩中的线性滤波器和深度神经网络。在这里,我们首先回顾了可以用来更好地理解人类视觉的特定非线性视觉表示,并为包括深度神经网络在内的计算机视觉提供有效的表示。然后我们将重点放在与末端停止神经元相关的i2D表征上。由此产生的E-nets是深度卷积网络,其性能优于一些最先进的深度网络。最后,我们证明了使用遗传算法优化网络结构可以进一步提高E-nets的性能。
{"title":"Efficient Coding in Human Vision as a Useful Bias in Computer Vision and Machine Learning","authors":"Philipp Grüning, Erhardt Barth","doi":"10.2352/j.percept.imaging.2023.6.000402","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2023.6.000402","url":null,"abstract":"Interdisciplinary research in human vision has greatly contributed to the current state-of-the-art in computer vision and machine learning starting with low-level topics such as image compression and image quality assessment up to complex neural networks for object recognition. Representations similar to those in the primary visual cortex are frequently employed, e.g., linear filters in image compression and deep neural networks. Here, we first review particular nonlinear visual representations that can be used to better understand human vision and provide efficient representations for computer vision including deep neural networks. We then focus on i2D representations that are related to end-stopped neurons. The resulting E-nets are deep convolutional networks, which outperform some state-of-the-art deep networks. Finally, we show that the performance of E-nets can be further improved by using genetic algorithms to optimize the architecture of the network.","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135346858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pictures: Crafting and Beholding 图片:制作和观看
Pub Date : 2023-07-01 DOI: 10.2352/j.percept.imaging.2023.6.000401
J. Koenderink, Andrea van Doorn
. The psychogenesis of visual awareness is an autonomous process in the sense that you do not “do” it. However, you have some control due to your acting in the world. We share this process with many animals. Pictorial awareness appears to be truly human. Here situational awareness splits into an “everyday vision” and a “pictorial” mode. Here we focus mainly on spatial aspects of pictorial art. You have no control whatever over the picture’s structure. The pictorial awareness is pure imagery, constrained by the (physical) structure of the picture. Crafting pictures and beholding pictures are distinct, but closely related, acts. We present an account from experimental and formal phenomenology. It results in a generic model that accounts for the bulk of formal (rare) and informal (common) observations.
{"title":"Pictures: Crafting and Beholding","authors":"J. Koenderink, Andrea van Doorn","doi":"10.2352/j.percept.imaging.2023.6.000401","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2023.6.000401","url":null,"abstract":". The psychogenesis of visual awareness is an autonomous process in the sense that you do not “do” it. However, you have some control due to your acting in the world. We share this process with many animals. Pictorial awareness appears to be truly human. Here situational awareness splits into an “everyday vision” and a “pictorial” mode. Here we focus mainly on spatial aspects of pictorial art. You have no control whatever over the picture’s structure. The pictorial awareness is pure imagery, constrained by the (physical) structure of the picture. Crafting pictures and beholding pictures are distinct, but closely related, acts. We present an account from experimental and formal phenomenology. It results in a generic model that accounts for the bulk of formal (rare) and informal (common) observations.","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68835405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transparency and Translucency in Visual Appearance of Light-Permeable Materials 透光材料视觉外观的透明与半透明
Pub Date : 2022-09-01 DOI: 10.2352/j.percept.imaging.2022.5.000409
Davit Gigilashvili, Tawsin Uddin Ahmed
{"title":"Transparency and Translucency in Visual Appearance of Light-Permeable Materials","authors":"Davit Gigilashvili, Tawsin Uddin Ahmed","doi":"10.2352/j.percept.imaging.2022.5.000409","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2022.5.000409","url":null,"abstract":"","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"20 1","pages":"000409-1"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81438272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Natural Scene Statistics and Distance Perception: Ground Surface and Non-ground Objects 自然场景统计和距离感知:地面和非地面物体
Pub Date : 2022-07-01 DOI: 10.2352/j.percept.imaging.2022.5.000503
Xavier Morin-Duchesne, M. Langer
{"title":"Natural Scene Statistics and Distance Perception: Ground Surface and Non-ground Objects","authors":"Xavier Morin-Duchesne, M. Langer","doi":"10.2352/j.percept.imaging.2022.5.000503","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2022.5.000503","url":null,"abstract":"","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"1 1","pages":"1-12"},"PeriodicalIF":0.0,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45369814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From the Special Issue Guest Editors 特刊特邀编辑
Pub Date : 2022-03-01 DOI: 10.2352/j.percept.imaging.2022.5.000101
Lora T. Likova, Fang Jiang, N. Stiles, A. Tanguay
{"title":"From the Special Issue Guest Editors","authors":"Lora T. Likova, Fang Jiang, N. Stiles, A. Tanguay","doi":"10.2352/j.percept.imaging.2022.5.000101","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2022.5.000101","url":null,"abstract":"","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"19 1","pages":"000101-1"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84609775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Impact of Optical and Geometrical Thickness on Perceived Translucency Differences 光学和几何厚度对感知半透明差异的影响
Pub Date : 2022-03-01 DOI: 10.2352/j.percept.imaging.2022.5.000501
Davit Gigilashvili, P. Urban, Jean-Baptiste Thomas, Marius Pedersen, J. Hardeberg
{"title":"The Impact of Optical and Geometrical Thickness on Perceived Translucency Differences","authors":"Davit Gigilashvili, P. Urban, Jean-Baptiste Thomas, Marius Pedersen, J. Hardeberg","doi":"10.2352/j.percept.imaging.2022.5.000501","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2022.5.000501","url":null,"abstract":"","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"103 1","pages":"000501-1"},"PeriodicalIF":0.0,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76700253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Perception and Appreciation of Tactile Objects: The Role of Visual Experience and Texture Parameters. 触觉对象的感知与欣赏:视觉经验与纹理参数的作用。
Pub Date : 2022-01-01 DOI: 10.2352/J.Percept.Imaging.2022.5.000405
A K M Rezaul Karim, Sanchary Prativa, Lora T Likova

This exploratory study was designed to examine the effects of visual experience and specific texture parameters on both discriminative and aesthetic aspects of tactile perception. To this end, the authors conducted two experiments using a novel behavioral (ranking) approach in blind and (blindfolded) sighted individuals. Groups of congenitally blind, late blind, and (blindfolded) sighted participants made relative stimulus preference, aesthetic appreciation, and smoothness or softness judgment of two-dimensional (2D) or three-dimensional (3D) tactile surfaces through active touch. In both experiments, the aesthetic judgment was assessed on three affective dimensions, Relaxation, Hedonics, and Arousal, hypothesized to underlie visual aesthetics in a prior study. Results demonstrated that none of these behavioral judgments significantly varied as a function of visual experience in either experiment. However, irrespective of visual experience, significant differences were identified in all these behavioral judgments across the physical levels of smoothness or softness. In general, 2D smoothness or 3D softness discrimination was proportional to the level of physical smoothness or softness. Second, the smoother or softer tactile stimuli were preferred over the rougher or harder tactile stimuli. Third, the 3D affective structure of visual aesthetics appeared to be amodal and applicable to tactile aesthetics. However, analysis of the aesthetic profile across the affective dimensions revealed some striking differences between the forms of appreciation of smoothness and softness, uncovering unanticipated substructures in the nascent field of tactile aesthetics. While the physically softer 3D stimuli received higher ranks on all three affective dimensions, the physically smoother 2D stimuli received higher ranks on the Relaxation and Hedonics but lower ranks on the Arousal dimension. Moreover, the Relaxation and Hedonics ranks accurately overlapped with one another across all the physical levels of softness/hardness, but not across the physical levels of smoothness/roughness. These findings suggest that physical texture parameters not only affect basic tactile discrimination but differentially mediate tactile preferences, and aesthetic appreciation. The theoretical and practical implications of these novel findings are discussed.

本探索性研究旨在探讨视觉经验和特定纹理参数对触觉知觉的辨别和审美方面的影响。为此,作者在盲人和(蒙眼)视力正常的个体中进行了两个实验,使用了一种新的行为(排序)方法。先天失明组、晚期失明组和(蒙眼)视力正常组通过主动触摸对二维或三维触觉表面进行相对刺激偏好、审美欣赏和平滑或柔软判断。在这两个实验中,审美判断都是在三个情感维度上进行评估的,放松、享乐和觉醒,这是先前研究中假设的视觉美学基础。结果表明,在两个实验中,这些行为判断都没有明显的视觉经验变化。然而,不管视觉体验如何,所有这些行为判断在平滑或柔软的物理水平上都存在显著差异。一般来说,2D平滑度或3D柔软度的辨别与物理平滑度或柔软度成正比。其次,较光滑或较柔软的触觉刺激比粗糙或较硬的触觉刺激更受欢迎。第三,视觉美学的三维情感结构表现出模态性,适用于触觉美学。然而,在情感维度上对审美轮廓的分析揭示了对平滑和柔软的欣赏形式之间的一些显著差异,揭示了触觉美学新兴领域中意想不到的子结构。虽然物理上较柔软的3D刺激在所有三个情感维度上得分较高,但物理上较光滑的2D刺激在放松和快乐维度上得分较高,但在唤醒维度上得分较低。此外,放松和享乐的排名在柔软/硬度的所有物理级别上都精确地重叠,但在光滑/粗糙的物理级别上却没有重叠。这些结果表明,物理纹理参数不仅影响基本的触觉辨别,而且对触觉偏好和审美有差异的调节作用。讨论了这些新发现的理论和实践意义。
{"title":"Perception and Appreciation of Tactile Objects: The Role of Visual Experience and Texture Parameters.","authors":"A K M Rezaul Karim,&nbsp;Sanchary Prativa,&nbsp;Lora T Likova","doi":"10.2352/J.Percept.Imaging.2022.5.000405","DOIUrl":"https://doi.org/10.2352/J.Percept.Imaging.2022.5.000405","url":null,"abstract":"<p><p>This exploratory study was designed to examine the effects of visual experience and specific texture parameters on both discriminative and aesthetic aspects of tactile perception. To this end, the authors conducted two experiments using a novel behavioral (ranking) approach in blind and (blindfolded) sighted individuals. Groups of congenitally blind, late blind, and (blindfolded) sighted participants made relative stimulus preference, aesthetic appreciation, and smoothness or softness judgment of two-dimensional (2D) or three-dimensional (3D) tactile surfaces through active touch. In both experiments, the aesthetic judgment was assessed on three affective dimensions, Relaxation, Hedonics, and Arousal, hypothesized to underlie visual aesthetics in a prior study. Results demonstrated that none of these behavioral judgments significantly varied as a function of visual experience in either experiment. However, irrespective of visual experience, significant differences were identified in all these behavioral judgments across the physical levels of smoothness or softness. In general, 2D smoothness or 3D softness discrimination was proportional to the level of physical smoothness or softness. Second, the smoother or softer tactile stimuli were preferred over the rougher or harder tactile stimuli. Third, the 3D affective structure of visual aesthetics appeared to be amodal and applicable to tactile aesthetics. However, analysis of the aesthetic profile across the affective dimensions revealed some striking differences between the forms of appreciation of smoothness and softness, uncovering unanticipated substructures in the nascent field of tactile aesthetics. While the physically softer 3D stimuli received higher ranks on all three affective dimensions, the physically smoother 2D stimuli received higher ranks on the Relaxation and Hedonics but lower ranks on the Arousal dimension. Moreover, the Relaxation and Hedonics ranks accurately overlapped with one another across all the physical levels of softness/hardness, but not across the physical levels of smoothness/roughness. These findings suggest that physical texture parameters not only affect basic tactile discrimination but differentially mediate tactile preferences, and aesthetic appreciation. The theoretical and practical implications of these novel findings are discussed.</p>","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"5 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019098/pdf/nihms-1789353.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9508763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Ventriloquist Effect is not Consistently Affected by Stimulus Realism† 腹语者效应并不总是受到刺激现实主义的影响†
Pub Date : 2022-01-01 DOI: 10.2352/j.percept.imaging.2021.4.2.020404
Thirsa Huisman, T. Dau, Tobias Piechowiak, Ewen N. MacDonald
Despite more than 60 years of research, it has remained uncertain if and how realism affects the ventriloquist effect. Here, a sound localization experiment was run using spatially disparate audio-visual stimuli. The visual stimuli were presented using virtual reality, allowing for easy manipulation of the degree of realism of the stimuli. Starting from stimuli commonly used in ventriloquist experiments, i.e., a light flash and noise burst, a new factor was added or changed in each condition to investigate the effect of movement and realism without confounding the effects of an increased temporal correlation of the audio-visual stimuli. First, a distractor task was introduced to ensure that participants fixated their eye gaze during the experiment. Next, movement was added to the visual stimuli while maintaining a similar temporal correlation between the stimuli. Finally, by changing the stimuli from the flash and noise stimuli to the visuals of a bouncing ball that made a matching impact sound, the effect of realism was assessed. No evidence for an effect of realism and movement of the stimuli was found, suggesting that, in simple scenarios, the ventriloquist effect might not be affected by stimulus realism.
尽管有超过60年的研究,现实主义是否以及如何影响腹语效果仍然不确定。在此,使用空间不同的视听刺激进行声音定位实验。视觉刺激使用虚拟现实呈现,允许轻松操纵刺激的现实程度。从腹语实验中常用的刺激,即闪光和噪音爆发开始,在每个条件下增加或改变一个新的因素来研究运动和真实感的影响,而不会混淆视听刺激增加的时间相关性的影响。首先,引入了一个分心任务,以确保参与者在实验过程中盯着他们的眼睛。接下来,将运动添加到视觉刺激中,同时保持刺激之间相似的时间相关性。最后,通过将刺激从闪光和噪音刺激改变为弹跳球的视觉效果,并发出匹配的撞击声,来评估真实感的效果。没有证据表明刺激的真实性和运动的影响,这表明,在简单的场景中,腹语表演的效果可能不受刺激真实性的影响。
{"title":"The Ventriloquist Effect is not Consistently Affected by Stimulus Realism†","authors":"Thirsa Huisman, T. Dau, Tobias Piechowiak, Ewen N. MacDonald","doi":"10.2352/j.percept.imaging.2021.4.2.020404","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2021.4.2.020404","url":null,"abstract":"Despite more than 60 years of research, it has remained uncertain if and how realism affects the ventriloquist effect. Here, a sound localization experiment was run using spatially disparate audio-visual stimuli. The visual stimuli were presented using virtual reality, allowing for easy manipulation of the degree of realism of the stimuli. Starting from stimuli commonly used in ventriloquist experiments, i.e., a light flash and noise burst, a new factor was added or changed in each condition to investigate the effect of movement and realism without confounding the effects of an increased temporal correlation of the audio-visual stimuli. First, a distractor task was introduced to ensure that participants fixated their eye gaze during the experiment. Next, movement was added to the visual stimuli while maintaining a similar temporal correlation between the stimuli. Finally, by changing the stimuli from the flash and noise stimuli to the visuals of a bouncing ball that made a matching impact sound, the effect of realism was assessed. No evidence for an effect of realism and movement of the stimuli was found, suggesting that, in simple scenarios, the ventriloquist effect might not be affected by stimulus realism.","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"56 1","pages":"000404-1"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84538439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controllable Medical Image Generation via GAN. 基于GAN的可控医学图像生成。
Pub Date : 2022-01-01 DOI: 10.2352/j.percept.imaging.2022.5.000502
Zhihang Ren, Stella X Yu, David Whitney

Medical image data is critically important for a range of disciplines, including medical image perception research, clinician training programs, and computer vision algorithms, among many other applications. Authentic medical image data, unfortunately, is relatively scarce for many of these uses. Because of this, researchers often collect their own data in nearby hospitals, which limits the generalizabilty of the data and findings. Moreover, even when larger datasets become available, they are of limited use because of the necessary data processing procedures such as de-identification, labeling, and categorizing, which requires significant time and effort. Thus, in some applications, including behavioral experiments on medical image perception, researchers have used naive artificial medical images (e.g., shapes or textures that are not realistic). These artificial medical images are easy to generate and manipulate, but the lack of authenticity inevitably raises questions about the applicability of the research to clinical practice. Recently, with the great progress in Generative Adversarial Networks (GAN), authentic images can be generated with high quality. In this paper, we propose to use GAN to generate authentic medical images for medical imaging studies. We also adopt a controllable method to manipulate the generated image attributes such that these images can satisfy any arbitrary experimenter goals, tasks, or stimulus settings. We have tested the proposed method on various medical image modalities, including mammogram, MRI, CT, and skin cancer images. The generated authentic medical images verify the success of the proposed method. The model and generated images could be employed in any medical image perception research.

医学图像数据对一系列学科至关重要,包括医学图像感知研究、临床医生培训计划和计算机视觉算法,以及许多其他应用。不幸的是,真实的医学图像数据对于许多这些用途来说相对稀缺。正因为如此,研究人员经常在附近的医院收集自己的数据,这限制了数据和发现的普遍性。此外,即使有更大的数据集可用,它们的用途也有限,因为需要进行必要的数据处理程序,如去识别、标记和分类,这需要大量的时间和精力。因此,在一些应用中,包括医学图像感知的行为实验中,研究人员使用了幼稚的人工医学图像(例如,不真实的形状或纹理)。这些人工医学图像易于生成和操作,但缺乏真实性不可避免地引发了对研究在临床实践中的适用性的质疑。近年来,随着生成对抗网络(GAN)技术的发展,可以生成高质量的真实图像。在本文中,我们建议使用GAN来生成真实的医学图像,用于医学成像研究。我们还采用了一种可控的方法来操纵生成的图像属性,使这些图像可以满足任何任意的实验目标、任务或刺激设置。我们已经在各种医学图像模式上测试了所提出的方法,包括乳房x光片、MRI、CT和皮肤癌图像。生成的真实医学图像验证了该方法的成功。该模型和生成的图像可用于任何医学图像感知研究。
{"title":"Controllable Medical Image Generation via GAN.","authors":"Zhihang Ren,&nbsp;Stella X Yu,&nbsp;David Whitney","doi":"10.2352/j.percept.imaging.2022.5.000502","DOIUrl":"https://doi.org/10.2352/j.percept.imaging.2022.5.000502","url":null,"abstract":"<p><p>Medical image data is critically important for a range of disciplines, including medical image perception research, clinician training programs, and computer vision algorithms, among many other applications. Authentic medical image data, unfortunately, is relatively scarce for many of these uses. Because of this, researchers often collect their own data in nearby hospitals, which limits the generalizabilty of the data and findings. Moreover, even when larger datasets become available, they are of limited use because of the necessary data processing procedures such as de-identification, labeling, and categorizing, which requires significant time and effort. Thus, in some applications, including behavioral experiments on medical image perception, researchers have used naive artificial medical images (e.g., shapes or textures that are not realistic). These artificial medical images are easy to generate and manipulate, but the lack of authenticity inevitably raises questions about the applicability of the research to clinical practice. Recently, with the great progress in Generative Adversarial Networks (GAN), authentic images can be generated with high quality. In this paper, we propose to use GAN to generate authentic medical images for medical imaging studies. We also adopt a controllable method to manipulate the generated image attributes such that these images can satisfy any arbitrary experimenter goals, tasks, or stimulus settings. We have tested the proposed method on various medical image modalities, including mammogram, MRI, CT, and skin cancer images. The generated authentic medical images verify the success of the proposed method. The model and generated images could be employed in any medical image perception research.</p>","PeriodicalId":73895,"journal":{"name":"Journal of perceptual imaging","volume":"5 ","pages":"0005021-50215"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10448967/pdf/nihms-1871254.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10104475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Journal of perceptual imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1