首页 > 最新文献

IEEE Open Journal on Immersive Displays最新文献

英文 中文
Beyond the Mean: Statistical Measures for Quantifying Perceptual Quality in Neural Rendering 超越平均值:神经渲染中量化感知质量的统计方法
Pub Date : 2025-11-07 DOI: 10.1109/OJID.2025.3630586
Shihao Luo;Truong Cong Thang
Novel View Synthesis (NVS) aims to generate unseen views of a scene from limited observations and has become foundational to immersive multimedia through recent advances in neural rendering. In immersive displays, the success of an NVS system mainly depends on the overall perceptual quality it can provide. However, most neural rendering models report only the mean (the average) values of synthesized novel views as the overall quality performance indicator. This practice lacks validation, raising the question of whether relying solely on the mean is sufficient to capture overall perceptual quality. In this work, we present a statistical examination to investigate whether overall perceptual quality in neural rendering is sufficiently represented by the mean alone or requires additional measures. Our results highlight the dominant role of mean quality while introducing a more comprehensive statistical framework for overall quality assessment in neural rendering. We demonstrate that other statistical measures offer additional aspects of the perceptual quality that the mean alone cannot fully represent.
新颖视图合成(NVS)旨在从有限的观察中生成场景的未见视图,并且通过神经渲染的最新进展已成为沉浸式多媒体的基础。在沉浸式显示中,NVS系统的成功主要取决于它能提供的整体感知质量。然而,大多数神经渲染模型仅报告合成新颖视图的平均值作为整体质量性能指标。这种做法缺乏验证,提出了一个问题,即仅仅依靠平均值是否足以捕捉整体感知质量。在这项工作中,我们提出了一项统计检查,以调查神经渲染中的整体感知质量是否仅由平均值充分表示,还是需要额外的测量。我们的研究结果强调了平均质量的主导作用,同时引入了一个更全面的统计框架来评估神经渲染的整体质量。我们证明,其他统计度量提供了感知质量的其他方面,而这些方面仅靠平均值无法完全代表。
{"title":"Beyond the Mean: Statistical Measures for Quantifying Perceptual Quality in Neural Rendering","authors":"Shihao Luo;Truong Cong Thang","doi":"10.1109/OJID.2025.3630586","DOIUrl":"https://doi.org/10.1109/OJID.2025.3630586","url":null,"abstract":"Novel View Synthesis (NVS) aims to generate unseen views of a scene from limited observations and has become foundational to immersive multimedia through recent advances in neural rendering. In immersive displays, the success of an NVS system mainly depends on the overall perceptual quality it can provide. However, most neural rendering models report only the mean (the average) values of synthesized novel views as the overall quality performance indicator. This practice lacks validation, raising the question of whether relying solely on the mean is sufficient to capture overall perceptual quality. In this work, we present a statistical examination to investigate whether overall perceptual quality in neural rendering is sufficiently represented by the mean alone or requires additional measures. Our results highlight the dominant role of mean quality while introducing a more comprehensive statistical framework for overall quality assessment in neural rendering. We demonstrate that other statistical measures offer additional aspects of the perceptual quality that the mean alone cannot fully represent.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"200-209"},"PeriodicalIF":0.0,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11235564","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145674681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maxwellian-View Augmented Reality Displays With Extended Depth of Field 具有扩展景深的麦克斯韦视图增强现实显示器
Pub Date : 2025-10-15 DOI: 10.1109/OJID.2025.3622128
Sung-Min Jung;Laurynas Valantinas;Leon Preston;Pawan K. Shrestha
Driven by the growing demand for enhanced visual interaction with the physical world, augmented reality (AR) display technologies have rapidly emerged as transformative interfaces, bridging digital information seamlessly with real-world environments. Conventional optical architectures—such as birdbath, freeform combiners, holographic optical elements (HOEs), and waveguide couplers—offer trade-offs in resolution, brightness, transparency, and field of view (FOV) but often suffer from accommodation mismatch and vergence–accommodation conflict (VAC), causing visual discomfort. Maxwellian-view or retina projection displays address these issues by projecting focused light bundles directly into the pupil, producing depth-invariant images that remain sharp regardless of the eye’s accommodative state. This principle provides extended depth of field (DOF) and enables compact optical designs, though challenges such as a narrow eye-box remain. Recent advances—such as exit pupil expansion via multiple replications of converging points using HOE optics, beam splitter arrays, integration of holographic elements, and selective LED illumination—are enhancing the practicality of Maxwellian systems. By combining these with adaptive optics, intelligent tracking, and hybrid focal strategies, next-generation AR displays could deliver superior comfort, immersion, and usability. This review outlines the principles, advantages, limitations, and recent developments of Maxwellian AR optics, highlighting their potential as a transformative foundation for future AR display technologies across medical, industrial, and consumer applications.
在增强与物理世界的视觉交互需求不断增长的推动下,增强现实(AR)显示技术迅速成为变革性的接口,将数字信息与现实世界环境无缝地连接起来。传统的光学结构,如水盆、自由形状组合器、全息光学元件(HOEs)和波导耦合器,在分辨率、亮度、透明度和视场(FOV)方面进行了权衡,但往往存在调节不匹配和收敛调节冲突(VAC),导致视觉不适。麦克斯韦视图或视网膜投影显示器通过将聚焦的光束直接投射到瞳孔中来解决这些问题,产生深度不变的图像,无论眼睛的调节状态如何,图像都保持清晰。这一原理提供了扩展的景深(DOF),并使紧凑的光学设计成为可能,尽管窄小的眼盒等挑战仍然存在。最近的一些进展,如利用HOE光学、分束器阵列、全息元件集成和选择性LED照明,通过多次复制会聚点来扩大出瞳,正在增强麦克斯韦系统的实用性。通过将这些与自适应光学、智能跟踪和混合焦点策略相结合,下一代AR显示器可以提供卓越的舒适性、沉浸感和可用性。本文概述了麦克斯韦AR光学的原理、优势、局限性和最新发展,强调了它们作为未来AR显示技术在医疗、工业和消费者应用领域的变革性基础的潜力。
{"title":"Maxwellian-View Augmented Reality Displays With Extended Depth of Field","authors":"Sung-Min Jung;Laurynas Valantinas;Leon Preston;Pawan K. Shrestha","doi":"10.1109/OJID.2025.3622128","DOIUrl":"https://doi.org/10.1109/OJID.2025.3622128","url":null,"abstract":"Driven by the growing demand for enhanced visual interaction with the physical world, augmented reality (AR) display technologies have rapidly emerged as transformative interfaces, bridging digital information seamlessly with real-world environments. Conventional optical architectures—such as birdbath, freeform combiners, holographic optical elements (HOEs), and waveguide couplers—offer trade-offs in resolution, brightness, transparency, and field of view (FOV) but often suffer from accommodation mismatch and vergence–accommodation conflict (VAC), causing visual discomfort. Maxwellian-view or retina projection displays address these issues by projecting focused light bundles directly into the pupil, producing depth-invariant images that remain sharp regardless of the eye’s accommodative state. This principle provides extended depth of field (DOF) and enables compact optical designs, though challenges such as a narrow eye-box remain. Recent advances—such as exit pupil expansion via multiple replications of converging points using HOE optics, beam splitter arrays, integration of holographic elements, and selective LED illumination—are enhancing the practicality of Maxwellian systems. By combining these with adaptive optics, intelligent tracking, and hybrid focal strategies, next-generation AR displays could deliver superior comfort, immersion, and usability. This review outlines the principles, advantages, limitations, and recent developments of Maxwellian AR optics, highlighting their potential as a transformative foundation for future AR display technologies across medical, industrial, and consumer applications.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"114-121"},"PeriodicalIF":0.0,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11204834","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145405440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Touch to Immersion: A Systematic Review of Haptic and Multimodal Perception in the Metaverse 从触摸到沉浸:对虚拟世界中触觉和多模态感知的系统回顾
Pub Date : 2025-10-15 DOI: 10.1109/OJID.2025.3622139
Wenyu Yang;Zihe Zhao;Shuo Gao
In the emerging metaverse, immersive experiences depend on the integration of multimodal sensory channels, with haptic feedback playing a central role in enhancing realism and interaction. Recent progress in wearable devices, force feedback, and neural interfaces has driven research on combining tactile sensations with visual, auditory, olfactory, thermal, and force-related signals. Yet, current systems still face challenges in latency, energy efficiency, synchronisation, and personalisation, limiting their ability to match the complexity of human perception. This paper reviews the trajectory of haptic and multimodal perception fusion in the metaverse. It first introduces the biological and psychological foundations of touch, then discusses tactile technologies and device configurations. Next, it examines multimodal fusion models and mechanisms through comparative analyses and application evaluations, focusing on spatio-temporal synchronisation, cross-modal compensation, perceptual enhancement, and attentional allocation. The review provides an overview of technical approaches, implementation strategies, and application challenges, offering both theoretical grounding and practical insights for designing synchronised, real-time, personalised, and scalable multisensory interaction systems.
在新兴的虚拟世界中,沉浸式体验依赖于多模态感官通道的整合,触觉反馈在增强真实感和交互性方面发挥着核心作用。最近在可穿戴设备、力反馈和神经接口方面的进展推动了将触觉与视觉、听觉、嗅觉、热和力相关信号相结合的研究。然而,当前的系统仍然面临着延迟、能源效率、同步和个性化方面的挑战,限制了它们与人类感知复杂性相匹配的能力。本文综述了触觉和多模态知觉在虚拟世界中的融合轨迹。首先介绍了触觉的生物学和心理学基础,然后讨论了触觉技术和设备配置。其次,通过比较分析和应用评估,研究了多模态融合模型和机制,重点关注时空同步、跨模态补偿、感知增强和注意力分配。该综述概述了技术方法、实施策略和应用挑战,为设计同步、实时、个性化和可扩展的多感官交互系统提供了理论基础和实践见解。
{"title":"From Touch to Immersion: A Systematic Review of Haptic and Multimodal Perception in the Metaverse","authors":"Wenyu Yang;Zihe Zhao;Shuo Gao","doi":"10.1109/OJID.2025.3622139","DOIUrl":"https://doi.org/10.1109/OJID.2025.3622139","url":null,"abstract":"In the emerging metaverse, immersive experiences depend on the integration of multimodal sensory channels, with haptic feedback playing a central role in enhancing realism and interaction. Recent progress in wearable devices, force feedback, and neural interfaces has driven research on combining tactile sensations with visual, auditory, olfactory, thermal, and force-related signals. Yet, current systems still face challenges in latency, energy efficiency, synchronisation, and personalisation, limiting their ability to match the complexity of human perception. This paper reviews the trajectory of haptic and multimodal perception fusion in the metaverse. It first introduces the biological and psychological foundations of touch, then discusses tactile technologies and device configurations. Next, it examines multimodal fusion models and mechanisms through comparative analyses and application evaluations, focusing on spatio-temporal synchronisation, cross-modal compensation, perceptual enhancement, and attentional allocation. The review provides an overview of technical approaches, implementation strategies, and application challenges, offering both theoretical grounding and practical insights for designing synchronised, real-time, personalised, and scalable multisensory interaction systems.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"122-138"},"PeriodicalIF":0.0,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11204837","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Carbon Nanotube-Based Flexible and Stretchable Electronics 基于碳纳米管的柔性和可拉伸电子学
Pub Date : 2025-10-14 DOI: 10.1109/OJID.2025.3620828
Dexing Liu;Rui Qiu;Jingwen Liu;Can Li;Min Zhang
Flexible and stretchable electronics hold great promise for wearable devices and health monitoring systems owing to their mechanical conformability, light weight, and seamless integration capabilities. Nevertheless, achieving high electrical performance while retaining flexibility and operational stability remains a considerable challenge. Among various candidate materials, carbon nanotubes (CNTs) have garnered significant attention due to their exceptional carrier mobility, inherent mechanical flexibility, and compatibility with low-temperature fabrication processes. This article provides a comprehensive overview of recent progress in flexible and stretchable electronics, focusing specifically on advances in CNT-based thin-film transistors and integrated circuits. Furthermore, it examines emerging applications of CNTs in artificial neuromorphic systems and soft sensors, underscoring their potential to enable next-generation soft electronic technologies.
柔性和可拉伸的电子产品由于其机械一致性、重量轻和无缝集成能力,在可穿戴设备和健康监测系统中具有很大的前景。然而,在保持灵活性和操作稳定性的同时实现高电气性能仍然是一个相当大的挑战。在各种候选材料中,碳纳米管(CNTs)由于其特殊的载流子迁移率、固有的机械灵活性和与低温制造工艺的兼容性而引起了极大的关注。本文全面概述了柔性和可拉伸电子器件的最新进展,重点介绍了基于碳纳米管的薄膜晶体管和集成电路的进展。此外,它还研究了碳纳米管在人工神经形态系统和软传感器中的新兴应用,强调了它们在实现下一代软电子技术方面的潜力。
{"title":"Carbon Nanotube-Based Flexible and Stretchable Electronics","authors":"Dexing Liu;Rui Qiu;Jingwen Liu;Can Li;Min Zhang","doi":"10.1109/OJID.2025.3620828","DOIUrl":"https://doi.org/10.1109/OJID.2025.3620828","url":null,"abstract":"Flexible and stretchable electronics hold great promise for wearable devices and health monitoring systems owing to their mechanical conformability, light weight, and seamless integration capabilities. Nevertheless, achieving high electrical performance while retaining flexibility and operational stability remains a considerable challenge. Among various candidate materials, carbon nanotubes (CNTs) have garnered significant attention due to their exceptional carrier mobility, inherent mechanical flexibility, and compatibility with low-temperature fabrication processes. This article provides a comprehensive overview of recent progress in flexible and stretchable electronics, focusing specifically on advances in CNT-based thin-film transistors and integrated circuits. Furthermore, it examines emerging applications of CNTs in artificial neuromorphic systems and soft sensors, underscoring their potential to enable next-generation soft electronic technologies.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"139-165"},"PeriodicalIF":0.0,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11202373","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145510232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Eye Pose Estimation and Tracking Using Iris as a Base Feature 以虹膜为基本特征的眼姿估计与跟踪
Pub Date : 2025-10-01 DOI: 10.1109/OJID.2025.3616949
Dmitry Shmunk
A novel, fast, and robust method for 3D eye pose tracking that leverages the anatomical constancy of the human iris to improve accuracy and computational efficiency is proposed. Traditional pupil-based methods suffer from limitations due to pupil size variability, decentering, and the need for complex corrections for refraction through the corneal bulge. In contrast, the iris, due to its fixed size and direct visibility, serves as a more reliable feature for precise eye pose estimation. Our method combines key advantages of both model-based and regression-based approaches without requiring external glint-producing light sources or high computational overheads associated with neural-network-based solutions. The iris is used as the primary tracking feature, enabling robust detection even under partial occlusion and in users wearing prescription eyewear. Exploiting the consistent geometry of the iris, we estimate gaze direction and 3D eye position with high precision. Unlike existing methods, the proposed approach minimizes reliance on pupil measurements, employing the pupil’s high contrast only to augment iris detection. This strategy ensures robustness in real-world scenarios, including varying illumination and stray light/glints/distortions introduced by corrective eyewear. Experimental results show that the method achieves low computational cost while maintaining state-of-the-art performance.
提出了一种新颖、快速、鲁棒的三维眼姿跟踪方法,该方法利用人眼虹膜的解剖稳定性来提高眼姿跟踪的精度和计算效率。传统的基于瞳孔的方法由于瞳孔大小的可变性、离心以及需要通过角膜凸起进行复杂的屈光矫正而受到限制。相比之下,虹膜由于其固定的大小和直接可见性,可以作为精确的眼睛姿态估计的更可靠的特征。我们的方法结合了基于模型和基于回归的方法的关键优势,不需要外部产生闪烁的光源,也不需要与基于神经网络的解决方案相关的高计算开销。虹膜被用作主要的跟踪特征,即使在部分遮挡和佩戴处方眼镜的用户中也能进行稳健的检测。利用虹膜的一致几何形状,我们可以高精度地估计凝视方向和3D眼睛位置。与现有的方法不同,本文提出的方法最大限度地减少了对瞳孔测量的依赖,利用瞳孔的高对比度来增强虹膜检测。这一策略确保了在现实场景中的稳健性,包括不同的照明和杂散光/闪烁/矫正眼镜引入的畸变。实验结果表明,该方法在保持最先进性能的同时,实现了较低的计算成本。
{"title":"Eye Pose Estimation and Tracking Using Iris as a Base Feature","authors":"Dmitry Shmunk","doi":"10.1109/OJID.2025.3616949","DOIUrl":"https://doi.org/10.1109/OJID.2025.3616949","url":null,"abstract":"A novel, fast, and robust method for 3D eye pose tracking that leverages the anatomical constancy of the human iris to improve accuracy and computational efficiency is proposed. Traditional pupil-based methods suffer from limitations due to pupil size variability, decentering, and the need for complex corrections for refraction through the corneal bulge. In contrast, the iris, due to its fixed size and direct visibility, serves as a more reliable feature for precise eye pose estimation. Our method combines key advantages of both model-based and regression-based approaches without requiring external glint-producing light sources or high computational overheads associated with neural-network-based solutions. The iris is used as the primary tracking feature, enabling robust detection even under partial occlusion and in users wearing prescription eyewear. Exploiting the consistent geometry of the iris, we estimate gaze direction and 3D eye position with high precision. Unlike existing methods, the proposed approach minimizes reliance on pupil measurements, employing the pupil’s high contrast only to augment iris detection. This strategy ensures robustness in real-world scenarios, including varying illumination and stray light/glints/distortions introduced by corrective eyewear. Experimental results show that the method achieves low computational cost while maintaining state-of-the-art performance.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"96-105"},"PeriodicalIF":0.0,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11189046","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comfort-Aware Trajectory Optimization for Immersive Human-Robot Interaction 沉浸式人机交互的舒适感知轨迹优化
Pub Date : 2025-09-25 DOI: 10.1109/OJID.2025.3614514
Yitian Kou;Dandan Zhu;Hao Zeng;Kaiwei Zhang;Xiaoxiao Sui;Xiongkuo Min;Guangtao Zhai
In human-robot cohabited environments, generating socially acceptable and human-like trajectories is critical to fostering safe, comfortable, and intuitive interactions. This paper presents a trajectory prediction framework that emulates human walking behavior by incorporating social dynamics and comfort-driven optimization, specifically within immersive virtual environments. Leveraging the Social Locomotion Model (SLM), our framework captures inter-personal interactions and spatial preferences, modeling how humans implicitly adjust paths to maintain social norms. We further introduce a Nelder-Mead-based optimization process to refine robot trajectories under these constraints, ensuring both goal-directedness and human-likeness with efficiency and applicability. To evaluate the perceptual realism and spatial comfort of the generated trajectories, we conduct a user study in a virtual reality (VR) setting, where participants experience and assess various robot navigation behaviors from a first-person perspective. Subjective feedback indicates that the trajectories optimized by our model are perceived to be significantly more natural and comfortable than those generated by baseline approaches. Our framework demonstrates strong potential for deployment in virtual human-robot interaction systems, where social legibility, responsiveness, and computational efficiency are all critical.
在人-机器人共存的环境中,产生社会上可接受的和类似人类的轨迹对于促进安全、舒适和直观的交互至关重要。本文提出了一个轨迹预测框架,通过结合社会动态和舒适驱动优化,特别是在沉浸式虚拟环境中模拟人类行走行为。利用社会运动模型(Social movement Model, SLM),我们的框架捕捉了人际互动和空间偏好,模拟了人类如何隐式调整路径以维持社会规范。我们进一步引入了一种基于nelder - mead的优化过程来细化这些约束下的机器人轨迹,以确保目标定向性和人类相似性,并具有效率和适用性。为了评估生成轨迹的感知真实感和空间舒适性,我们在虚拟现实(VR)环境中进行了一项用户研究,参与者从第一人称视角体验和评估各种机器人导航行为。主观反馈表明,我们的模型优化的轨迹被认为比基线方法产生的轨迹更加自然和舒适。我们的框架展示了在虚拟人机交互系统中部署的强大潜力,其中社会易读性,响应性和计算效率都至关重要。
{"title":"Comfort-Aware Trajectory Optimization for Immersive Human-Robot Interaction","authors":"Yitian Kou;Dandan Zhu;Hao Zeng;Kaiwei Zhang;Xiaoxiao Sui;Xiongkuo Min;Guangtao Zhai","doi":"10.1109/OJID.2025.3614514","DOIUrl":"https://doi.org/10.1109/OJID.2025.3614514","url":null,"abstract":"In human-robot cohabited environments, generating socially acceptable and human-like trajectories is critical to fostering safe, comfortable, and intuitive interactions. This paper presents a trajectory prediction framework that emulates human walking behavior by incorporating social dynamics and comfort-driven optimization, specifically within immersive virtual environments. Leveraging the Social Locomotion Model (SLM), our framework captures inter-personal interactions and spatial preferences, modeling how humans implicitly adjust paths to maintain social norms. We further introduce a Nelder-Mead-based optimization process to refine robot trajectories under these constraints, ensuring both goal-directedness and human-likeness with efficiency and applicability. To evaluate the perceptual realism and spatial comfort of the generated trajectories, we conduct a user study in a virtual reality (VR) setting, where participants experience and assess various robot navigation behaviors from a first-person perspective. Subjective feedback indicates that the trajectories optimized by our model are perceived to be significantly more natural and comfortable than those generated by baseline approaches. Our framework demonstrates strong potential for deployment in virtual human-robot interaction systems, where social legibility, responsiveness, and computational efficiency are all critical.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"106-113"},"PeriodicalIF":0.0,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11180046","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI and Physics-Based Computational Methods for Oxide Thin-Film Transistors: A Review 基于人工智能和物理的氧化薄膜晶体管计算方法综述
Pub Date : 2025-09-16 DOI: 10.1109/OJID.2025.3610414
Eunkyung Koh;Hyeon-Deuk Kim;Rokyeon Kim;Byoungtaek Son;Sang-Hoon Lee;Gyehyun Park;Eui-Cheol Shin;Yongsoo Lee;Insoo Wang
Oxide thin-film transistors (TFTs) are critical components in modern display technologies due to their high mobility, optical transparency, and low-temperature processability. As the design space expands across material systems, device architectures, and operating conditions, there is a growing demand for computational methods that support reliable and efficient modeling. This review presents a comprehensive overview of AI- and physics-based methods for oxide TFTs, spanning from atomistic material analysis to circuit-level modeling. We discuss atomistic simulations such as density functional theory (DFT) and molecular dynamics (MD) for defect energetics and carrier behavior, technology computer-aided design (TCAD) for device-level electrothermal analysis, and compact models for circuit simulation. The role of artificial intelligence in surrogate modeling, parameter extraction, optimization of materials, device structures, and processes is discussed. By bridging simulation methods across multiple scales, this review provides insights into accelerating the design, and analysis of oxide TFTs.
氧化物薄膜晶体管(TFTs)由于其高迁移率、光学透明性和低温可加工性而成为现代显示技术中的关键部件。随着设计空间在材料系统、器件架构和操作条件上的扩展,对支持可靠和高效建模的计算方法的需求不断增长。本文综述了基于人工智能和物理的氧化tft方法的全面概述,从原子材料分析到电路级建模。我们讨论了原子模拟,如密度泛函理论(DFT)和分子动力学(MD)的缺陷能量学和载流子行为,技术计算机辅助设计(TCAD)的器件级电热分析,和紧凑模型的电路仿真。讨论了人工智能在替代建模、参数提取、材料、器件结构和工艺优化中的作用。通过跨多个尺度的桥接模拟方法,本综述为加速氧化物tft的设计和分析提供了见解。
{"title":"AI and Physics-Based Computational Methods for Oxide Thin-Film Transistors: A Review","authors":"Eunkyung Koh;Hyeon-Deuk Kim;Rokyeon Kim;Byoungtaek Son;Sang-Hoon Lee;Gyehyun Park;Eui-Cheol Shin;Yongsoo Lee;Insoo Wang","doi":"10.1109/OJID.2025.3610414","DOIUrl":"https://doi.org/10.1109/OJID.2025.3610414","url":null,"abstract":"Oxide thin-film transistors (TFTs) are critical components in modern display technologies due to their high mobility, optical transparency, and low-temperature processability. As the design space expands across material systems, device architectures, and operating conditions, there is a growing demand for computational methods that support reliable and efficient modeling. This review presents a comprehensive overview of AI- and physics-based methods for oxide TFTs, spanning from atomistic material analysis to circuit-level modeling. We discuss atomistic simulations such as density functional theory (DFT) and molecular dynamics (MD) for defect energetics and carrier behavior, technology computer-aided design (TCAD) for device-level electrothermal analysis, and compact models for circuit simulation. The role of artificial intelligence in surrogate modeling, parameter extraction, optimization of materials, device structures, and processes is discussed. By bridging simulation methods across multiple scales, this review provides insights into accelerating the design, and analysis of oxide TFTs.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"89-95"},"PeriodicalIF":0.0,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11165197","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BPGI: A Brain-Perception Guided Interactive Network for Stereoscopic Omnidirectional Image Quality Assessment BPGI:用于立体全方位图像质量评估的脑感知引导交互网络
Pub Date : 2025-09-16 DOI: 10.1109/OJID.2025.3610449
Yun Liu;Sifan Li;Zihan Liu;Haiyuan Wang;Daoxin Fan
Stereoscopic omnidirectional image quality assessment is a combination task of stereoscopic image quality assessment and omnidirectional image quality assessment, which is more challenging than traditional three-dimensional images. Previous works fail to present a satisfying performance due to neglecting human brain perception mechanism. To solve the above problem, we proposed an effective brain-perception guided interactive network for stereoscopic omnidirectional image quality assessment (BPGI), which is built following three perception steps: visual information processing, feature fusion cognition, and quality evaluation. Considering the stereoscopic perception characteristics, binocular and monocular visual features are both extracted. Following human complex cognition mechanism, a Bi-LSTM module is introduced to dig the deeply inherent relationship between monocular and binocular visual feature and improve the feature representation ability of the proposed model. Then a visual feature fusion module is built to obtain effective interactive fusion for quality prediction. Experimental results prove that the proposed model outperforms many state-of-the-art models, and can be effectively applied to predict the quality of stereoscopic omnidirectional images.
立体全方位图像质量评价是立体图像质量评价与全方位图像质量评价相结合的任务,比传统的三维图像更具挑战性。由于忽视了人脑的感知机制,以往的研究未能呈现出令人满意的效果。为了解决上述问题,我们提出了一种有效的脑感知引导的立体全方位图像质量评价交互网络,该网络由视觉信息处理、特征融合认知和质量评价三个感知步骤组成。考虑立体感知特征,提取双眼和单眼视觉特征。根据人类复杂的认知机制,引入Bi-LSTM模块,深入挖掘单眼和双眼视觉特征之间的内在联系,提高模型的特征表征能力。然后构建视觉特征融合模块,实现有效的交互式融合,实现质量预测。实验结果表明,该模型优于许多现有的模型,可以有效地用于立体全向图像的质量预测。
{"title":"BPGI: A Brain-Perception Guided Interactive Network for Stereoscopic Omnidirectional Image Quality Assessment","authors":"Yun Liu;Sifan Li;Zihan Liu;Haiyuan Wang;Daoxin Fan","doi":"10.1109/OJID.2025.3610449","DOIUrl":"https://doi.org/10.1109/OJID.2025.3610449","url":null,"abstract":"Stereoscopic omnidirectional image quality assessment is a combination task of stereoscopic image quality assessment and omnidirectional image quality assessment, which is more challenging than traditional three-dimensional images. Previous works fail to present a satisfying performance due to neglecting human brain perception mechanism. To solve the above problem, we proposed an effective brain-perception guided interactive network for stereoscopic omnidirectional image quality assessment (BPGI), which is built following three perception steps: visual information processing, feature fusion cognition, and quality evaluation. Considering the stereoscopic perception characteristics, binocular and monocular visual features are both extracted. Following human complex cognition mechanism, a Bi-LSTM module is introduced to dig the deeply inherent relationship between monocular and binocular visual feature and improve the feature representation ability of the proposed model. Then a visual feature fusion module is built to obtain effective interactive fusion for quality prediction. Experimental results prove that the proposed model outperforms many state-of-the-art models, and can be effectively applied to predict the quality of stereoscopic omnidirectional images.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"81-88"},"PeriodicalIF":0.0,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11165215","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Immersive Touch-Interactive Display Enabled by Virtual Button 身临其境的触摸互动显示,虚拟按钮启用
Pub Date : 2025-09-05 DOI: 10.1109/OJID.2025.3606863
Shijie Xing;Shiqi Luo;Kai Wang
Virtual buttons on touch screens have assumed an increasingly prominent role within the field of human-computer interaction, functioning as a critical interface modality between users and digital systems. Following their emergence as a widely adopted alternative to traditional input devices such as the mouse and keyboard, virtual buttons have been extensively implemented across a broad spectrum of sectors, including manufacturing, healthcare, finance, and education. Their key advantages—including compact physical design, minimal hardware requirements, ease of system integration, and a relatively low user learning threshold—contribute to their suitability for both consumer-oriented products and domain-specific industrial applications. This paper presents a comprehensive review of numerous influential studies published over recent decades, offering a structured synthesis of the technological progression and functional evolution of virtual button systems. It provides a comparative analysis of various implementations grounded in distinct touch-sensing technologies, such as capacitive, resistive, piezoelectric, and infrared mechanisms, and evaluates their applicability in contexts including consumer electronics and in-vehicle interfaces. The discussion concludes with an examination of emerging trends and prospective development trajectories aimed at addressing the growing demand for intelligent, adaptive virtual button solutions.
触摸屏上的虚拟按钮作为用户与数字系统之间的关键接口方式,在人机交互领域中发挥着越来越突出的作用。随着虚拟按键作为鼠标和键盘等传统输入设备的替代品被广泛采用,虚拟按键在制造业、医疗保健、金融和教育等广泛领域得到了广泛的应用。它们的主要优势—包括紧凑的物理设计、最小的硬件需求、易于系统集成和相对较低的用户学习阈值—有助于它们适合面向消费者的产品和特定于领域的工业应用程序。本文对近几十年来发表的许多有影响力的研究进行了全面的回顾,对虚拟按钮系统的技术进步和功能演变进行了结构化的综合。它提供了基于不同触摸传感技术的各种实现的比较分析,如电容式、电阻式、压电式和红外机制,并评估了它们在消费电子和车载接口等环境中的适用性。讨论的最后是对新兴趋势和未来发展轨迹的考察,旨在解决对智能、自适应虚拟按钮解决方案日益增长的需求。
{"title":"Immersive Touch-Interactive Display Enabled by Virtual Button","authors":"Shijie Xing;Shiqi Luo;Kai Wang","doi":"10.1109/OJID.2025.3606863","DOIUrl":"https://doi.org/10.1109/OJID.2025.3606863","url":null,"abstract":"Virtual buttons on touch screens have assumed an increasingly prominent role within the field of human-computer interaction, functioning as a critical interface modality between users and digital systems. Following their emergence as a widely adopted alternative to traditional input devices such as the mouse and keyboard, virtual buttons have been extensively implemented across a broad spectrum of sectors, including manufacturing, healthcare, finance, and education. Their key advantages—including compact physical design, minimal hardware requirements, ease of system integration, and a relatively low user learning threshold—contribute to their suitability for both consumer-oriented products and domain-specific industrial applications. This paper presents a comprehensive review of numerous influential studies published over recent decades, offering a structured synthesis of the technological progression and functional evolution of virtual button systems. It provides a comparative analysis of various implementations grounded in distinct touch-sensing technologies, such as capacitive, resistive, piezoelectric, and infrared mechanisms, and evaluates their applicability in contexts including consumer electronics and in-vehicle interfaces. The discussion concludes with an examination of emerging trends and prospective development trajectories aimed at addressing the growing demand for intelligent, adaptive virtual button solutions.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"71-80"},"PeriodicalIF":0.0,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11152562","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Utilization of SLAM for Obstacle Detection in Commodity Mobile Devices SLAM在商用移动设备障碍物检测中的应用研究
Pub Date : 2025-07-25 DOI: 10.1109/OJID.2025.3592064
Dionysios Koulouris;Orestis Zaras;Andreas Menychtas;Panayiotis Tsanakas;Ilias Maglogiannis
Immersive Technologies are an increasingly prevalent field, employed by a plethora of portable and stationary solutions. Ongoing research continues to unlock new possibilities for their use, improving Human-Machine Interaction. In the health sector, such technologies have the potential to optimize the living of individuals with special needs, like the visually impaired. SLAM is a technique used in robotics and computer vision for achieving environmental understanding by building a vector map of the frontal space. It is the fundamental for developing applications that achieve immersive experiences to the user, such as Augmented Reality applications. The ability of these applications to understand their surroundings and the diverse methodologies studied over the years has led to the proposal of efficient techniques for extracting depth data from a camera feed, without the need of a depth sensor. This work introduces a system that exploits the Depth Estimation capabilities of SLAM to detect obstacles and cliffs in the user’s frontal environment. A mobile application was developed, that retrieves the camera feed and generates scene Depth-Maps, before importing them to a newly designed algorithm for obstacle and cliff identification. Audio and haptic feedback is used for warnings and usability notifications. The system was fine-tuned and tested in both indoor and outdoor spaces and quantitative and qualitative results were captured. The goal of the study is to present the development of a tool that can be executed on commodity mobile devices in real-time and it can enhance safety facilitating movement in both indoor and outdoor environments.
沉浸式技术是一个日益流行的领域,被大量便携式和固定式解决方案所采用。正在进行的研究继续为它们的使用打开新的可能性,改善人机交互。在卫生部门,这类技术有可能使有特殊需要的个人,如视障人士的生活达到最佳状态。SLAM是一种用于机器人和计算机视觉的技术,通过构建正面空间的矢量地图来实现环境理解。它是开发为用户实现沉浸式体验的应用程序(如增强现实应用程序)的基础。这些应用程序了解周围环境的能力,以及多年来研究的各种方法,导致了从相机馈送中提取深度数据的有效技术的提出,而不需要深度传感器。这项工作介绍了一个利用SLAM的深度估计能力来检测用户正面环境中的障碍物和悬崖的系统。开发了一个移动应用程序,它可以检索相机馈送并生成场景深度图,然后将它们导入新设计的障碍物和悬崖识别算法中。音频和触觉反馈用于警告和可用性通知。该系统在室内和室外空间进行了微调和测试,并获得了定量和定性的结果。本研究的目标是开发一种可以在商品移动设备上实时执行的工具,它可以提高室内和室外环境中运动的安全性。
{"title":"On the Utilization of SLAM for Obstacle Detection in Commodity Mobile Devices","authors":"Dionysios Koulouris;Orestis Zaras;Andreas Menychtas;Panayiotis Tsanakas;Ilias Maglogiannis","doi":"10.1109/OJID.2025.3592064","DOIUrl":"https://doi.org/10.1109/OJID.2025.3592064","url":null,"abstract":"Immersive Technologies are an increasingly prevalent field, employed by a plethora of portable and stationary solutions. Ongoing research continues to unlock new possibilities for their use, improving Human-Machine Interaction. In the health sector, such technologies have the potential to optimize the living of individuals with special needs, like the visually impaired. SLAM is a technique used in robotics and computer vision for achieving environmental understanding by building a vector map of the frontal space. It is the fundamental for developing applications that achieve immersive experiences to the user, such as Augmented Reality applications. The ability of these applications to understand their surroundings and the diverse methodologies studied over the years has led to the proposal of efficient techniques for extracting depth data from a camera feed, without the need of a depth sensor. This work introduces a system that exploits the Depth Estimation capabilities of SLAM to detect obstacles and cliffs in the user’s frontal environment. A mobile application was developed, that retrieves the camera feed and generates scene Depth-Maps, before importing them to a newly designed algorithm for obstacle and cliff identification. Audio and haptic feedback is used for warnings and usability notifications. The system was fine-tuned and tested in both indoor and outdoor spaces and quantitative and qualitative results were captured. The goal of the study is to present the development of a tool that can be executed on commodity mobile devices in real-time and it can enhance safety facilitating movement in both indoor and outdoor environments.","PeriodicalId":100634,"journal":{"name":"IEEE Open Journal on Immersive Displays","volume":"2 ","pages":"42-54"},"PeriodicalIF":0.0,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11096572","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144868345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Open Journal on Immersive Displays
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1