首页 > 最新文献

2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)最新文献

英文 中文
Automatic Detection of Lumbar Disc Herniation Using YOLOv7 应用YOLOv7自动检测腰椎间盘突出症
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10226718
Ardha Ardea Prisilla, Yori Pusparani, Wen-Thong Chang, B. Liau, Yih-Kuen Jan, Peter Ardhianto, Chih-Yang Lin, Chi-Wen Lung
The detection of lumbar disc herniation (LDH) through magnetic resonance imaging (MRI) poses a challenge due to the various shapes, sizes, angles, and regions associated with bulges, protrusions, extrusions, and sequestrations. One potential solution is using deep learning methods to identify lumbar abnormalities in MRI images automatically. The YOU ONLY LOOK ONCE (YOLO) model series has gained popularity for training deep learning algorithms for real-time biomedical image detection. This study aims to assess the performance of the latest YOLOv7 in detecting LDH across different regions of the lumbar intervertebral disc. The analysis revealed that YOLOv7 exhibits a poor performance and low detection rate of LDH across the L1-L2, L2-L3, L3-L4, L4-L5, and L5-S1 regions.
通过磁共振成像(MRI)检测腰椎间盘突出症(LDH)提出了一个挑战,因为与凸起、突出、挤压和隔离相关的各种形状、大小、角度和区域。一个潜在的解决方案是使用深度学习方法自动识别MRI图像中的腰椎异常。YOU ONLY LOOK ONCE (YOLO)模型系列在训练用于实时生物医学图像检测的深度学习算法方面获得了广泛的应用。本研究旨在评估最新的YOLOv7在检测腰椎间盘不同区域LDH方面的性能。分析表明,YOLOv7在L1-L2、L2-L3、L3-L4、L4-L5和L5-S1区域的LDH检测性能较差,检出率较低。
{"title":"Automatic Detection of Lumbar Disc Herniation Using YOLOv7","authors":"Ardha Ardea Prisilla, Yori Pusparani, Wen-Thong Chang, B. Liau, Yih-Kuen Jan, Peter Ardhianto, Chih-Yang Lin, Chi-Wen Lung","doi":"10.1109/ICCE-Taiwan58799.2023.10226718","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226718","url":null,"abstract":"The detection of lumbar disc herniation (LDH) through magnetic resonance imaging (MRI) poses a challenge due to the various shapes, sizes, angles, and regions associated with bulges, protrusions, extrusions, and sequestrations. One potential solution is using deep learning methods to identify lumbar abnormalities in MRI images automatically. The YOU ONLY LOOK ONCE (YOLO) model series has gained popularity for training deep learning algorithms for real-time biomedical image detection. This study aims to assess the performance of the latest YOLOv7 in detecting LDH across different regions of the lumbar intervertebral disc. The analysis revealed that YOLOv7 exhibits a poor performance and low detection rate of LDH across the L1-L2, L2-L3, L3-L4, L4-L5, and L5-S1 regions.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133075244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Mode AI Accelerator Architecture for Thermal-Aware 3D Stacked Deep Neural Network Design 热感知3D堆叠深度神经网络设计的多模AI加速器架构
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10227033
Hari Chandhana Varma M, Advaidh Swaminathan, Shu-Yen Lin
Deep Neural Networks (DNN) find its prominent presence in the AI world. The properties of its algorithms are very well exploited to make its computations power faster and more efficient. One such adaptation requires reduction in the bit width of the operations for DNN. This paper deals about a DNN architecture design with reconfigurable bitwidth accelerator without affecting the accuracy. This proposed architecture is a modified version of Bit Fusion architecture dealing with dynamic bit-level decomposition for accelerating complex DNN computations. The design make computations with less power consumption, and it is more suitable for the trade-offs among the latency, power, and temperature. The micro architecture design of the modified BitFusion using RTL simulations is carried out to evaluate the functionality.
深度神经网络(DNN)在人工智能领域占有重要地位。其算法的特性被很好地利用,使其计算能力更快、更高效。一种这样的适应需要减少深度神经网络操作的位宽。本文讨论了一种不影响精度的具有可重构位宽加速器的深度神经网络结构设计。该架构是比特融合架构的改进版本,用于处理动态比特级分解,以加速复杂深度神经网络的计算。该设计使计算功耗更低,更适合于延迟、功耗和温度之间的权衡。利用RTL仿真对改进后的BitFusion进行了微架构设计,以评估其功能。
{"title":"Multi-Mode AI Accelerator Architecture for Thermal-Aware 3D Stacked Deep Neural Network Design","authors":"Hari Chandhana Varma M, Advaidh Swaminathan, Shu-Yen Lin","doi":"10.1109/ICCE-Taiwan58799.2023.10227033","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10227033","url":null,"abstract":"Deep Neural Networks (DNN) find its prominent presence in the AI world. The properties of its algorithms are very well exploited to make its computations power faster and more efficient. One such adaptation requires reduction in the bit width of the operations for DNN. This paper deals about a DNN architecture design with reconfigurable bitwidth accelerator without affecting the accuracy. This proposed architecture is a modified version of Bit Fusion architecture dealing with dynamic bit-level decomposition for accelerating complex DNN computations. The design make computations with less power consumption, and it is more suitable for the trade-offs among the latency, power, and temperature. The micro architecture design of the modified BitFusion using RTL simulations is carried out to evaluate the functionality.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133804143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent Lighting Control under AI Architecture AI架构下的智能照明控制
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10226811
S. Chiang, C. Tsuei
This study utilized an optimized YOLOv3 training model to improve the accuracy of image recognition technology to over 90% through the data results obtained from the grayscale image and personnel recognition technology of the surveillance system at the experimental field - Baoshan Library. In addition, the system's control screen, combined with the verification field's system architecture, achieved the function of detecting and lighting fixtures within 1.4 to 1.6 seconds when personnel appeared within the detection system range, and the system could turn off the lighting fixtures within 1.4 to 1.9 seconds when personnel disappeared from the detection system.
本研究利用优化的YOLOv3训练模型,通过对宝山图书馆试验场监控系统灰度图像和人员识别技术的数据结果,将图像识别技术的准确率提高到90%以上。此外,系统的控制屏结合验证场的系统架构,实现了当人员出现在检测系统范围内时,在1.4 ~ 1.6秒内检测并点亮灯具的功能,当人员从检测系统中消失时,在1.4 ~ 1.9秒内关闭灯具的功能。
{"title":"Intelligent Lighting Control under AI Architecture","authors":"S. Chiang, C. Tsuei","doi":"10.1109/ICCE-Taiwan58799.2023.10226811","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226811","url":null,"abstract":"This study utilized an optimized YOLOv3 training model to improve the accuracy of image recognition technology to over 90% through the data results obtained from the grayscale image and personnel recognition technology of the surveillance system at the experimental field - Baoshan Library. In addition, the system's control screen, combined with the verification field's system architecture, achieved the function of detecting and lighting fixtures within 1.4 to 1.6 seconds when personnel appeared within the detection system range, and the system could turn off the lighting fixtures within 1.4 to 1.9 seconds when personnel disappeared from the detection system.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"739 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124398267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ICCE-Taiwan 2023 Conference Proceedings 台湾icce 2023会议论文集
Pub Date : 2023-07-17 DOI: 10.1109/icce-taiwan58799.2023.10226807
{"title":"ICCE-Taiwan 2023 Conference Proceedings","authors":"","doi":"10.1109/icce-taiwan58799.2023.10226807","DOIUrl":"https://doi.org/10.1109/icce-taiwan58799.2023.10226807","url":null,"abstract":"","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125555996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
QoE Assessment of Audiovisual Streaming over a Full-Duplex Wireless LAN with Interference Traffic 具有干扰流量的全双工无线局域网视听流的QoE评估
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10226707
Toshiro Nunome, Daiki Deguchi
This paper evaluates the QoE of video and audio transmission over a full-duplex wireless LAN with interference traffic through a computer simulation and a subjective experiment. We employ a simulation environment with a pair of audiovisual transmission and reception terminals and a pair of interference traffic transmission and reception ones. We investigate the effect of the transmission rate of interference traffic and communication distance in a wireless channel on the output quality of the video and audio stream at the reception terminal. We perform a subjective experiment with the output timing of video and audio obtained by the simulation.
本文通过计算机仿真和主观实验,对具有干扰业务的全双工无线局域网中视频和音频传输的QoE进行了评价。我们采用了一个模拟环境,包括一对视听收发终端和一对干扰交通收发终端。我们研究了无线信道中干扰流量的传输速率和通信距离对接收端视频和音频流输出质量的影响。对仿真得到的视频和音频输出时序进行了主观实验。
{"title":"QoE Assessment of Audiovisual Streaming over a Full-Duplex Wireless LAN with Interference Traffic","authors":"Toshiro Nunome, Daiki Deguchi","doi":"10.1109/ICCE-Taiwan58799.2023.10226707","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226707","url":null,"abstract":"This paper evaluates the QoE of video and audio transmission over a full-duplex wireless LAN with interference traffic through a computer simulation and a subjective experiment. We employ a simulation environment with a pair of audiovisual transmission and reception terminals and a pair of interference traffic transmission and reception ones. We investigate the effect of the transmission rate of interference traffic and communication distance in a wireless channel on the output quality of the video and audio stream at the reception terminal. We perform a subjective experiment with the output timing of video and audio obtained by the simulation.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125593709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Binaural Audio Generation with Data Augmentation from 360° Videos 双耳音频生成与数据增强从360°视频
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10227056
Masaki Yoshida, Ren Togo, Takahiro Ogawa, M. Haseyama
We present a novel binaural audio generation method with data augmentation from 360° videos. Visually informed binaural audio generation requires ground truth pairs of video and binaural audio. However, collecting diverse ground truth requires a lot of effort, and low data diversity reduces the generalization performance of the model. Our method introduces the data generation from 360° videos to solve the low diversity of ground truth. Experimental results show that our method improves the generalization performance of the binaural audio generation model and that 360° video is effective in generating video and pseudo-binaural audio pairs.
提出了一种基于360°视频数据增强的双耳音频生成方法。视觉上的双耳音频生成需要视频和双耳音频的真实对。然而,收集不同的地面真值需要大量的努力,低数据多样性降低了模型的泛化性能。该方法引入了360°视频数据生成,解决了地面真实度多样性低的问题。实验结果表明,该方法提高了双耳音频生成模型的泛化性能,360°视频可以有效地生成视频和伪双耳音频对。
{"title":"Binaural Audio Generation with Data Augmentation from 360° Videos","authors":"Masaki Yoshida, Ren Togo, Takahiro Ogawa, M. Haseyama","doi":"10.1109/ICCE-Taiwan58799.2023.10227056","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10227056","url":null,"abstract":"We present a novel binaural audio generation method with data augmentation from 360° videos. Visually informed binaural audio generation requires ground truth pairs of video and binaural audio. However, collecting diverse ground truth requires a lot of effort, and low data diversity reduces the generalization performance of the model. Our method introduces the data generation from 360° videos to solve the low diversity of ground truth. Experimental results show that our method improves the generalization performance of the binaural audio generation model and that 360° video is effective in generating video and pseudo-binaural audio pairs.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124319492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Survival Game Assisted Tactical Helmets 生存游戏辅助战术头盔
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10226765
Zhe-Xuan Su, Zeng Wei-chang, Pan Guo-Siang, Sun Chi-Chia
The modern idea of gunplay has its origins in games and TV shows, and survival games have emerged as a new form of competitive sport. To assist users in survival games, a system has been developed to help them gather enemy intelligence and gain an advantage for their team during gameplay. The system is designed to detect the enemy's position and transmit the information to the user, enabling them to respond quickly to the intelligence and significantly reduce the risk of being ambushed from behind.YOLOv4, a powerful object detection model, has found wide application in various industries, such as manufacturing and facial recognition. It is used to detect product defects and ensure the quality of products, and for comparing facial features to facilitate tasks such as face recognition and criminal tracking. In the pandemic, YOLOv4 has been employed for mask detection and social distancing monitoring in manufacturing facilities. Additionally, it has been customized for traffic management in the "Smart City Traffic Flow Solution" developed by Academia Sinica and Yilong. The modified YOLOv4 system uses real-time vehicle detection at intersections to regulate traffic flow and enforce speed limits.
现代的枪战理念起源于游戏和电视节目,而生存游戏已经成为一种新的竞技运动形式。为了帮助用户在生存游戏中获得帮助,他们开发了一个系统来帮助他们收集敌人的情报,并在游戏过程中为他们的团队获得优势。该系统被设计用于探测敌人的位置并将信息传送给用户,使他们能够迅速对情报作出反应,并显著降低被从后面伏击的风险。YOLOv4是一个强大的目标检测模型,在制造业和面部识别等各个行业得到了广泛的应用。它用于检测产品缺陷,确保产品质量,并用于比较面部特征,以方便人脸识别和犯罪跟踪等任务。在大流行期间,YOLOv4已被用于制造设施的口罩检测和社交距离监测。此外,它还被中研院与亿龙共同开发的“智慧城市交通流解决方案”定制用于交通管理。改进后的YOLOv4系统在十字路口使用实时车辆检测来调节交通流量并执行速度限制。
{"title":"Survival Game Assisted Tactical Helmets","authors":"Zhe-Xuan Su, Zeng Wei-chang, Pan Guo-Siang, Sun Chi-Chia","doi":"10.1109/ICCE-Taiwan58799.2023.10226765","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226765","url":null,"abstract":"The modern idea of gunplay has its origins in games and TV shows, and survival games have emerged as a new form of competitive sport. To assist users in survival games, a system has been developed to help them gather enemy intelligence and gain an advantage for their team during gameplay. The system is designed to detect the enemy's position and transmit the information to the user, enabling them to respond quickly to the intelligence and significantly reduce the risk of being ambushed from behind.YOLOv4, a powerful object detection model, has found wide application in various industries, such as manufacturing and facial recognition. It is used to detect product defects and ensure the quality of products, and for comparing facial features to facilitate tasks such as face recognition and criminal tracking. In the pandemic, YOLOv4 has been employed for mask detection and social distancing monitoring in manufacturing facilities. Additionally, it has been customized for traffic management in the \"Smart City Traffic Flow Solution\" developed by Academia Sinica and Yilong. The modified YOLOv4 system uses real-time vehicle detection at intersections to regulate traffic flow and enforce speed limits.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134241350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D object detection with low resolution depth map 低分辨率深度图的3D目标检测
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10226909
Wei-Ting Huang, Xiu-Zhi Chen, Yen-Lin Chen, Chieh-Sheng Huang
Obtaining high-resolution depth map from radar and lidar for training depth estimation network is a common approach in recent years, although the expensive price and high computational complexity cause it hard to be apply in real-world applications. In this research, we propose a novel concept that only requires low resolution depth map for training, but able to reach similar performance comparing to high-resolution depth map-based approaches. Refers to the experimental results, which evaluated on KITTI dataset, our proposed concept decreases 7% and 2% on the mAP of car and pedestrian comparing to the original approach in CaDDN, although it achieves similar performance on visualization result.
从雷达和激光雷达获取高分辨率深度图用于训练深度估计网络是近年来常用的方法,但由于其昂贵的价格和较高的计算复杂度,难以在实际应用中得到应用。在本研究中,我们提出了一个新的概念,只需要低分辨率深度图进行训练,但与基于高分辨率深度图的方法相比,能够达到相似的性能。根据在KITTI数据集上评估的实验结果,我们提出的概念在CaDDN上与原始方法相比,在汽车和行人的地图上分别下降了7%和2%,尽管在可视化结果上取得了相似的性能。
{"title":"3D object detection with low resolution depth map","authors":"Wei-Ting Huang, Xiu-Zhi Chen, Yen-Lin Chen, Chieh-Sheng Huang","doi":"10.1109/ICCE-Taiwan58799.2023.10226909","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226909","url":null,"abstract":"Obtaining high-resolution depth map from radar and lidar for training depth estimation network is a common approach in recent years, although the expensive price and high computational complexity cause it hard to be apply in real-world applications. In this research, we propose a novel concept that only requires low resolution depth map for training, but able to reach similar performance comparing to high-resolution depth map-based approaches. Refers to the experimental results, which evaluated on KITTI dataset, our proposed concept decreases 7% and 2% on the mAP of car and pedestrian comparing to the original approach in CaDDN, although it achieves similar performance on visualization result.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134515878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Scale Selection Network for Crowd Counting 人群计数的自适应尺度选择网络
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10227011
Ting-Hsu Lai, Tsung-Jung Liu, Kuan-Hsien Liu
Crowd counting is a computer vision task that focuses on accurately estimating the number of people present in a given scene. In the past few years, convolutional neural network-based deep learning techniques have achieved remarkable success in many computer vision tasks, including crowd counting. In the field of crowd counting, large-scale changes have always been a great challenge. To resolve this problem, previous work used multiple branches to obtain information at different scales and combined it. However, purely combining multi-branch features cannot effectively utilize multi-scale information. In this work, we modify the previous multi-branch architecture, which can reasonably select the appropriate scale information. Furthermore, we test our model on the ShanghaiTech dataset and demonstrate the competitive performance of our method.
人群计数是一项计算机视觉任务,重点是准确估计给定场景中出现的人数。在过去的几年里,基于卷积神经网络的深度学习技术在许多计算机视觉任务中取得了显著的成功,包括人群计数。在人群统计领域,大规模的变化一直是一个巨大的挑战。为了解决这个问题,以前的工作使用多个分支来获取不同尺度的信息并将其组合起来。然而,单纯的多分支特征组合并不能有效地利用多尺度信息。在这项工作中,我们修改了以前的多分支架构,可以合理地选择合适的规模信息。此外,我们在上海科技的数据集上测试了我们的模型,并证明了我们的方法的竞争性能。
{"title":"Adaptive Scale Selection Network for Crowd Counting","authors":"Ting-Hsu Lai, Tsung-Jung Liu, Kuan-Hsien Liu","doi":"10.1109/ICCE-Taiwan58799.2023.10227011","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10227011","url":null,"abstract":"Crowd counting is a computer vision task that focuses on accurately estimating the number of people present in a given scene. In the past few years, convolutional neural network-based deep learning techniques have achieved remarkable success in many computer vision tasks, including crowd counting. In the field of crowd counting, large-scale changes have always been a great challenge. To resolve this problem, previous work used multiple branches to obtain information at different scales and combined it. However, purely combining multi-branch features cannot effectively utilize multi-scale information. In this work, we modify the previous multi-branch architecture, which can reasonably select the appropriate scale information. Furthermore, we test our model on the ShanghaiTech dataset and demonstrate the competitive performance of our method.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"228 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132540040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visual Traits-Based Recommendation System for Proactive Retailing in Physical Store Environment 基于视觉特征的实体店环境下主动零售推荐系统
Pub Date : 2023-07-17 DOI: 10.1109/ICCE-Taiwan58799.2023.10226980
X. Kh’ng, Boon Yaik Ooi, Sheng Kang Teoh, Boon Sheng Ooi, Yen-Lin Chen
A comprehensive visual traits-based recommendation system is designed for proactive retailing in a physical store environment. The proposed system utilizes computer vision algorithms to analyze various visual traits of customers, including facial features, clothing and accessories, to provide targeted product recommendations. The system does not require customers to provide any personal information, making it a less intrusive and more hassle-free approach compared to the conventional membership approach. By providing tailored product suggestions, the system offers retailers the opportunity to enhance their customers’ shopping experience and increase sales. This paper provides a detailed description of the system's design and setup.
针对实体店环境下的主动零售,设计了一个基于视觉特征的综合推荐系统。该系统利用计算机视觉算法分析顾客的各种视觉特征,包括面部特征、服装和配饰,从而提供有针对性的产品推荐。该系统不需要客户提供任何个人信息,与传统的会员方式相比,它的侵入性更小,麻烦更少。通过提供量身定制的产品建议,该系统为零售商提供了提高客户购物体验和增加销售额的机会。本文详细介绍了该系统的设计和设置。
{"title":"Visual Traits-Based Recommendation System for Proactive Retailing in Physical Store Environment","authors":"X. Kh’ng, Boon Yaik Ooi, Sheng Kang Teoh, Boon Sheng Ooi, Yen-Lin Chen","doi":"10.1109/ICCE-Taiwan58799.2023.10226980","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226980","url":null,"abstract":"A comprehensive visual traits-based recommendation system is designed for proactive retailing in a physical store environment. The proposed system utilizes computer vision algorithms to analyze various visual traits of customers, including facial features, clothing and accessories, to provide targeted product recommendations. The system does not require customers to provide any personal information, making it a less intrusive and more hassle-free approach compared to the conventional membership approach. By providing tailored product suggestions, the system offers retailers the opportunity to enhance their customers’ shopping experience and increase sales. This paper provides a detailed description of the system's design and setup.","PeriodicalId":112903,"journal":{"name":"2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)","volume":"35 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131545930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1