首页 > 最新文献

2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)最新文献

英文 中文
Continuous Supervision and Diagnostics System for Legacy Vehicles Integrated to Ambient Intelligence 集成环境智能的传统车辆持续监控和诊断系统
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937121
Walmir A. Silva, M. N. Rylo, Guido Soprano Machado, R. L. Medeiros, V. Lucena
Maintenance aims to keep the vehicle in good working order and avoid unpleasant surprises, such as mechanical breakdown and component breakage due to damaged parts, increasing losses. Currently, to detect these problems, the On-Board Diagnostic II (OBD-II) is used to diagnose issues in the Electronic Control Unit (ECU). Data are obtained from it by employing adapters, and such data are used in various applications. Ambient intelligence (AmI) is an environment with several devices connected in a wired or wireless network to obtain data from users without knowing it to aid and automate routine tasks. In this way, we consider that can access the information a home AmI can offer through these devices. In this context, would it be possible to connect the car to the AmI to provide information that will help us avoid significant vehicle problems? This paper presents a system capable of sending vehicle information to the user's smartphone via the messaging app alerting that the vehicle is above average temperature.
维修的目的是保持车辆良好的工作状态,避免意外的不愉快,如机械故障和部件损坏,增加损失。目前,为了检测这些问题,车载诊断II (OBD-II)被用于诊断电子控制单元(ECU)的问题。通过使用适配器从它获得数据,并且这些数据用于各种应用程序。环境智能(AmI)是一种由有线或无线网络连接的多个设备组成的环境,可以在不知情的情况下从用户那里获取数据,以辅助和自动执行日常任务。通过这种方式,我们认为可以通过这些设备访问家庭AmI可以提供的信息。在这种情况下,是否有可能将汽车连接到AmI,以提供有助于我们避免重大车辆问题的信息?本文介绍了一个系统,该系统能够通过消息应用程序向用户的智能手机发送车辆信息,提醒车辆高于平均温度。
{"title":"Continuous Supervision and Diagnostics System for Legacy Vehicles Integrated to Ambient Intelligence","authors":"Walmir A. Silva, M. N. Rylo, Guido Soprano Machado, R. L. Medeiros, V. Lucena","doi":"10.1109/ICCE-Berlin56473.2022.9937121","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937121","url":null,"abstract":"Maintenance aims to keep the vehicle in good working order and avoid unpleasant surprises, such as mechanical breakdown and component breakage due to damaged parts, increasing losses. Currently, to detect these problems, the On-Board Diagnostic II (OBD-II) is used to diagnose issues in the Electronic Control Unit (ECU). Data are obtained from it by employing adapters, and such data are used in various applications. Ambient intelligence (AmI) is an environment with several devices connected in a wired or wireless network to obtain data from users without knowing it to aid and automate routine tasks. In this way, we consider that can access the information a home AmI can offer through these devices. In this context, would it be possible to connect the car to the AmI to provide information that will help us avoid significant vehicle problems? This paper presents a system capable of sending vehicle information to the user's smartphone via the messaging app alerting that the vehicle is above average temperature.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115051860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing Physical Unclonable Functions Implemented on Commercial Off-the-Shelf NAND Flash Memories Using Programming Disturbances 利用编程干扰测试商用现成NAND闪存上实现的物理不可克隆功能
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.10021310
N. Anagnostopoulos, Yufan Fan, Muhammad Umair Saleem, Nico Mexis, Emiliia Geloczi, Felix Klement, Florian Frank, André Schaller, T. Arul, S. Katzenbeisser
In this work, we present a Physical Unclonable Function (PUF) implemented on a Commercial Off-The-Shelf (COTS) NAND Flash memory module using programming disturbances, and examine the robustness of its responses to environmental variations. In particular, we test a removable Flash memory module serving as a PUF, under nominal conditions, as well as under temperature and voltage variations. To determine its resilience to environmental variations, we utilise well-known PUF metrics, such as the Hamming weight and the intra-device Hamming distance. Our results prove that, in general, the tested Samsung K9F1G08U0E NAND Flash memory can be used to realise a lightweight, scalable, and flexible hardware security primitive, namely a PUF, that can be utilised in the context of smart homes, smart vehicles, and other smart applications, as well as to protect commercial devices and networks in general. However, voltage variations seem to pose a substantial threat to the adoption of this PUF in practice. This threat may be addressed by small-scale design improvements that should be implemented and tested in practice as part of future works.
在这项工作中,我们提出了一个使用编程干扰在商用现货(COTS) NAND闪存模块上实现的物理不可克隆函数(PUF),并检查了其对环境变化响应的鲁棒性。特别是,我们测试了一个可移动的闪存模块作为PUF,在标称条件下,以及在温度和电压变化。为了确定其对环境变化的适应能力,我们利用了众所周知的PUF指标,如汉明权重和设备内汉明距离。我们的研究结果证明,总的来说,经过测试的三星K9F1G08U0E NAND闪存可用于实现轻量级,可扩展和灵活的硬件安全原语,即PUF,可用于智能家居,智能车辆和其他智能应用程序,以及保护商业设备和网络。然而,电压变化似乎对这种PUF在实践中的采用构成了实质性的威胁。这种威胁可以通过小规模的设计改进来解决,这些改进应该作为未来工作的一部分在实践中实施和测试。
{"title":"Testing Physical Unclonable Functions Implemented on Commercial Off-the-Shelf NAND Flash Memories Using Programming Disturbances","authors":"N. Anagnostopoulos, Yufan Fan, Muhammad Umair Saleem, Nico Mexis, Emiliia Geloczi, Felix Klement, Florian Frank, André Schaller, T. Arul, S. Katzenbeisser","doi":"10.1109/ICCE-Berlin56473.2022.10021310","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.10021310","url":null,"abstract":"In this work, we present a Physical Unclonable Function (PUF) implemented on a Commercial Off-The-Shelf (COTS) NAND Flash memory module using programming disturbances, and examine the robustness of its responses to environmental variations. In particular, we test a removable Flash memory module serving as a PUF, under nominal conditions, as well as under temperature and voltage variations. To determine its resilience to environmental variations, we utilise well-known PUF metrics, such as the Hamming weight and the intra-device Hamming distance. Our results prove that, in general, the tested Samsung K9F1G08U0E NAND Flash memory can be used to realise a lightweight, scalable, and flexible hardware security primitive, namely a PUF, that can be utilised in the context of smart homes, smart vehicles, and other smart applications, as well as to protect commercial devices and networks in general. However, voltage variations seem to pose a substantial threat to the adoption of this PUF in practice. This threat may be addressed by small-scale design improvements that should be implemented and tested in practice as part of future works.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116443871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Cell-wise encoding and decoding for TLC flash memories TLC闪存的Cell-wise编码和解码
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937136
Daniel Nicolas Bailon, S. Shavgulidze, J. Freudenberger
Automotive computing applications like AI databases, ADAS, and advanced infotainment systems have a huge need for persistent memory. This trend requires NAND flash memories designed for extreme automotive environments. However, the error probability of NAND flash memories has increased in recent years due to higher memory density and production tolerances. Hence, strong error correction coding is needed to meet automotive storage requirements. Many errors can be corrected by soft decoding algorithms. However, soft decoding is very resource-intensive and should be avoided when possible. NAND flash memories are organized in pages, and the error correction codes are usually encoded page-wise to reduce the latency of random reads. This page-wise encoding does not reach the maximum achievable capacity. Reading soft information increases the channel capacity but at the cost of higher latency and power consumption. In this work, we consider cell-wise encoding, which also increases the capacity compared to page-wise encoding. We analyze the cell-wise processing of data in triple-level cell (TLC) NAND flash and show the performance gain when using Low-Density Parity-Check (LDPC) codes. In addition, we investigate a coding approach with page-wise encoding and cell-wise reading.
像人工智能数据库、ADAS和高级信息娱乐系统这样的汽车计算应用对持久内存有着巨大的需求。这种趋势需要为极端汽车环境设计的NAND闪存。然而,近年来,由于更高的存储密度和生产公差,NAND闪存的错误概率有所增加。因此,需要强纠错编码来满足汽车存储的要求。软译码算法可以纠正许多错误。然而,软解码是非常资源密集的,应该尽可能避免。NAND闪存是按页组织的,纠错码通常按页编码,以减少随机读取的延迟。此分页编码未达到可实现的最大容量。读取软信息增加了信道容量,但代价是更高的延迟和功耗。在这项工作中,我们考虑了单元智能编码,与页面智能编码相比,它也增加了容量。我们分析了三电平单元(TLC) NAND闪存中逐单元的数据处理,并展示了使用低密度奇偶校验(LDPC)代码时的性能增益。此外,我们研究了一种基于页面的编码和基于单元的读取的编码方法。
{"title":"Cell-wise encoding and decoding for TLC flash memories","authors":"Daniel Nicolas Bailon, S. Shavgulidze, J. Freudenberger","doi":"10.1109/ICCE-Berlin56473.2022.9937136","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937136","url":null,"abstract":"Automotive computing applications like AI databases, ADAS, and advanced infotainment systems have a huge need for persistent memory. This trend requires NAND flash memories designed for extreme automotive environments. However, the error probability of NAND flash memories has increased in recent years due to higher memory density and production tolerances. Hence, strong error correction coding is needed to meet automotive storage requirements. Many errors can be corrected by soft decoding algorithms. However, soft decoding is very resource-intensive and should be avoided when possible. NAND flash memories are organized in pages, and the error correction codes are usually encoded page-wise to reduce the latency of random reads. This page-wise encoding does not reach the maximum achievable capacity. Reading soft information increases the channel capacity but at the cost of higher latency and power consumption. In this work, we consider cell-wise encoding, which also increases the capacity compared to page-wise encoding. We analyze the cell-wise processing of data in triple-level cell (TLC) NAND flash and show the performance gain when using Low-Density Parity-Check (LDPC) codes. In addition, we investigate a coding approach with page-wise encoding and cell-wise reading.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"31 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116848968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gesture recognition of wrist motion using low-frequency PPG 基于低频PPG的手腕动作手势识别
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937135
M. N. Rylo, Walmir A. Silva, R. L. P. Medeiros, V. Lucena
This paper evaluated two machine learning techniques using low-frequency photoplethysmography and motion sensor data from wearable devices in gesture segmentation and classification. SVM and random forests were the classifiers selected for testing. Preliminary evaluations show that frequencies of 25 Hz are suitable for the recognition process, achieving an F1-score of 0.819 for seven gesture sets.
本文评估了两种机器学习技术,使用低频光容积脉搏波和来自可穿戴设备的运动传感器数据进行手势分割和分类。选择SVM和随机森林作为分类器进行测试。初步评估表明,25 Hz的频率适合于识别过程,七个手势集的f1得分为0.819。
{"title":"Gesture recognition of wrist motion using low-frequency PPG","authors":"M. N. Rylo, Walmir A. Silva, R. L. P. Medeiros, V. Lucena","doi":"10.1109/ICCE-Berlin56473.2022.9937135","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937135","url":null,"abstract":"This paper evaluated two machine learning techniques using low-frequency photoplethysmography and motion sensor data from wearable devices in gesture segmentation and classification. SVM and random forests were the classifiers selected for testing. Preliminary evaluations show that frequencies of 25 Hz are suitable for the recognition process, achieving an F1-score of 0.819 for seven gesture sets.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130832520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural network aided reference voltage adaptation for NAND flash memory 神经网络辅助的NAND快闪记忆体参考电压自适应
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937118
Daniel Nicolas Bailon, G. Taburet, S. Shavgulidze, J. Freudenberger
Large persistent memory is crucial for many applications in embedded systems and automotive computing like AI databases, ADAS, and cutting-edge infotainment systems. Such applications require reliable NAND flash memories made for harsh automotive conditions. However, due to high memory densities and production tolerances, the error probability of NAND flash memories has risen. As the number of program/erase cycles and the data retention times increase, non-volatile NAND flash memories' performance and dependability suffer. The read reference voltages of the flash cells vary due to these aging processes. In this work, we consider the issue of reference voltage adaption. The considered estimation procedure uses shallow neural networks to estimate the read reference voltages for different life-cycle conditions with the help of histogram measurements. We demonstrate that the training data for the neural networks can be enhanced by using shifted histograms, i.e., a training of the neural networks is possible based on a few measurements of some extreme points used as training data. The trained neural networks generalize well for other life-cycle conditions.
大型持久内存对于嵌入式系统和汽车计算中的许多应用程序(如AI数据库、ADAS和尖端信息娱乐系统)至关重要。这种应用需要可靠的NAND闪存,用于恶劣的汽车条件。然而,由于高存储密度和生产公差,NAND闪存的错误概率已经上升。随着程序/擦除周期和数据保留时间的增加,非易失性NAND闪存的性能和可靠性受到影响。由于这些老化过程,闪光电池的读参考电压会发生变化。在这项工作中,我们考虑了参考电压自适应的问题。所考虑的估计过程使用浅神经网络在直方图测量的帮助下估计不同生命周期条件下的读参考电压。我们证明了神经网络的训练数据可以通过使用移位直方图来增强,即,基于一些极值点的测量作为训练数据,神经网络的训练是可能的。经过训练的神经网络可以很好地泛化其他生命周期条件。
{"title":"Neural network aided reference voltage adaptation for NAND flash memory","authors":"Daniel Nicolas Bailon, G. Taburet, S. Shavgulidze, J. Freudenberger","doi":"10.1109/ICCE-Berlin56473.2022.9937118","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937118","url":null,"abstract":"Large persistent memory is crucial for many applications in embedded systems and automotive computing like AI databases, ADAS, and cutting-edge infotainment systems. Such applications require reliable NAND flash memories made for harsh automotive conditions. However, due to high memory densities and production tolerances, the error probability of NAND flash memories has risen. As the number of program/erase cycles and the data retention times increase, non-volatile NAND flash memories' performance and dependability suffer. The read reference voltages of the flash cells vary due to these aging processes. In this work, we consider the issue of reference voltage adaption. The considered estimation procedure uses shallow neural networks to estimate the read reference voltages for different life-cycle conditions with the help of histogram measurements. We demonstrate that the training data for the neural networks can be enhanced by using shifted histograms, i.e., a training of the neural networks is possible based on a few measurements of some extreme points used as training data. The trained neural networks generalize well for other life-cycle conditions.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131382309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
In-Vehicle Monitoring for Passengers' Safety 车载监控乘客安全
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937111
Loujaina Hatim Backar, Meriam A. Khalifa, Mohammed Abdel-Megeed Salem
Driving drowsiness detection through videos/images is one of the most important issues for driver safety in today's world. Because of the great advancements in technology in the last few decades, deep learning techniques applied to computer vision applications such as sleep detection have shown promising results. Drowsiness is characterised by closed eyes, yawning, and micro-sleeps. Moreover, one of the biggest tragedies in the news lately, is toddlers or pets dying from heat built up in cars. In this work, a real-time deep learning algorithm is designed to monitor driver drowsiness, driver distraction, as well as an alert system for forgetting children and pets, and a seat belt usage system. The approach taken was to recognise and localise the face, eyes, and mouth, using the Dlib library, Histogram of Oriented Gradients, and a facial landmark predictor. The eye aspect ratio and the mouth aspect ratio are then calculated and evaluated for yawning detection and micro-sleep detection. The information on the driver's state was saved using a Firebase real-time database. This information is used by the children and pets detection algorithm, which sends an automatic email to the driver if a child or pet is discovered in the backseat when the driver is not in the car. When a driver uses a cell phone, eats, or drinks while driving, this is considered as a distraction. Canny edge detection is used to monitor the seat belt. Furthermore, the proposed method was subjected to several rounds of testing, that proved its viability and reliability.
通过视频/图像检测驾驶困倦是当今世界驾驶员安全最重要的问题之一。由于过去几十年技术的巨大进步,深度学习技术应用于计算机视觉应用,如睡眠检测,已经显示出有希望的结果。困倦的特征是闭上眼睛、打哈欠和微睡眠。此外,最近新闻中最大的悲剧之一是幼儿或宠物死于车内积聚的热量。在这项工作中,设计了一种实时深度学习算法,用于监测驾驶员的嗜睡,驾驶员分心,以及忘记儿童和宠物的警报系统,以及安全带使用系统。采用的方法是使用Dlib库、方向梯度直方图和面部地标预测器来识别和定位面部、眼睛和嘴巴。然后计算和评估眼宽高比和嘴宽高比用于哈欠检测和微睡眠检测。驾驶员的状态信息使用Firebase实时数据库保存。儿童和宠物检测算法使用这些信息,如果在驾驶员不在车内时发现后座上有儿童或宠物,该算法会自动向驾驶员发送电子邮件。当司机在开车时使用手机、吃东西或喝酒时,这被认为是一种分心。Canny边缘检测用于监控安全带。并对该方法进行了多轮测试,验证了该方法的可行性和可靠性。
{"title":"In-Vehicle Monitoring for Passengers' Safety","authors":"Loujaina Hatim Backar, Meriam A. Khalifa, Mohammed Abdel-Megeed Salem","doi":"10.1109/ICCE-Berlin56473.2022.9937111","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937111","url":null,"abstract":"Driving drowsiness detection through videos/images is one of the most important issues for driver safety in today's world. Because of the great advancements in technology in the last few decades, deep learning techniques applied to computer vision applications such as sleep detection have shown promising results. Drowsiness is characterised by closed eyes, yawning, and micro-sleeps. Moreover, one of the biggest tragedies in the news lately, is toddlers or pets dying from heat built up in cars. In this work, a real-time deep learning algorithm is designed to monitor driver drowsiness, driver distraction, as well as an alert system for forgetting children and pets, and a seat belt usage system. The approach taken was to recognise and localise the face, eyes, and mouth, using the Dlib library, Histogram of Oriented Gradients, and a facial landmark predictor. The eye aspect ratio and the mouth aspect ratio are then calculated and evaluated for yawning detection and micro-sleep detection. The information on the driver's state was saved using a Firebase real-time database. This information is used by the children and pets detection algorithm, which sends an automatic email to the driver if a child or pet is discovered in the backseat when the driver is not in the car. When a driver uses a cell phone, eats, or drinks while driving, this is considered as a distraction. Canny edge detection is used to monitor the seat belt. Furthermore, the proposed method was subjected to several rounds of testing, that proved its viability and reliability.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131618045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Image Captioning using Pretrained Language Models and Image Segmentation 使用预训练语言模型和图像分割的图像字幕
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937098
S. Bianco, Gabriele Ferrario, Paolo Napoletano
Large-scale pre-trained language models, which have learned cross-modal representations on image-text pairs, are becoming popular for vision-language tasks because the fine-tuning to a specific task enables state-of-the-art results. Existing methods require features of image regions as input, but these regions are extracted with an object detection model that does not handle overlapping, noisy and ambiguous regions; this inevitably results in less meaningful features. In this paper we propose a new way to extract region features based on image segmentation, with the goal of reducing overlapping and noise. Our method is motivated by the observation that image segmentation can remove useless pixels using the binary mask to extract only the object of interest.
大规模的预训练语言模型,已经学习了图像-文本对的跨模态表示,在视觉语言任务中越来越受欢迎,因为对特定任务的微调可以获得最先进的结果。现有的方法需要图像区域的特征作为输入,但是这些区域的提取是用一个不处理重叠、噪声和模糊区域的目标检测模型;这将不可避免地导致没有意义的功能。本文提出了一种基于图像分割的区域特征提取方法,以减少重叠和噪声。我们的方法是由观察到的图像分割可以去除无用的像素使用二值掩码提取感兴趣的对象。
{"title":"Image Captioning using Pretrained Language Models and Image Segmentation","authors":"S. Bianco, Gabriele Ferrario, Paolo Napoletano","doi":"10.1109/ICCE-Berlin56473.2022.9937098","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937098","url":null,"abstract":"Large-scale pre-trained language models, which have learned cross-modal representations on image-text pairs, are becoming popular for vision-language tasks because the fine-tuning to a specific task enables state-of-the-art results. Existing methods require features of image regions as input, but these regions are extracted with an object detection model that does not handle overlapping, noisy and ambiguous regions; this inevitably results in less meaningful features. In this paper we propose a new way to extract region features based on image segmentation, with the goal of reducing overlapping and noise. Our method is motivated by the observation that image segmentation can remove useless pixels using the binary mask to extract only the object of interest.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125676792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems 基于深度学习的自由视点视频系统的前景分割分析
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937087
Javier Usón, J. Cabrera, Daniel Corregidor, Narciso García
Volumetric video acquisition systems enable realistic virtual experiences such as Free-Viewpoint Video (FVV). Stereo matching is a well known way of obtaining this volumetric information as depth images, calculating the disparity be-tween two stereo color images. On these applications, the background of the scene captured is static and does not change, so foreground information is much more valuable. We propose adding foreground segmentation to help learning based algorithms, such as deep learning models, improve results previously obtained. We utilized the framework De-tectron2 to model foreground segmentation by detecting people. Additionally, we built a large stereo dataset focused on FVV systems. Finally, we modified a successful deep learning model from the state-of-the-art, CREStereo, to add foreground segmentation and performed supervised training on it to estimate disparity, obtaining promising results.
体积视频采集系统可以实现真实的虚拟体验,如自由视点视频(FVV)。立体匹配是一种众所周知的获得深度图像的体积信息的方法,计算两个立体彩色图像之间的差异。在这些应用程序中,所捕获的场景背景是静态的,不会改变,因此前景信息更有价值。我们建议添加前景分割来帮助基于学习的算法,如深度学习模型,改进先前获得的结果。我们利用De-tectron2框架通过检测人来建模前景分割。此外,我们还建立了一个专注于FVV系统的大型立体数据集。最后,我们修改了一个成功的深度学习模型CREStereo,加入前景分割,并对其进行监督训练来估计视差,获得了很好的结果。
{"title":"Analysing Foreground Segmentation in Deep Learning Based Depth Estimation on Free-Viewpoint Video Systems","authors":"Javier Usón, J. Cabrera, Daniel Corregidor, Narciso García","doi":"10.1109/ICCE-Berlin56473.2022.9937087","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937087","url":null,"abstract":"Volumetric video acquisition systems enable realistic virtual experiences such as Free-Viewpoint Video (FVV). Stereo matching is a well known way of obtaining this volumetric information as depth images, calculating the disparity be-tween two stereo color images. On these applications, the background of the scene captured is static and does not change, so foreground information is much more valuable. We propose adding foreground segmentation to help learning based algorithms, such as deep learning models, improve results previously obtained. We utilized the framework De-tectron2 to model foreground segmentation by detecting people. Additionally, we built a large stereo dataset focused on FVV systems. Finally, we modified a successful deep learning model from the state-of-the-art, CREStereo, to add foreground segmentation and performed supervised training on it to estimate disparity, obtaining promising results.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116586501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incremental Two-Stage Logo Recognition with Knowledge Distillation 基于知识蒸馏的增量两阶段标识识别
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937102
S. Bianco, M. Buzzelli, Gianluca Giudice
The recognition of logos can be useful in developing autonomous checkout systems, or monitoring brand presence and advertisement in shopping malls. The continuous generation and update of new brand logos imposes the definition of a flexible solution to the problem. We therefore define a two-stage logo recognition system composed of an agnostic logo detector, to locate image regions that possess generic logo-like characteristics, and an incremental logo classifier, to progressively update the set of known logo classes. We investigate our solution's sensitivity to regularization and availability of training samples, and we develop two alternative techniques for model compression. Results are presented and compared with state of the art solutions, showing promising results. Our code is made available for public download.
对标识的识别在开发自动结账系统或监控购物中心的品牌存在和广告方面很有用。新品牌标识的不断生成和更新要求对问题的灵活解决方案的定义。因此,我们定义了一个两阶段的标志识别系统,该系统由一个不可知的标志检测器组成,用于定位具有通用标志特征的图像区域,以及一个增量标志分类器,用于逐步更新已知标志类集。我们研究了我们的解决方案对正则化的敏感性和训练样本的可用性,并开发了两种可供选择的模型压缩技术。给出了结果,并与最先进的解决方案进行了比较,显示出有希望的结果。我们的代码可供公众下载。
{"title":"Incremental Two-Stage Logo Recognition with Knowledge Distillation","authors":"S. Bianco, M. Buzzelli, Gianluca Giudice","doi":"10.1109/ICCE-Berlin56473.2022.9937102","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937102","url":null,"abstract":"The recognition of logos can be useful in developing autonomous checkout systems, or monitoring brand presence and advertisement in shopping malls. The continuous generation and update of new brand logos imposes the definition of a flexible solution to the problem. We therefore define a two-stage logo recognition system composed of an agnostic logo detector, to locate image regions that possess generic logo-like characteristics, and an incremental logo classifier, to progressively update the set of known logo classes. We investigate our solution's sensitivity to regularization and availability of training samples, and we develop two alternative techniques for model compression. Results are presented and compared with state of the art solutions, showing promising results. Our code is made available for public download.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120987699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A LoRa Network Emulator Using Software Defined Radio 使用软件定义无线电的LoRa网络仿真器
Pub Date : 2022-09-02 DOI: 10.1109/ICCE-Berlin56473.2022.9937124
M. Shawki, Ziad Ayman, Kareem Mahmoud, T. Elshabrawy
Nowadays, the internet of things and its application have rapid growth in the industry and research committee. Many of those applications require long battery life. Such applications have motivated the recent developments in LoRa technology. LoRa has a simple modulation scheme that allows the transmission with low power consumption, low bit rates, and extensive coverage area through an loT network. As a result, the capacity estimation of LoRa networks has paramount importance in the design phase. This paper aims to develop a LoRa network emulator that can represent the LoRa traffic received from thousands of loT devices at a given LoRa gateway. The proposed emulator utilizes a software defined radio (SDR) device and LoRa commercial transceiver to design a realistic network capacity estimation. The proposed network emulator incorporates different wireless channel conditions within the network under study. Furthermore, the transmitted signals from the SDR consider interference scenarios in terms of relative time over lap as well as SIR between interfering signals. The capacity of LoRa networks can be evaluated by the proposed emulator. The presented emulator for capacity evaluation has the advantage that it derives the cumulative distribution that could be supported from the cell-edge towards the cell-center of an individual LoRa gateway. The experimental result shows that the emulator can generate a representative SIR/SNR profile by comparing target emulated traffic signal level cumulative distribution with that measured by a commercial LoRa transceiver. The emulator adopts a calibration process such that the confidence interval for estimated data extraction rate performance is within the scale of ±2 %.
如今,物联网及其应用在业界和研究委员会中都得到了快速的发展。许多这样的应用都需要很长的电池寿命。这些应用程序推动了LoRa技术的最新发展。LoRa具有简单的调制方案,可以通过loT网络以低功耗、低比特率和广泛的覆盖区域进行传输。因此,LoRa网络的容量估计在设计阶段具有至关重要的意义。本文旨在开发一个LoRa网络仿真器,该仿真器可以表示在给定的LoRa网关上从数千个loT设备接收的LoRa流量。该仿真器利用软件定义无线电(SDR)器件和LoRa商用收发器设计了一个真实的网络容量估计。所提出的网络仿真器在所研究的网络中包含了不同的无线信道条件。此外,从SDR发射的信号考虑了干扰场景的相对重叠时间以及干扰信号之间的SIR。该仿真器可以对LoRa网络的容量进行评估。所提出的容量评估仿真器的优点是,它推导出了从单个LoRa网关的蜂窝边缘到蜂窝中心所支持的累积分布。实验结果表明,仿真器可以通过比较目标仿真的交通信号电平累积分布与商用LoRa收发器测量的交通信号电平累积分布,生成具有代表性的SIR/SNR剖面。仿真器采用了校准过程,使得估计的数据提取率性能的置信区间在±2%的范围内。
{"title":"A LoRa Network Emulator Using Software Defined Radio","authors":"M. Shawki, Ziad Ayman, Kareem Mahmoud, T. Elshabrawy","doi":"10.1109/ICCE-Berlin56473.2022.9937124","DOIUrl":"https://doi.org/10.1109/ICCE-Berlin56473.2022.9937124","url":null,"abstract":"Nowadays, the internet of things and its application have rapid growth in the industry and research committee. Many of those applications require long battery life. Such applications have motivated the recent developments in LoRa technology. LoRa has a simple modulation scheme that allows the transmission with low power consumption, low bit rates, and extensive coverage area through an loT network. As a result, the capacity estimation of LoRa networks has paramount importance in the design phase. This paper aims to develop a LoRa network emulator that can represent the LoRa traffic received from thousands of loT devices at a given LoRa gateway. The proposed emulator utilizes a software defined radio (SDR) device and LoRa commercial transceiver to design a realistic network capacity estimation. The proposed network emulator incorporates different wireless channel conditions within the network under study. Furthermore, the transmitted signals from the SDR consider interference scenarios in terms of relative time over lap as well as SIR between interfering signals. The capacity of LoRa networks can be evaluated by the proposed emulator. The presented emulator for capacity evaluation has the advantage that it derives the cumulative distribution that could be supported from the cell-edge towards the cell-center of an individual LoRa gateway. The experimental result shows that the emulator can generate a representative SIR/SNR profile by comparing target emulated traffic signal level cumulative distribution with that measured by a commercial LoRa transceiver. The emulator adopts a calibration process such that the confidence interval for estimated data extraction rate performance is within the scale of ±2 %.","PeriodicalId":138931,"journal":{"name":"2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128882751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 IEEE 12th International Conference on Consumer Electronics (ICCE-Berlin)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1