2022 IEEE International Conference on Consumer Electronics - Taiwan最新文献

英文中文

Scene Retrieval in Soccer Videos by Spatial-temporal Attention with Video Vision Transformer 基于视频视觉变换的时空注意足球视频场景检索

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869188

Yaozong Gan, Ren Togo, Takahiro Ogawa, M. Haseyama

This paper presents a scene retrieval method in soccer videos with video vision Transformer (ViViT). In soccer coaching, it is difficult for the training staff to find the required scenes efficiently from the large number of soccer videos. We tackle this problem with a simple yet effective method. We train ViViT and obtain the output token features of the soccer scene by the pre-trained ViViT model. The output tokens of the pre-trained ViViT contain spatio-temporal information of soccer scenes. We then transform a query scene and candidate scenes into output token features using the pre-trained ViViT and calculate the similarity between the tokens with cosine similarity. We conducted experiments on SoccerNet-V2dataset. The experimental results show that the proposed method achieves outstanding retrieval accuracy compared to the previous methods.

提出了一种基于视频视觉转换器(ViViT)的足球视频场景检索方法。在足球训练中，训练人员很难从大量的足球视频中高效地找到所需的场景。我们用一种简单而有效的方法来解决这个问题。我们对ViViT进行训练，并通过预训练好的ViViT模型获得足球场景的输出token特征。预训练ViViT的输出令牌包含了足球场景的时空信息。然后，我们使用预训练的ViViT将查询场景和候选场景转换为输出标记特征，并计算标记之间的余弦相似度。我们在SoccerNet-V2dataset上进行了实验。实验结果表明，与以往的方法相比，该方法取得了较好的检索精度。

引用次数: 0

Heuristic Optimization based Abnormal Posture Detection Algorithm 基于启发式优化的异常姿态检测算法

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869081

Yufeng Li, Lin Shang, Peng Pan

This paper studies the abnormal posture detection algorithm based on heuristic optimization. Using the data collected by sensors, the features such as acceleration and angular velocity are extracted and put into the classifiers for training. We select the appropriate heuristic algorithm according to different classifier models for optimization. The results demonstrate that, in binary classification experiment, the accuracy ratio of the K-Nearest Neighbor (KNN) model is 99.54%, and the AUC is 0.99. In quad classification experiment, The Support Vector Machine (SVM) model has a 94.32% accuracy ratio and a 0.95 AUC, which has the optimal performance.

本文研究了基于启发式优化的异常姿态检测算法。利用传感器采集的数据，提取加速度、角速度等特征，并将其输入分类器进行训练。我们根据不同的分类器模型选择合适的启发式算法进行优化。结果表明，在二值分类实验中，k -最近邻(KNN)模型的准确率为99.54%，AUC为0.99。在四组分类实验中，支持向量机(SVM)模型的准确率为94.32%，AUC为0.95，具有最优的分类性能。

引用次数: 0

Customized Speaker Verification System with Noise-Cancellation using Blind Source Separation 使用盲源分离的消噪定制扬声器验证系统

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869211

Tsung-Han Tsai, Ping-Cheng Hao, Fong-Lin Tsai

In this paper, a customized speaker verification system combined with noise-cancellation using blind source separation was proposed. This system is divided into two phases: the noise-cancellation phase and the speaker verification phase. In the noise-cancellation phase, a fast time-frequency mask technique based on Short Time Fourier Transform (STFT) was proposed for separating a mixture of two input sounds in a single signal. After obtaining the separated speech data, this input is processed to the wake-up word system. In the speaker verification phase, we use Mel-Frequency Cepstral Coefficients (MFCC) as the feature extraction module. Then we train the feature data into a voiceprint model and a state sequence model of the speaker using Gaussian mixture model (GMM) and hidden Markov model (HMM), respectively. An analysis is done on noisy speech signals corrupted by white noise at different angles. Based on the output SIR (Signal to Interference Ratio) and SDR (Signal to Distortion Ratio) analysis, the improved accuracy is derived in the proposed system. We have obtained promising results in the real experimental environment.

本文提出了一种结合盲源分离噪声消除的定制说话人验证系统。该系统分为两个阶段:噪声消除阶段和说话人验证阶段。在噪声消除阶段，提出了一种基于短时傅里叶变换(STFT)的快速时频掩模技术，用于分离单个信号中两个输入声音的混合。在获得分离的语音数据后，该输入被处理到唤醒词系统。在说话人验证阶段，我们使用Mel-Frequency倒谱系数(MFCC)作为特征提取模块。然后分别使用高斯混合模型(GMM)和隐马尔可夫模型(HMM)将特征数据训练成说话人的声纹模型和状态序列模型。分析了白噪声在不同角度下对语音信号的干扰。通过对输出信号的SIR(信干扰比)和SDR(信失真比)分析，得到了系统精度的提高。我们在真实的实验环境中取得了令人满意的结果。

{"title":"Customized Speaker Verification System with Noise-Cancellation using Blind Source Separation","authors":"Tsung-Han Tsai, Ping-Cheng Hao, Fong-Lin Tsai","doi":"10.1109/ICCE-Taiwan55306.2022.9869211","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan55306.2022.9869211","url":null,"abstract":"In this paper, a customized speaker verification system combined with noise-cancellation using blind source separation was proposed. This system is divided into two phases: the noise-cancellation phase and the speaker verification phase. In the noise-cancellation phase, a fast time-frequency mask technique based on Short Time Fourier Transform (STFT) was proposed for separating a mixture of two input sounds in a single signal. After obtaining the separated speech data, this input is processed to the wake-up word system. In the speaker verification phase, we use Mel-Frequency Cepstral Coefficients (MFCC) as the feature extraction module. Then we train the feature data into a voiceprint model and a state sequence model of the speaker using Gaussian mixture model (GMM) and hidden Markov model (HMM), respectively. An analysis is done on noisy speech signals corrupted by white noise at different angles. Based on the output SIR (Signal to Interference Ratio) and SDR (Signal to Distortion Ratio) analysis, the improved accuracy is derived in the proposed system. We have obtained promising results in the real experimental environment.","PeriodicalId":164671,"journal":{"name":"2022 IEEE International Conference on Consumer Electronics - Taiwan","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114768232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Integration Method for ECG Multi-Classification 一种心电多分类的集成方法

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869291

Chao-Xin Xie, Minghui Fan, Liang-Hung Wang, Pao-Cheng Huang

The application of artificial intelligence to the diagnosis of ECG is of great significance. We combine machine learning algorithm with deep learning algorithm to give full play to the advantages of different algorithms by ensemble learning. Finally, we fuse the selected models so that the accuracy of identifying five kinds of arrhythmias can reach 94%. Particularly, the accuracy of class F beat which is difficult to identify has also been improved.

人工智能在心电图诊断中的应用具有重要意义。我们将机器学习算法与深度学习算法相结合，通过集成学习充分发挥不同算法的优势。最后对所选模型进行融合，使五种心律失常的识别准确率达到94%。特别是，难以识别的F类拍的精度也得到了提高。

引用次数: 0

DriverID: Driver Identity System Based on Voiceprint and Acoustic Sensing DriverID:基于声纹和声传感的驾驶员身份识别系统

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869000

Kam-Hong Chan, C. Chao

The identification of drivers is essential for many applications, such as attribution of liability for car accidents and driving risk assessment. Most existing driver identification systems adopt identity keys (such as car keys and smart cards) or biometrics technology (such as face recognition, iris recognition, fingerprint recognition, voiceprint recognition, and vein recognition, etc.) to identify drivers. However, these schemes are unable to detect driver changes during a trip. In this paper, combining voiceprint and acoustic driving characteristics, the driver identity system DriverID is proposed to identify the person who is actually driving. DriverID uses the Deep Residual Network (ResNet) to construct an acoustic recognition model based on the voice key recorded by the driver. In addition, the Convolutional Neural Network (CNN) is used to construct an acoustic driving action recognition model based on the reflection of acoustic signals generated by the user. Combining the two recognition methods, DriverID can correctly identify the driver with high probability. It is believed that DriverID is a practical driver identity system.

驾驶员的身份识别在许多应用中是必不可少的，例如车祸的责任归属和驾驶风险评估。现有的驾驶员身份识别系统大多采用身份钥匙(如车钥匙、智能卡)或生物识别技术(如人脸识别、虹膜识别、指纹识别、声纹识别、静脉识别等)来识别驾驶员。然而，这些方案无法检测到行驶过程中驾驶员的变化。本文结合声纹和声学驾驶特性，提出了驾驶员身份识别系统DriverID来识别实际驾驶人。DriverID利用深度残差网络(Deep Residual Network, ResNet)，根据驾驶员录制的语音键构建声音识别模型。此外，利用卷积神经网络(CNN)基于用户产生的声信号反射构建声驾驶动作识别模型。结合这两种识别方法，DriverID能够以高概率正确识别驾驶员。认为DriverID是一种实用的司机身份识别系统。

{"title":"DriverID: Driver Identity System Based on Voiceprint and Acoustic Sensing","authors":"Kam-Hong Chan, C. Chao","doi":"10.1109/ICCE-Taiwan55306.2022.9869000","DOIUrl":"https://doi.org/10.1109/ICCE-Taiwan55306.2022.9869000","url":null,"abstract":"The identification of drivers is essential for many applications, such as attribution of liability for car accidents and driving risk assessment. Most existing driver identification systems adopt identity keys (such as car keys and smart cards) or biometrics technology (such as face recognition, iris recognition, fingerprint recognition, voiceprint recognition, and vein recognition, etc.) to identify drivers. However, these schemes are unable to detect driver changes during a trip. In this paper, combining voiceprint and acoustic driving characteristics, the driver identity system DriverID is proposed to identify the person who is actually driving. DriverID uses the Deep Residual Network (ResNet) to construct an acoustic recognition model based on the voice key recorded by the driver. In addition, the Convolutional Neural Network (CNN) is used to construct an acoustic driving action recognition model based on the reflection of acoustic signals generated by the user. Combining the two recognition methods, DriverID can correctly identify the driver with high probability. It is believed that DriverID is a practical driver identity system.","PeriodicalId":164671,"journal":{"name":"2022 IEEE International Conference on Consumer Electronics - Taiwan","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123907479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

LSTM-Based Ransomware Detection Using API Call Information 基于lstm的API调用信息的勒索软件检测

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869122

Kohei Tsunewaki, Tomotaka Kimura, Jun Cheng

In this paper, we propose a ransomware detection method based on API IDs and call intervals. In the proposed method, to detect ransomware, when each API call occurs, we input both the API ID and the call interval from the previous call into an LSTM (Long Short Term Memory). By inputting the API IDs and call intervals into LSTM, we can learn the characteristics of the time series change of API calls in the ransomware. Through the experiments using an original dataset, we demonstrated that the accuracy of our proposed method was high and the characteristic learning of the call interval was useful for detecting ransomware.

本文提出了一种基于API id和调用间隔的勒索软件检测方法。在提出的方法中，为了检测勒索软件，当每个API调用发生时，我们将API ID和前一次调用的调用间隔输入到LSTM(长短期记忆)中。通过将API id和调用间隔输入到LSTM中，我们可以了解勒索软件中API调用的时间序列变化特征。通过使用原始数据集的实验，我们证明了我们提出的方法的准确性很高，并且调用间隔的特征学习对检测勒索软件是有用的。

引用次数: 1

Scheduling Method for Improving Transmission and Reception Efficiency in IEEE802.15.4 used Heterogeneous Wireless Sensor Networks 一种提高IEEE802.15.4异构无线传感器网络收发效率的调度方法

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869210

Kohei Hayashi, Rrota Horiuchi, N. Komuro

In recent years, WSNs have been expected to be applied to various fields such as home security, healthcare, and environmental monitoring. A number of studies have been done on IEEE 802.15.4, the standard for WSNs, but they have left some problems and there is still room for improvement. In this paper, we propose a new access control protocol in tree-type heterogeneous sensor networks that achieves low EC, high PDR, and low latency by adjusting the active period so that the buffer occupancy ratio of the relay node is less than 1 to prevent the buffer from overflowing, and then performing channel partitioning and scheduling to avoid packet collisions.

近年来，无线传感器网络有望应用于家庭安全、医疗保健、环境监测等各个领域。针对无线传感器网络的标准IEEE 802.15.4已经进行了大量的研究，但仍存在一些问题，仍有改进的空间。本文提出了一种新的树型异构传感器网络访问控制协议，通过调整活动周期使中继节点的缓冲区占用率小于1以防止缓冲区溢出，然后进行通道划分和调度以避免分组冲突，从而达到低EC、高PDR和低延迟的目的。

引用次数: 0

A New Light Weight Convolutional Neural Network for Mobile Devices 面向移动设备的新型轻量级卷积神经网络

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869273

Kuan-Ting Lai, Guo-Shiang Lin

In this paper, we propose a light-weight convolutional neural network based on depth-wise separable convolutions and cross stage partial (CSP) network. Dissimilar to MobileNetV3, the proposed network is composed of some CSP blocks to reduce the model size and computational operations. For performance evaluation, Cifar10 and Cifar100 are used for testing. Compared to MobileNetv3, the model size and execution time of the proposed network in PC and mobile device are smaller. Therefore, the experimental results show that the proposed light-weight network can effectively extract visual features for image classification compared with MobileNetV3.

本文提出了一种基于深度可分离卷积和跨阶段部分(CSP)网络的轻量级卷积神经网络。与MobileNetV3不同，提出的网络由一些CSP块组成，以减少模型大小和计算操作。性能评估使用Cifar10和Cifar100进行测试。与MobileNetv3相比，本文提出的网络在PC和移动设备上的模型大小和执行时间都更小。因此，实验结果表明，与MobileNetV3相比，所提出的轻量级网络可以有效地提取用于图像分类的视觉特征。

引用次数: 0

Reducing Large LDNMOSFET Substrate Currents by Modifying Isolation Ring Voltages 通过修改隔离环电压来减小大LDNMOSFET衬底电流

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9868978

Sue-Yi Chen, Shao-Chang Huang, K. Hsu, Yin-Wei Peng, Jiabin Dong, J. Gan

Large substrate currents could induce the device melt in power manager integrated circuit applications. Many researches are focused on how to reduce substrate currents from process modifications. In this paper, the fundamental substrate current mechanism analyses are studied. Then, a tracing-high voltage between the device drain terminal and the device source terminal applied on the isolation ring is proposed for substrate current reductions. Engineers can apply this method for avoiding the device burned-out without the complicated process changes.

在电源管理集成电路中，较大的衬底电流会导致器件熔化。许多研究都集中在如何通过工艺修改来减少衬底电流。本文对衬底电流的基本机理进行了分析。然后，在隔离环上施加器件漏极和器件源端之间的跟踪高压，以减小衬底电流。工程师可以应用这种方法避免设备烧毁，而不需要改变复杂的工艺。

引用次数: 0

Cooperative Fall Detection with Multiple Cameras 多摄像头协同跌倒检测

2022 IEEE International Conference on Consumer Electronics - Taiwan

Pub Date : 2022-07-06 DOI: 10.1109/ICCE-Taiwan55306.2022.9869279

Jian-Chiuan Hou, Weimin Xu, Yuanyuan Chu, Chih-Lin Hu, Ying-Hong Chen, Shi Chen, Lin Hui

We propose a fall detection mechanism based on multi-camera cooperation in home space. Cameras capture image-based falling events, and self-organize a group using deep reinforcement learning. Neighbor cameras exchange sensing data and statuses in local network proximity. With information sharing in a group, cameras can improve the accuracy of decision making on falling events and cope with the limited fields of view against physical deployment of cameras in residential areas.

提出了一种基于家庭空间多摄像头协同的跌倒检测机制。相机捕捉基于图像的坠落事件，并使用深度强化学习自组织一个群体。相邻摄像机在本地网络邻近中交换传感数据和状态。通过一组信息共享，摄像头可以提高对坠落事件决策的准确性，并应对在居民区部署摄像头的有限视野。

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2022 IEEE International Conference on Consumer Electronics - Taiwan

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀