2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)最新文献

英文中文

Audio processing with using Python language science libraries 音频处理与使用Python语言的科学库

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563430

Tatsiana Viarbitskaya, A. Dobrucki

The topic of the article is recognition of instruments and playing techniques of music for detection and correction of errors in a given music sample. It shows how to achieve characteristics of recorded sound and also how to compare amplitudes and frequencies of the same music piece, but played by different persons and also with using various instruments. For this aim the signal processing algorithms are used, which are available in standard Python libraries such as “numpy” or “scipy”. The key idea of the processing is detection of errors, but save playing technique and individual style of the player.

本文的主题是识别乐器和音乐演奏技术，以检测和纠正给定音乐样本中的错误。它展示了如何实现录制声音的特征，以及如何比较由不同人演奏的同一音乐作品的振幅和频率，也使用不同的乐器。为此，使用了信号处理算法，这些算法可以在标准Python库中获得，例如“numpy”或“scipy”。处理的关键思想是检测错误，但保留玩家的演奏技术和个人风格。

引用次数: 7

Adaptive methods of time-dependent crowd density distribution visualization 时变人群密度分布可视化的自适应方法

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563391

Marianna Parzych, T. Marciniak, A. Dabrowski

The paper presents an analysis of visualization methods of crowd density visualization. Generated density maps take into account changes in time. Three methods have been implemented and tested. The first one uses motion detection based on the background subtraction. The second one is based on BLOBs (binary large objects) analysis. The third method uses interest points ie. points on the image that can be used by the object track the movement. The tests were performed using the PETS2009 video sequence database. The obtained maps were evaluated and the time consumptions were estimated.

本文分析了人群密度可视化的可视化方法。生成的密度图考虑了时间的变化。已经实施和测试了三种方法。第一种是基于背景减法的运动检测。第二种是基于blob(二进制大对象)分析。第三种方法是使用兴趣点。图像上可以被物体使用的点跟踪运动。使用PETS2009视频序列数据库进行测试。对获得的地图进行了评估，并估计了时间消耗。

引用次数: 0

Spatial Transformations in Deep Neural Networks 深度神经网络中的空间变换

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563429

Michał Bednarek, K. Walas

Convolutional Neural Networks (CNNs) have brought us the exceptionally significant improvement in the performance of the variety of visual tasks, such as object classification, semantic segmentation or linear regression. However, these powerful neural models suffer from the lack of spatial invariance. In this paper, we introduce the end-to-end system that is able to learn such invariance including in-plane and out-of-plane rotations. We performed extensive experiments on variations of widely known MNIST dataset, which consist of images subjected to deformations. Our comparative results show that we can successfully improve the classification score by implementing so-called Spatial Transformer module.

卷积神经网络(cnn)为我们带来了各种视觉任务性能的显著改善，如对象分类、语义分割或线性回归。然而，这些强大的神经模型缺乏空间不变性。在本文中，我们引入了一个端到端系统，它能够学习平面内和平面外旋转的不变性。我们对广为人知的MNIST数据集进行了广泛的实验，该数据集由变形的图像组成。我们的比较结果表明，通过实现所谓的空间转换器模块，我们可以成功地提高分类分数。

引用次数: 0

Analysis of application possibilities of Grey System Theory to detection of acoustic feedback 灰色系统理论在声反馈检测中的应用可能性分析

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563432

Maciej Sabiniok, S. Brachmański

Notch-filter-based howling suppression is one of the most popular gain reduction method of dealing with acoustic feedback problem. The main goal of this paper is to analyze the possibilities of using the grey prediction model GM(1,1) in order to accelerate the feedback detection process of the algorithm. Computer based comparative simulations of the algorithm containing the prediction model in the detection stage and without it were performed. Simulations were performed for different prediction order, number of predicted samples and analysis window length. The comparison and evaluation were carried out for different source signals. Music, speech and noise signals were used.

基于陷波滤波器的嚎叫抑制是处理声反馈问题最常用的增益降低方法之一。本文的主要目的是分析使用灰色预测模型 GM(1,1) 加快算法反馈检测过程的可能性。本文对在检测阶段包含预测模型和不包含预测模型的算法进行了基于计算机的比较模拟。模拟针对不同的预测顺序、预测样本数量和分析窗口长度进行。比较和评估针对不同的信号源进行。使用了音乐、语音和噪声信号。

引用次数: 1

Microprocessor implementation of the sound source location process based on the correlation of signals 用微处理器实现基于信号相关的声源定位过程

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563401

Krzysztof Krupa, Marcin Grochowina

Sound direction estimation can be used in many different mechatronic systems, while the use of bare-metal programming microcontrollers allows for miniaturization and broadening the range of applications. The paper presents a microprocessor implementation of the system allowing to determine the azimuth for the source of sound. The device operates based on the measurement of the phase shift of the incoming signal to two spaced apart microphones. The algorithm based on calculating the correlation of sound signals using the FFT algorithm was used in the research.

声音方向估计可用于许多不同的机电系统，而使用裸金属编程微控制器允许小型化和扩大应用范围。本文提出了一个微处理器实现的系统，允许确定声源的方位角。该装置的工作原理是测量输入信号到两个间隔麦克风的相移。研究中采用了基于FFT算法计算声音信号相关性的算法。

引用次数: 0

Marking the Allophones Boundaries Based on the DTW Algorithm 基于DTW算法的音素边界标记

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563359

J. Rafalko

The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighbourhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border between allophones is in some cases very difficult to determine. Nowadays, this task is carried out manually in cooperation with specialists in the field of phonetics. The presented approach allows to build a system that is able to automate this process. The aim of the work currently carried out by the author is a method that facilitates the training material processing for the needs of the development of multimodal speech recognition systems. For this purpose, the difficult problem of marking boundaries of allophones is solved in this report based on the Polish dictionary in the context of the creation of allophone bases for speech synthesis. This is done in this way due to the simplified possibility of organizing critical listening and subjective evaluation of received allophones by a large group of Polish native speakers (73 people). Strengthening the method will allow it to be used for the extraction of allophones for the needs of developed system of automatic transcription of English speech and for its notation according to the IPA standard. The analysed continuous speech is combined in the DTW algorithm with a synthesized speech signal. The comparison of both signals is performed not in the time domain as in the classical DTW, but in the frequency domain. This allows for a statement that the phonetic content of both signals is compared. The paper describes the process of marking the boundaries of allophones for the Polish language, however after appropriate modifications, this approach can be used to determine the allophones boundaries in other languages, especially for English.

提出了一种基于动态时间翘曲(DTW)算法的语音信号中音素边界标记方法。由于相邻音素之间的相互影响，连续语音中音素边界的设置和标记是一个难题。就是这个邻域，一方面创造了音素的变体，即音素的变体，另一方面，它影响了音素之间的边界在某些情况下很难确定。如今，这项任务是与语音学领域的专家合作手工完成的。提出的方法允许构建一个能够自动化此过程的系统。作者目前开展的工作的目的是为开发多模态语音识别系统的需要提供一种促进训练材料处理的方法。为此，本报告以波兰语词典为基础，在创建语音合成的音素基的背景下，解决了音素边界标注的难题。这样做的原因是，通过一大批波兰语母语者(73人)对收到的音素进行组织批判性听力和主观评价的可能性简化了。对该方法进行强化，可用于已开发的英语语音自动转录系统的音素提取和按国际音标标准标注语音。分析后的连续语音在DTW算法中与合成语音信号相结合。这两个信号的比较不是在时域进行的，而是在频域进行的。这样就可以比较两个信号的语音内容。本文描述了波兰语的音素边界标记过程，但经过适当的修改，这种方法可以用于确定其他语言的音素边界，特别是英语。

{"title":"Marking the Allophones Boundaries Based on the DTW Algorithm","authors":"J. Rafalko","doi":"10.23919/SPA.2018.8563359","DOIUrl":"https://doi.org/10.23919/SPA.2018.8563359","url":null,"abstract":"The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighbourhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border between allophones is in some cases very difficult to determine. Nowadays, this task is carried out manually in cooperation with specialists in the field of phonetics. The presented approach allows to build a system that is able to automate this process. The aim of the work currently carried out by the author is a method that facilitates the training material processing for the needs of the development of multimodal speech recognition systems. For this purpose, the difficult problem of marking boundaries of allophones is solved in this report based on the Polish dictionary in the context of the creation of allophone bases for speech synthesis. This is done in this way due to the simplified possibility of organizing critical listening and subjective evaluation of received allophones by a large group of Polish native speakers (73 people). Strengthening the method will allow it to be used for the extraction of allophones for the needs of developed system of automatic transcription of English speech and for its notation according to the IPA standard. The analysed continuous speech is combined in the DTW algorithm with a synthesized speech signal. The comparison of both signals is performed not in the time domain as in the classical DTW, but in the frequency domain. This allows for a statement that the phonetic content of both signals is compared. The paper describes the process of marking the boundaries of allophones for the Polish language, however after appropriate modifications, this approach can be used to determine the allophones boundaries in other languages, especially for English.","PeriodicalId":265587,"journal":{"name":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"277 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122937932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Infrared thermal camera-based system for tram drivers warning about hazardous situations 基于红外热像仪的电车司机危险情况预警系统

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563417

A. Konieczka, Ewelina Michałowicz, Karol Piniarski

In this paper, we propose a new thermal camera-based system for tram drivers. It aims to increase the safety of tram traffic at night. The proposed solution uses a standard vision camera and a thermal camera. Firstly, it processes the achieved images in order to detect the tram tracks. Secondly, it detects people or obstacles on tracks and generates warnings for the driver. This solution has been tested in static condition using a standard-gauge tram. The achieved results prove that this prototype system can effectively warn of danger situations especially in dark places.

在本文中，我们为电车司机提出了一种基于热像仪的新型系统。该系统旨在提高夜间电车交通的安全性。建议的解决方案使用标准视觉摄像头和热像仪。首先，系统会处理所获得的图像，以检测电车轨道。其次，它能检测到轨道上的人员或障碍物，并向司机发出警告。该解决方案已在静态条件下使用标准轨距有轨电车进行了测试。测试结果证明，该原型系统能够有效地发出危险警报，尤其是在黑暗的地方。

引用次数: 2

Electronic Systems and Interfaces Aiding the Visually Impaired 帮助视障人士的电子系统和接口

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563381

P. Strumiłło

Visual impairment is one of the most serious sensory disabilities. It deprives a human being of an active professional and social live. EU reports indicate that for every 1000 Europeans citizens 4 are blind or suffer from serious visual impairment and this number is predicted to increase with time due to our ageing society. In spite of numerous, worldwide research efforts focusing on building innovative aids helping the blind no single electronic travel aid (ETA) solution has been widely accepted by the blind community. The aim of the tutorial is to apprise the current state of the art in the field of electronic interfaces aiding the blind in independent travel, navigation and access to information. Functional solutions and outcomes of recent research projects devoted to assistive technologies for the visually impaired will be presented.

视觉障碍是最严重的感官障碍之一。它剥夺了一个人积极的职业和社会生活。欧盟的报告显示，每1000名欧洲公民中就有4人失明或患有严重的视力障碍，由于我们的老龄化社会，这一数字预计会随着时间的推移而增加。尽管世界范围内进行了大量的研究工作，重点是建立创新的帮助盲人的辅助工具，但没有一种电子旅行辅助工具(ETA)解决方案被盲人社区广泛接受。该教程的目的是介绍帮助盲人独立旅行、导航和获取信息的电子接口领域的最新技术。将介绍最近致力于视障人士辅助技术的研究项目的功能解决方案和成果。

引用次数: 0

Perfect Low Power Narrowband Transmitters for Dense Wireless Sensor Networks 完美的低功耗窄带发射机密集无线传感器网络

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563390

A. Platonov, I. Zaitsev

This paper presents the backgrounds of approach to optimization and design of software defined adaptive feedback communication systems (AFCS) for applications at the physical (PHY) layer of wireless sensor networks. A particular feature of AFCS is they transmit the signals from digital or analog sensors to the base stations (BS) using pulse-amplitude (PAM) modulators adaptively adjusted by the controls formed in BS, no coding. Absence of coders permits to derive optimal transmission-reception algorithms determining the way of optimal AFCS design. Adaptive properties of the systems permit to transmit data to BS perfectly, i.e. with energy and spectral efficiencies attaining Shannon's limits. The not used before adequate measures of AFCS performance are discussed and used for investigation of designed prototype of optimal AFCS functioning. Optimal AFCS may become a perspective class of high efficient narrowband low energy communication channels for the wireless sensor networks.

本文介绍了应用于无线传感器网络物理层的软件自适应反馈通信系统(AFCS)的优化和设计方法的研究背景。AFCS的一个特殊特点是，它们使用脉冲幅度(PAM)调制器将来自数字或模拟传感器的信号传输到基站(BS)，该调制器由BS中形成的控件自适应调节，无需编码。在没有编码器的情况下，可以推导出最优的发送-接收算法，从而确定最优的AFCS设计方法。系统的自适应特性允许将数据完美地传输到BS，即能量和频谱效率达到香农极限。讨论了之前未使用的AFCS性能指标，并将其用于优化AFCS功能的设计原型的研究。最优AFCS可能成为无线传感器网络中一类高效窄带低能量通信信道。

引用次数: 1

Fuzzy Bayesian Filter for Sound Environment by Considering Additive Property of Energy Variable and Fuzzy Observation in Decibel Scale 考虑能量变量可加性和分贝尺度模糊观测的声环境模糊贝叶斯滤波

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

Pub Date : 2018-09-01 DOI: 10.23919/SPA.2018.8563341

A. Ikuta, H. Orimoto

In the measurement and evaluation of actual random signal in a sound environment, the observed data often contain the fuzziness due to several causes. Furthermore, there exists usually a background noise in addition to the objective specific signal, and it is often that the specific signal partly or completely is buried in the background noise. In this paper, a fuzzy Bayesian filter for estimating a specific signal, based on the observed data containing the fuzziness, and the effects of a background noise with non-Gaussian type is proposed. More specifically, after paying attention to the energy variables satisfying the additive property of the specific signal and background noise, by introducing a new type of membership function suitable for the energy variable and the observation in decibel scale, a state estimation method is theoretically derived. The proposed theory is applied to the actual estimation problem of the sound environment, and its usefulness is experimentally verified.

在声环境中实际随机信号的测量和评价中，由于多种原因，观测数据往往具有模糊性。此外，除了客观的特定信号外，通常还存在背景噪声，并且通常特定信号部分或全部被淹没在背景噪声中。本文基于观测数据的模糊性和非高斯型背景噪声的影响，提出了一种用于估计特定信号的模糊贝叶斯滤波器。更具体地说，在关注满足特定信号和背景噪声加性的能量变量后，通过引入一种适合于能量变量和分贝尺度观测值的新型隶属函数，从理论上推导出一种状态估计方法。将该理论应用于声环境的实际估计问题，并通过实验验证了其有效性。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀