首页 > 最新文献

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific最新文献

英文 中文
Multi-block dependency based watermarking scheme for binary-text image authentication 基于多块依赖的二进制文本图像认证水印方案
Fan Chen, Yao Qin, Hongjie He
To improve the ability against different counterfeiting attacks, a watermarking algorithm is proposed for binary-text image (BTI) authentication. To protect the uniform regions in BTI, the watermark information of a fixed-size block is generated according to the content of it and divided into three parts. One part is embedded in the flippable pixels of itself, and the other two parts are respectively embedded in the flippable pixels of other two blocks in BTI, which are randomly chosen based on the secret key. This strategy can not only introduce the block-wise dependence, but also make it possible for the authentication watermark of a uniform block to be embedded in the BTI. In the tamper detection stage, a multi-block based statistic detection method is designed to verify the validity of an image block. Simulation results show that the proposed algorithm can achieve a good imperceptibility and have an ability resisting the maliciously attacks such as collage attack, delete tampering, replace tampering etc.
为了提高对各种伪造攻击的防御能力,提出了一种用于二进制文本图像(BTI)认证的水印算法。为了保护BTI中的均匀区域,根据固定大小块的内容生成水印信息,并将其分为三部分。其中一部分嵌入到自身的可翻转像素中,另外两部分分别嵌入到BTI中根据密钥随机选择的其他两个块的可翻转像素中。该策略不仅可以引入块依赖,而且可以将统一块的认证水印嵌入到BTI中。在篡改检测阶段,设计了一种基于多块的统计检测方法来验证图像块的有效性。仿真结果表明,该算法具有良好的不可感知性,能够抵抗拼贴攻击、删除篡改、替换篡改等恶意攻击。
{"title":"Multi-block dependency based watermarking scheme for binary-text image authentication","authors":"Fan Chen, Yao Qin, Hongjie He","doi":"10.1109/APSIPA.2014.7041611","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041611","url":null,"abstract":"To improve the ability against different counterfeiting attacks, a watermarking algorithm is proposed for binary-text image (BTI) authentication. To protect the uniform regions in BTI, the watermark information of a fixed-size block is generated according to the content of it and divided into three parts. One part is embedded in the flippable pixels of itself, and the other two parts are respectively embedded in the flippable pixels of other two blocks in BTI, which are randomly chosen based on the secret key. This strategy can not only introduce the block-wise dependence, but also make it possible for the authentication watermark of a uniform block to be embedded in the BTI. In the tamper detection stage, a multi-block based statistic detection method is designed to verify the validity of an image block. Simulation results show that the proposed algorithm can achieve a good imperceptibility and have an ability resisting the maliciously attacks such as collage attack, delete tampering, replace tampering etc.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125025935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Quality-based channel selection in multi-channel radio-over-fiber system 多通道光纤无线通信系统中基于质量的信道选择
Withawat Tangtrongpairoj, T. Higashino, M. Okada
Radio over Fiber (RoF) is a promising solution for wireless access services by transferring the heterogeneous radio signal via the optical fiber link. However, RoF devices have nonlinear characteristics which create intermodulation products in system. The intermodulation distortion (IMD) interferes uplink RF signals in the presence of coupling between downlink and uplink antennas in the base station (BS). This paper proposed the performance evaluation due to coupled downlink interfere to uplink antenna. The carrier to distortion plus noise ratio (CDNR) is evaluated for all combinations. By using NS3 network simulator, the result shows the best combination achieves better performance. Which coupled downlink interfere in uplink signal can be reduced when amount of downlink packet is decreased.
光纤无线电(RoF)通过光纤链路传输异构无线电信号,是一种很有前途的无线接入服务解决方案。然而,RoF器件具有非线性特性,会在系统中产生互调产物。在基站下行天线和上行天线之间存在耦合的情况下,互调失真(IMD)会干扰上行射频信号。本文提出了由于下行链路耦合干扰对上行天线的性能评估。对所有组合的载波失真加噪声比(CDNR)进行了评估。通过对NS3网络进行仿真,结果表明,最佳组合可以获得更好的性能。减少下行分组的数量可以减少上行信号中的耦合下行干扰。
{"title":"Quality-based channel selection in multi-channel radio-over-fiber system","authors":"Withawat Tangtrongpairoj, T. Higashino, M. Okada","doi":"10.1109/APSIPA.2014.7041690","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041690","url":null,"abstract":"Radio over Fiber (RoF) is a promising solution for wireless access services by transferring the heterogeneous radio signal via the optical fiber link. However, RoF devices have nonlinear characteristics which create intermodulation products in system. The intermodulation distortion (IMD) interferes uplink RF signals in the presence of coupling between downlink and uplink antennas in the base station (BS). This paper proposed the performance evaluation due to coupled downlink interfere to uplink antenna. The carrier to distortion plus noise ratio (CDNR) is evaluated for all combinations. By using NS3 network simulator, the result shows the best combination achieves better performance. Which coupled downlink interfere in uplink signal can be reduced when amount of downlink packet is decreased.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122797541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription 通过选择相似的演讲者进行演讲转录的DNN-HMM的无监督演讲者自适应
M. Mimura, Tatsuya Kawahara
Unsupervised speaker adaptation of Deep Neural Network (DNN) is investigated for lecture transcription tasks, in which a single speaker gives a long speech and thus speaker adaptation is important. The proposed method selects similar speakers to the test data (test speaker) from the training database, which are used for retraining the baseline DNN. Several speaker characteristic features are defined for the speaker similarity measure. The feature based on Universal Background Model (UBM) and principal component analysis (PCA) achieves the best performance, resulting in a significant improvement from the baseline DNN and also from the adapted GMM-HMM system. The method is combined with a naive adaptation method using the initial ASR hypothesis of the test data, and an additional improvement is achieved.
研究了基于深度神经网络(DNN)的无监督演讲人自适应的演讲转录任务,在这种任务中,演讲人自适应是一个重要的问题。该方法从训练数据库中选择与测试数据(测试说话人)相似的说话人,用于对基线DNN进行再训练。定义了几个说话人的特征特征用于说话人相似度度量。基于通用背景模型(Universal Background Model, UBM)和主成分分析(principal component analysis, PCA)的特征得到了最好的性能,与基线深度神经网络和自适应的GMM-HMM系统相比有了显著的改进。该方法与利用试验数据初始ASR假设的朴素自适应方法相结合,实现了进一步的改进。
{"title":"Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription","authors":"M. Mimura, Tatsuya Kawahara","doi":"10.1109/APSIPA.2014.7041567","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041567","url":null,"abstract":"Unsupervised speaker adaptation of Deep Neural Network (DNN) is investigated for lecture transcription tasks, in which a single speaker gives a long speech and thus speaker adaptation is important. The proposed method selects similar speakers to the test data (test speaker) from the training database, which are used for retraining the baseline DNN. Several speaker characteristic features are defined for the speaker similarity measure. The feature based on Universal Background Model (UBM) and principal component analysis (PCA) achieves the best performance, resulting in a significant improvement from the baseline DNN and also from the adapted GMM-HMM system. The method is combined with a naive adaptation method using the initial ASR hypothesis of the test data, and an additional improvement is achieved.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"182 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125828469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Proportional feedback based rate control for intra frame of H.264/AVC high profile 基于比例反馈的H.264/AVC高清帧内速率控制
Yanping Zhou, Y. Duan, Jun Sun, Zongming Guo
This paper focuses on the intra frame rate control of H.264/AVC High Profile and introduces a new frame gradient-based rate control algorithm. In this algorithm, a rate-gradient-quantization parameter model with frame gradient employed as frame complexity is proposed. Then, a proportional feedback scheme, along with an adaptive optimization method, is presented to achieve constant bitrate. Rigorous experiments covering various sequences of different target rates are carried out. Experimental results show that the proposed rate control method outperforms JM16.0 by offering a more constant rate output and reducing rate fluctuation, without video quality loss.
本文重点研究了H.264/AVC High Profile的帧内速率控制,提出了一种基于帧梯度的帧内速率控制算法。在该算法中,提出了一种以帧梯度作为帧复杂度的速率梯度量化参数模型。然后,提出了一种比例反馈方案,并结合自适应优化方法来实现恒定比特率。对不同目标速率的各种序列进行了严格的实验。实验结果表明,该方法在不影响视频质量的前提下,提供了更稳定的速率输出,减少了速率波动,优于JM16.0。
{"title":"Proportional feedback based rate control for intra frame of H.264/AVC high profile","authors":"Yanping Zhou, Y. Duan, Jun Sun, Zongming Guo","doi":"10.1109/APSIPA.2014.7041537","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041537","url":null,"abstract":"This paper focuses on the intra frame rate control of H.264/AVC High Profile and introduces a new frame gradient-based rate control algorithm. In this algorithm, a rate-gradient-quantization parameter model with frame gradient employed as frame complexity is proposed. Then, a proportional feedback scheme, along with an adaptive optimization method, is presented to achieve constant bitrate. Rigorous experiments covering various sequences of different target rates are carried out. Experimental results show that the proposed rate control method outperforms JM16.0 by offering a more constant rate output and reducing rate fluctuation, without video quality loss.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127201372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-agent ad hoc team partitioning by observing and modeling single-agent performance 通过观察和建模单智能体性能来进行多智能体特别团队划分
Etkin Baris Ozgul, Somchaya Liemhetcharat, K. H. Low
Multi-agent research has focused on finding the optimal team for a task. Many approaches assume that the performance of the agents are known a priori. We are interested in ad hoc teams, where the agents' algorithms and performance are initially unknown. We focus on the task of modeling the performance of single agents through observation in training environments, and using the learned models to partition a new environment for a multi-agent team. The goal is to minimize the number of agents used, while maintaining a performance threshold of the multi-agent team. We contribute a novel model to learn the agent's performance through observations, and a partitioning algorithm that minimizes the team size. We evaluate our algorithms in simulation, and show the efficacy of our learn model and partitioning algorithm.
多智能体研究的重点是寻找任务的最佳团队。许多方法假设代理的性能是已知的先验。我们对临时团队感兴趣,其中代理的算法和性能最初是未知的。我们专注于通过在训练环境中观察单个智能体的性能来建模,并使用学习到的模型为多智能体团队划分新的环境。目标是尽量减少使用的代理数量,同时保持多代理团队的性能阈值。我们提出了一个新的模型,通过观察来学习智能体的性能,以及一个最小化团队规模的划分算法。我们在仿真中评估了我们的算法,并证明了我们的学习模型和划分算法的有效性。
{"title":"Multi-agent ad hoc team partitioning by observing and modeling single-agent performance","authors":"Etkin Baris Ozgul, Somchaya Liemhetcharat, K. H. Low","doi":"10.1109/APSIPA.2014.7041644","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041644","url":null,"abstract":"Multi-agent research has focused on finding the optimal team for a task. Many approaches assume that the performance of the agents are known a priori. We are interested in ad hoc teams, where the agents' algorithms and performance are initially unknown. We focus on the task of modeling the performance of single agents through observation in training environments, and using the learned models to partition a new environment for a multi-agent team. The goal is to minimize the number of agents used, while maintaining a performance threshold of the multi-agent team. We contribute a novel model to learn the agent's performance through observations, and a partitioning algorithm that minimizes the team size. We evaluate our algorithms in simulation, and show the efficacy of our learn model and partitioning algorithm.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114247026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spectral-temporal receptive fields and MFCC balanced feature extraction for noisy speech recognition 噪声语音识别的频谱-时间接受野和MFCC平衡特征提取
Jia-Ching Wang, Chang-Hong Lin, En-Ting Chen, P. Chang
This paper aims to propose a new set of acoustic features based on spectral-temporal receptive fields (STRFs). The STRF is an analysis method for studying physiological model of the mammalian auditory system in spectral-temporal domain. It has two different parts: one is the rate (in Hz) which represents the temporal response and the other is the scale (in cycle/octave) which represents the spectral response. With the obtained STRF, we propose an effective acoustic feature. First, the energy of each scale is calculated from the STRF. The logarithmic operation is then imposed on the scale energies. Finally, the discrete Cosine transform is applied to generate the proposed STRF feature. In our experiments, we combine the proposed STRF feature with conventional Mel frequency cepstral coefficients (MFCCs) to verify its effectiveness. In a noise-free environment, the proposed feature can increase the recognition rate by 17.48%. Moreover, the increase in the recognition rate ranges from 5% to 12% in noisy environments.
本文旨在提出一套新的基于频谱-时间接受场(strf)的声学特征。STRF是研究哺乳动物听觉系统生理模型的一种频谱-时域分析方法。它有两个不同的部分:一个是表示时间响应的速率(以赫兹为单位),另一个是表示频谱响应的尺度(以周期/倍频)。利用得到的STRF,我们提出了一个有效的声学特征。首先,从STRF中计算出各个尺度的能量。然后对刻度能量进行对数运算。最后,应用离散余弦变换生成所提出的STRF特征。在我们的实验中,我们将提出的STRF特征与传统的Mel频率倒谱系数(MFCCs)相结合来验证其有效性。在无噪声环境下,该特征可将识别率提高17.48%。在噪声环境下,识别率的提高幅度在5% ~ 12%之间。
{"title":"Spectral-temporal receptive fields and MFCC balanced feature extraction for noisy speech recognition","authors":"Jia-Ching Wang, Chang-Hong Lin, En-Ting Chen, P. Chang","doi":"10.1109/APSIPA.2014.7041624","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041624","url":null,"abstract":"This paper aims to propose a new set of acoustic features based on spectral-temporal receptive fields (STRFs). The STRF is an analysis method for studying physiological model of the mammalian auditory system in spectral-temporal domain. It has two different parts: one is the rate (in Hz) which represents the temporal response and the other is the scale (in cycle/octave) which represents the spectral response. With the obtained STRF, we propose an effective acoustic feature. First, the energy of each scale is calculated from the STRF. The logarithmic operation is then imposed on the scale energies. Finally, the discrete Cosine transform is applied to generate the proposed STRF feature. In our experiments, we combine the proposed STRF feature with conventional Mel frequency cepstral coefficients (MFCCs) to verify its effectiveness. In a noise-free environment, the proposed feature can increase the recognition rate by 17.48%. Moreover, the increase in the recognition rate ranges from 5% to 12% in noisy environments.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114445257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A narrowband active noise control system with frequency mismatch compensation 带频率失配补偿的窄带主动噪声控制系统
Jinwei Sun, Fei Ma, Boyan Huang, Liang Wen
Narrowband active noise control (ANC) systems enjoy good performance where sinusoidal signals dominate in the primary noise, on condition that a reference signal of the same frequencies with the primary noise is given. However, frequencies of the reference signal provided by nonacoustic sensors are usually different from that of the primary noise due to temperature changes, aging, etc. Such frequency mismatch (FM) will make the narrowband ANC systems unable to suppress the primary noise effectively, even render them useless. In this paper, we propose a new narrowband ANC system that integrated with a frequency estimation subsystem. The frequency estimation is obtained from a spectrum computation based on an adaptive linear prediction filter. The estimated frequencies are used by the cosine signal generator to produce a more accurate reference signal to the main controller, thus the performance deterioration caused by FM can be mitigated. The effectiveness of the proposed system has been confirmed by numerous simulations.
窄带有源噪声控制系统在主噪声中以正弦信号为主的情况下,只要给出与主噪声相同频率的参考信号,就具有良好的控制性能。然而,由于温度变化、老化等原因,非声学传感器提供的参考信号的频率通常与主噪声的频率不同。这种频率失配(FM)会使窄带无线通信系统无法有效抑制主噪声,甚至使其失效。本文提出了一种集成了频率估计子系统的窄带自适应无线通信系统。频率估计是基于自适应线性预测滤波器的频谱计算得到的。余弦信号发生器利用估计的频率为主控制器提供更精确的参考信号,从而减轻调频引起的性能下降。通过大量的仿真验证了该系统的有效性。
{"title":"A narrowband active noise control system with frequency mismatch compensation","authors":"Jinwei Sun, Fei Ma, Boyan Huang, Liang Wen","doi":"10.1109/APSIPA.2014.7041689","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041689","url":null,"abstract":"Narrowband active noise control (ANC) systems enjoy good performance where sinusoidal signals dominate in the primary noise, on condition that a reference signal of the same frequencies with the primary noise is given. However, frequencies of the reference signal provided by nonacoustic sensors are usually different from that of the primary noise due to temperature changes, aging, etc. Such frequency mismatch (FM) will make the narrowband ANC systems unable to suppress the primary noise effectively, even render them useless. In this paper, we propose a new narrowband ANC system that integrated with a frequency estimation subsystem. The frequency estimation is obtained from a spectrum computation based on an adaptive linear prediction filter. The estimated frequencies are used by the cosine signal generator to produce a more accurate reference signal to the main controller, thus the performance deterioration caused by FM can be mitigated. The effectiveness of the proposed system has been confirmed by numerous simulations.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126276962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An automatic input protocol recommendation method for tailored switch-to-speech communication aid systems 一种用于定制切换语音通信辅助系统的自动输入协议推荐方法
Fuming Fang, T. Shinozaki, Takao Kobayashi
A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.
语音转换界面可作为辅助系统,为残障人士提供互动沟通的手段。任何身体部位的运动,比如眼睛的运动,都可以作为开关的输入。可能的交换操作的数量因人而异,但带宽通常是相当有限的。因此,需要有效的输入协议来将切换操作映射到发音。同时,该协议必须易于学习,以便任何人都可以使用它。为此,我们提出了一种能够接受交换机操作中个性化需求的协议推荐方法。该方法为界面的每个用户提供了一个定制的协议,该协议既快速输入又易于记忆。协议设计的两个主要思想是利用每个人都知道的字母表知识,以及通过允许发音转换中的歧义来提高输入速度和可学习性。由歧义引起的转换误差由N-gram语言模型抵消。通过仿真和研究参与者的实测值对方案的性能进行了评价,表明了所提方法的优越性。
{"title":"An automatic input protocol recommendation method for tailored switch-to-speech communication aid systems","authors":"Fuming Fang, T. Shinozaki, Takao Kobayashi","doi":"10.1109/APSIPA.2014.7041638","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041638","url":null,"abstract":"A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126306398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Discovering and analyzing learning pattern on web based learning using social network analysis 利用社会网络分析发现和分析基于web的学习模式
P. Temdee, Wacharawan Intayoad
Web based learning has been promoting alternative way of learning for decades. The difficulty of web based learning is to provide the appropriate support for the learners so that the learners will not get lost and their learning achievements can be ensured. This paper thus proposes the method for discovering learning patterns of the learners on web based learning particularly for ensuring the learning achievement. The learning pattern is discovered by analyzing the interactions among the learners and the learning objects with social network analysis. Then, the achievement learning pattern is finally determined by analyzing the sets of obtained social network measurements. The interaction data is gathered from online course named introduction to Information Technology in the 2013 academic year, particularly for spreadsheet content module having 10 learning objects. The interaction patterns only of two groups of students including scientific and nonscientific background knowledge who pass the spreadsheet examination are analyzed. Finally, learning patterns ensuring learning achievement for spreadsheet content module of those students having different background knowledge is revealed.
几十年来,基于网络的学习一直在推广另一种学习方式。网络学习的难点在于为学习者提供适当的支持,使学习者不迷失方向,保证学习成果。在此基础上,本文提出了在网络学习中发现学习者学习模式的方法,以保证学习者的学习效果。运用社会网络分析法分析学习者与学习对象之间的相互作用,发现学习模式。然后,通过分析获得的社会网络测量集,最终确定成就学习模式。交互数据收集自2013学年的《信息技术导论》在线课程,特别是包含10个学习对象的电子表格内容模块。分析了通过电子表格考试的两组学生(包括科学背景知识和非科学背景知识)的交互模式。最后揭示了具有不同背景知识的学生电子表格内容模块的学习模式,保证了学生的学习成果。
{"title":"Discovering and analyzing learning pattern on web based learning using social network analysis","authors":"P. Temdee, Wacharawan Intayoad","doi":"10.1109/APSIPA.2014.7041814","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041814","url":null,"abstract":"Web based learning has been promoting alternative way of learning for decades. The difficulty of web based learning is to provide the appropriate support for the learners so that the learners will not get lost and their learning achievements can be ensured. This paper thus proposes the method for discovering learning patterns of the learners on web based learning particularly for ensuring the learning achievement. The learning pattern is discovered by analyzing the interactions among the learners and the learning objects with social network analysis. Then, the achievement learning pattern is finally determined by analyzing the sets of obtained social network measurements. The interaction data is gathered from online course named introduction to Information Technology in the 2013 academic year, particularly for spreadsheet content module having 10 learning objects. The interaction patterns only of two groups of students including scientific and nonscientific background knowledge who pass the spreadsheet examination are analyzed. Finally, learning patterns ensuring learning achievement for spreadsheet content module of those students having different background knowledge is revealed.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"145 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129695065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Range extrapolation of Head-Related Transfer Function using improved Higher Order Ambisonics 基于改进高阶双声的头部相关传递函数范围外推
Ling-song Zhou, C. Bao, Mao-shen Jia, Bing Bu
3D audio technology based on binaural reproduction requires the Head-Related Transfer Function (HRTF) datasets to be available for all possible distance. However, due to the tedious work of measurement and large volume of resulting datasets, the HRTF is typically measured only for sources located at a fixed distance. In this paper, the concept of virtual loudspeaker arrays is utilized to achieve range extrapolation of the measured HRTF datasets at a single range. The virtual loudspeaker is driven by Higher Order Ambisonics (HOA). Specially, to restrict the near-field effect of HOA, a compensation method of modified Wiener filter is proposed. The simulation results indicate that the proposed method provides effective range extrapolation of HRTF.
基于双耳再现的3D音频技术要求头部相关传递函数(HRTF)数据集在所有可能的距离都可用。然而,由于测量工作繁琐且产生的数据集量大,通常仅对位于固定距离的源进行HRTF测量。本文利用虚拟扬声器阵列的概念,对实测的HRTF数据集进行单量程外推。该虚拟扬声器由高阶立体声(HOA)驱动。特别地,提出了一种改进的维纳滤波补偿方法,以限制高噪点的近场效应。仿真结果表明,该方法能够有效地进行距离外推。
{"title":"Range extrapolation of Head-Related Transfer Function using improved Higher Order Ambisonics","authors":"Ling-song Zhou, C. Bao, Mao-shen Jia, Bing Bu","doi":"10.1109/APSIPA.2014.7041527","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041527","url":null,"abstract":"3D audio technology based on binaural reproduction requires the Head-Related Transfer Function (HRTF) datasets to be available for all possible distance. However, due to the tedious work of measurement and large volume of resulting datasets, the HRTF is typically measured only for sources located at a fixed distance. In this paper, the concept of virtual loudspeaker arrays is utilized to achieve range extrapolation of the measured HRTF datasets at a single range. The virtual loudspeaker is driven by Higher Order Ambisonics (HOA). Specially, to restrict the near-field effect of HOA, a compensation method of modified Wiener filter is proposed. The simulation results indicate that the proposed method provides effective range extrapolation of HRTF.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127604411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1