首页 > 最新文献

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific最新文献

英文 中文
Target design and low complexity signal detection for two-dimensional magnetic recording 二维磁记录目标设计与低复杂度信号检测
C. Matcha, S. G. Srinivasa
Partial response maximum likelihood (PRML) scheme is a well known technique to equalize the data read from ID magnetic recording channels. The PRML scheme uses a linear equalizer followed by a maximum likelihood (ML) detector. This paper is novel in addressing the following aspects: a) We propose two different methods to design separable and non-separable 2D PR targets that help in signal detection, b) We propose an extension of ID Viterbi detector for signal detection in 2D ISI channels. We use the detector to study the efficacy of PR targets designed for a particular choice of 2D ISI channel.
部分响应最大似然(PRML)方案是一种众所周知的均衡从ID磁记录通道读取数据的技术。PRML方案使用线性均衡器,然后是最大似然(ML)检测器。本文在以下方面是新颖的:a)我们提出了两种不同的方法来设计有助于信号检测的可分离和不可分离的二维PR目标,b)我们提出了ID Viterbi检测器的扩展,用于二维ISI通道中的信号检测。我们使用检测器来研究为特定选择的二维ISI通道设计的PR靶标的功效。
{"title":"Target design and low complexity signal detection for two-dimensional magnetic recording","authors":"C. Matcha, S. G. Srinivasa","doi":"10.1109/APSIPA.2014.7041724","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041724","url":null,"abstract":"Partial response maximum likelihood (PRML) scheme is a well known technique to equalize the data read from ID magnetic recording channels. The PRML scheme uses a linear equalizer followed by a maximum likelihood (ML) detector. This paper is novel in addressing the following aspects: a) We propose two different methods to design separable and non-separable 2D PR targets that help in signal detection, b) We propose an extension of ID Viterbi detector for signal detection in 2D ISI channels. We use the detector to study the efficacy of PR targets designed for a particular choice of 2D ISI channel.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129211073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The design of HOA irregular decoders based on the optimal symmetrical virtual microphone response 基于最优对称虚拟传声器响应的HOA不规则解码器设计
Rong Zhu, C. Bao, Mao-shen Jia, Bing Bu, Ling-song Zhou
Ambisonic decoder for irregular speaker arrays could be derived by optimization techniques. In terms of higher order reproduction system, the optimization program is hard to be guided to a large number of decoder coefficients. This paper describes a new method for higher order decoder based on the optimal symmetrical virtual microphone response (OSVMR). In the proposed method, the number of the decoder coefficients is reduced and the optimal symmetrical polar pattern of speaker feeds is obtained. The binaural evaluation demonstrates that the proposed method is found to be significantly better than reference methods on interaural time difference (ITD) and interaural level difference (ILD).
利用优化技术可以推导出适用于不规则扬声器阵列的双声解码器。对于高阶再现系统,优化程序很难被引导到大量的译码系数。本文提出了一种基于最优对称虚拟传声器响应(OSVMR)的高阶解码器新方法。该方法减少了解码器系数的个数,得到了最优的扬声器馈电对称极化方向图。双耳评价结果表明,该方法在耳间时差(ITD)和耳间音阶差(ILD)方面明显优于参考方法。
{"title":"The design of HOA irregular decoders based on the optimal symmetrical virtual microphone response","authors":"Rong Zhu, C. Bao, Mao-shen Jia, Bing Bu, Ling-song Zhou","doi":"10.1109/APSIPA.2014.7041534","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041534","url":null,"abstract":"Ambisonic decoder for irregular speaker arrays could be derived by optimization techniques. In terms of higher order reproduction system, the optimization program is hard to be guided to a large number of decoder coefficients. This paper describes a new method for higher order decoder based on the optimal symmetrical virtual microphone response (OSVMR). In the proposed method, the number of the decoder coefficients is reduced and the optimal symmetrical polar pattern of speaker feeds is obtained. The binaural evaluation demonstrates that the proposed method is found to be significantly better than reference methods on interaural time difference (ITD) and interaural level difference (ILD).","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128646892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A study on replay attack and anti-spoofing for text-dependent speaker verification 基于文本的说话人验证重放攻击与防欺骗研究
Zhizheng Wu, Sheng Gao, Eng Siong Cling, Haizhou Li
Replay, which is to playback a pre-recorded speech sample, presents a genuine risk to automatic speaker verification technology. In this study, we evaluate the vulnerability of text-dependent speaker verification systems under the replay attack using a standard benchmarking database, and also propose an anti-spoofing technique to safeguard the speaker verification systems. The key idea of the spoofing detection technique is to decide whether the presented sample is matched to any previous stored speech samples based a similarity score. The experiments conducted on the RSR2015 database showed that the equal error rate (EER) and false acceptance rate (FAR) increased from both 2.92 % to 25.56 % and 78.36 % respectively as a result of the replay attack. It confirmed the vulnerability of speaker verification to replay attacks. On the other hand, our proposed spoofing countermeasure was able to reduce the FARs from 78.36 % and 73.14 % to 0.06 % and 0.0 % for male and female systems, respectively, in the face of replay spoofing. The experiments confirmed the effectiveness of the proposed anti-spoofing technique.
重放,即回放预先录制的语音样本,对自动说话人验证技术存在真正的风险。在本研究中,我们使用标准基准数据库评估了文本依赖的说话人验证系统在重放攻击下的脆弱性,并提出了一种防欺骗技术来保护说话人验证系统。欺骗检测技术的关键思想是根据相似度评分来判断所呈现的样本是否与之前存储的语音样本相匹配。在RSR2015数据库上进行的实验表明,受重放攻击的影响,等错误率(EER)和误接受率(FAR)分别从2.92%增加到25.56%和78.36%。它证实了说话人验证对重放攻击的脆弱性。另一方面,我们提出的欺骗对策能够将男性和女性系统的FARs分别从78.36%和73.14%降低到0.06%和0.0%,面对重放欺骗。实验验证了所提出的抗欺骗技术的有效性。
{"title":"A study on replay attack and anti-spoofing for text-dependent speaker verification","authors":"Zhizheng Wu, Sheng Gao, Eng Siong Cling, Haizhou Li","doi":"10.1109/APSIPA.2014.7041636","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041636","url":null,"abstract":"Replay, which is to playback a pre-recorded speech sample, presents a genuine risk to automatic speaker verification technology. In this study, we evaluate the vulnerability of text-dependent speaker verification systems under the replay attack using a standard benchmarking database, and also propose an anti-spoofing technique to safeguard the speaker verification systems. The key idea of the spoofing detection technique is to decide whether the presented sample is matched to any previous stored speech samples based a similarity score. The experiments conducted on the RSR2015 database showed that the equal error rate (EER) and false acceptance rate (FAR) increased from both 2.92 % to 25.56 % and 78.36 % respectively as a result of the replay attack. It confirmed the vulnerability of speaker verification to replay attacks. On the other hand, our proposed spoofing countermeasure was able to reduce the FARs from 78.36 % and 73.14 % to 0.06 % and 0.0 % for male and female systems, respectively, in the face of replay spoofing. The experiments confirmed the effectiveness of the proposed anti-spoofing technique.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129278846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 157
Empirical evaluation of visible spectrum iris versus periocular recognition in unconstrained scenario on smartphones 智能手机无约束场景下虹膜可见光谱与眼周识别的实证评价
K. Raja, Ramachandra Raghavendra, C. Busch
Smartphones are increasingly used as biométrie sensor for many authentication applications due to the computational ability and high resolution cameras that can be used to capture biométrie information. The objective of this paper is to assess the performance of iris versus periocular recognition for smartphones in non ideal conditions (change of illumination, highly pigmented iris, shadows on iris pattern) in real-life for verification in visible spectrum. We introduce various protocols for real-life verification scenarios using smartphones for iris and periocular recognition. Further, we also study the verification performance where enrollment and probe data originate from different smartphones. From the extensive set of experiments conducted on a publicly available smartphone database, it can be observed that the information from periocular region provides substantially good performance in terms of recognition accuracy in cross sensor and varying illumination scenarios as compared to iris under same conditions.
由于智能手机的计算能力和高分辨率相机可用于捕获生物变性信息,因此越来越多地用作生物变性传感器用于许多身份验证应用。本文的目的是评估智能手机在现实生活中非理想条件下(光照变化、高色素虹膜、虹膜图案阴影)虹膜与眼周识别的性能,以便在可见光谱中进行验证。我们介绍了使用智能手机进行虹膜和眼周识别的现实验证场景的各种协议。此外,我们还研究了来自不同智能手机的注册和探测数据的验证性能。在一个公开的智能手机数据库上进行了大量的实验,可以观察到,在相同条件下,在交叉传感器和不同光照情况下,与虹膜相比,来自眼周区域的信息在识别精度方面表现得非常好。
{"title":"Empirical evaluation of visible spectrum iris versus periocular recognition in unconstrained scenario on smartphones","authors":"K. Raja, Ramachandra Raghavendra, C. Busch","doi":"10.1109/APSIPA.2014.7041521","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041521","url":null,"abstract":"Smartphones are increasingly used as biométrie sensor for many authentication applications due to the computational ability and high resolution cameras that can be used to capture biométrie information. The objective of this paper is to assess the performance of iris versus periocular recognition for smartphones in non ideal conditions (change of illumination, highly pigmented iris, shadows on iris pattern) in real-life for verification in visible spectrum. We introduce various protocols for real-life verification scenarios using smartphones for iris and periocular recognition. Further, we also study the verification performance where enrollment and probe data originate from different smartphones. From the extensive set of experiments conducted on a publicly available smartphone database, it can be observed that the information from periocular region provides substantially good performance in terms of recognition accuracy in cross sensor and varying illumination scenarios as compared to iris under same conditions.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115396810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Analysis of modifier structure for emotion expressions 情感表达修饰语结构分析
Liang-Chih Yu, K. R. Lai
Dimensional emotion representation such as valence and arousal (VA) space has been an emerging way to represent emotions. In this representation, emotion words can be projected to the VA space according to their valence and arousal values. Sentence and document-level emotions can then be projected based on the emotion words within them. However, emotion expressions in sentences and documents usually contain various modifier structure such as negation (e.g., not happy), degree (very happy) and emotion compounds. Such modifier structure can provide more precise information for measuring VA values in both sentence and document-levels. In this study, we analyze various types of modifier structure for emotion expressions. In addition, we also investigate the effect of different types of modifier structure on measuring VA values for emotion expressions.
情绪的维度表征,如效价和觉醒(VA)空间已成为一种新兴的情绪表征方式。在这种表示中,情绪词可以根据它们的效价和唤醒值投射到VA空间。句子和文档级的情感可以基于其中的情感词进行投射。然而,句子和文献中的情绪表达通常包含各种修饰语结构,如否定(如不高兴)、程度(如非常高兴)和情感复合词。这种修饰语结构可以为句子和文档级别的VA值测量提供更精确的信息。在本研究中,我们分析了不同类型的情感表达修饰语结构。此外,我们还研究了不同类型的修饰语结构对情绪表达VA值测量的影响。
{"title":"Analysis of modifier structure for emotion expressions","authors":"Liang-Chih Yu, K. R. Lai","doi":"10.1109/APSIPA.2014.7041538","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041538","url":null,"abstract":"Dimensional emotion representation such as valence and arousal (VA) space has been an emerging way to represent emotions. In this representation, emotion words can be projected to the VA space according to their valence and arousal values. Sentence and document-level emotions can then be projected based on the emotion words within them. However, emotion expressions in sentences and documents usually contain various modifier structure such as negation (e.g., not happy), degree (very happy) and emotion compounds. Such modifier structure can provide more precise information for measuring VA values in both sentence and document-levels. In this study, we analyze various types of modifier structure for emotion expressions. In addition, we also investigate the effect of different types of modifier structure on measuring VA values for emotion expressions.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114283992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Agile and economic media-centric service realization over Software-Defined Infrastructure 在软件定义的基础设施上实现敏捷和经济的以媒体为中心的服务
JongWon Kim, Taeheum Na, A. C. Risdianto, Byung-Rae Cha, Sun Park
The lifecycle management of service realization is very challenging. With virtualized playgrounds over Future Internet testbeds, the lifecycle experiments could be easily exercised so that all tasks and responsibilities are well-defined for entire experiment stages among developers and operators. Also, the dynamic provisioning of hyper-convergent compute/networking/storage resources is appropriately streamlined with the experiment lifecycle. In this paper, by considering these issues, we discuss the agile and economic realization of automated media-centric experiments over OF@TEIN (OpenFlow @ Trans-Eurasian Information Network) SDI (Software-Defined Infrastructure).
服务实现的生命周期管理非常具有挑战性。有了未来互联网测试平台上的虚拟游乐场,生命周期实验可以很容易地进行,这样开发人员和操作人员在整个实验阶段都可以定义所有的任务和责任。此外,超融合计算/网络/存储资源的动态供应也随着实验的生命周期得到适当的精简。在本文中,通过考虑这些问题,我们讨论了通过OF@TEIN (OpenFlow @ Trans-Eurasian Information Network) SDI(软件定义基础设施)实现以媒体为中心的自动化实验的敏捷和经济实现。
{"title":"Agile and economic media-centric service realization over Software-Defined Infrastructure","authors":"JongWon Kim, Taeheum Na, A. C. Risdianto, Byung-Rae Cha, Sun Park","doi":"10.1109/APSIPA.2014.7041804","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041804","url":null,"abstract":"The lifecycle management of service realization is very challenging. With virtualized playgrounds over Future Internet testbeds, the lifecycle experiments could be easily exercised so that all tasks and responsibilities are well-defined for entire experiment stages among developers and operators. Also, the dynamic provisioning of hyper-convergent compute/networking/storage resources is appropriately streamlined with the experiment lifecycle. In this paper, by considering these issues, we discuss the agile and economic realization of automated media-centric experiments over OF@TEIN (OpenFlow @ Trans-Eurasian Information Network) SDI (Software-Defined Infrastructure).","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115282692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Development of a steerable stereophonic parametric loudspeaker 可操纵立体声参数扬声器的研制
Chuang Shi, H. Nomura, T. Kamakura, W. Gan
The parametric loudspeaker is a type of directional loudspeakers making use of the nonlinear acoustic effects. The past studies to reproduce the three-dimensional audio contents with a pair of the parametric loudspeakers have demonstrated satisfactory performance. In this paper, the steerable parametric loudspeakers are proposed to relocate the sweet spot to follow the head movement of the listener. Although the spatial aliasing effects are observed in the steerable parametric loudspeaker, they can be converted to generate multiple sound beams simultaneously. A new case of the grating lobe elimination, namely the over elimination, is studied to extend the controllable level difference between the two sound beams. The simulation results to compare the equal and Chebyshev weights are also presented in this paper.
参数扬声器是一种利用非线性声效应的定向扬声器。以往用一对参数扬声器再现三维音频内容的研究取得了令人满意的效果。本文提出了一种可调参数扬声器,可以根据听者的头部运动来重新定位最佳点。虽然在可操纵参数扬声器中观察到空间混叠效应,但它们可以转换为同时产生多个声束。研究了一种新的光栅瓣消除方法,即过度消除,以扩大两声束之间的可控电平差。并给出了等效权值与切比雪夫权值的比较仿真结果。
{"title":"Development of a steerable stereophonic parametric loudspeaker","authors":"Chuang Shi, H. Nomura, T. Kamakura, W. Gan","doi":"10.1109/APSIPA.2014.7041715","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041715","url":null,"abstract":"The parametric loudspeaker is a type of directional loudspeakers making use of the nonlinear acoustic effects. The past studies to reproduce the three-dimensional audio contents with a pair of the parametric loudspeakers have demonstrated satisfactory performance. In this paper, the steerable parametric loudspeakers are proposed to relocate the sweet spot to follow the head movement of the listener. Although the spatial aliasing effects are observed in the steerable parametric loudspeaker, they can be converted to generate multiple sound beams simultaneously. A new case of the grating lobe elimination, namely the over elimination, is studied to extend the controllable level difference between the two sound beams. The simulation results to compare the equal and Chebyshev weights are also presented in this paper.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114799596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hearing impairment simulator based on compressive gammachirp filter 基于压缩gammachirp滤波器的听力损伤模拟器
Misaki Nagae, T. Irino, R. Nisimura, Hideki Kawahara, R. Patterson
This paper describes a simulator for presenting normal hearing (NH) listeners with the experience of a hearing impaired (HI) listener. The simulator is based on the compressive gammachirp (cGC) filter used to derive level-dependent filter shapes and the cochlear compression function from to notched-noise masking data. The level dependence of the cGC is reversed to produce inverse compression which is used to resynthesize sounds that cancel the compression applied by the auditory system of the NH listener. A frame-based analysis/synthesis procedure is newly introduced to improve processing speed for a graphical user interface (GUI) that allows the users to control the degree of compression within the range of the audiogram of the HI person. The simulator is intended for speech-language-hearing therapists (ST) and patients' families.
本文介绍了一种模拟器,用于向听力正常(NH)的听众呈现听力受损(HI)听众的体验。该仿真器基于压缩伽马机(cGC)滤波器,用于从陷波噪声掩蔽数据中导出电平相关滤波器形状和耳蜗压缩函数。cGC的水平依赖性被逆转,产生反向压缩,用于重新合成声音,取消NH听者听觉系统施加的压缩。为了提高图形用户界面(GUI)的处理速度,新引入了一种基于帧的分析/合成程序,该程序允许用户在HI患者的听力图范围内控制压缩程度。该模拟器适用于语言听力治疗师(ST)和患者家属。
{"title":"Hearing impairment simulator based on compressive gammachirp filter","authors":"Misaki Nagae, T. Irino, R. Nisimura, Hideki Kawahara, R. Patterson","doi":"10.1109/APSIPA.2014.7041579","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041579","url":null,"abstract":"This paper describes a simulator for presenting normal hearing (NH) listeners with the experience of a hearing impaired (HI) listener. The simulator is based on the compressive gammachirp (cGC) filter used to derive level-dependent filter shapes and the cochlear compression function from to notched-noise masking data. The level dependence of the cGC is reversed to produce inverse compression which is used to resynthesize sounds that cancel the compression applied by the auditory system of the NH listener. A frame-based analysis/synthesis procedure is newly introduced to improve processing speed for a graphical user interface (GUI) that allows the users to control the degree of compression within the range of the audiogram of the HI person. The simulator is intended for speech-language-hearing therapists (ST) and patients' families.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126151718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Banding effect removal for digital multitoning 去除数字多频带效应
Jing-Ming Guo, Jia-Yu Chang, Yun-Fu Liu
Error diffusion is an efficient halftone method for mainly being applied on printers. The promising high image quality and processing efficiency endorse it as a popular and competitive candidate in halftoning and multitoning applications. The multitoning is an extension of halftoning, adopting more than three tone levels for improving the similarity between an original image and the converted image. Yet, the banding effect, indicating the areas with only one tone level, disturbs the visual perception, and thus seriously degrades image quality. To cope with the banding effect, the tone replacement strategy is proposed in this study. As documented in the experimental results, excellent tone-similarity as that of the original image and promising reconstructed dot-distribution can be provided simultaneously. Comparing with the former banding-free methods, the apparent improvements/features suggest that the proposed method can be a very competitive candidate for multitoning applications.
误差扩散是一种有效的半色调方法,主要应用于打印机。它具有很高的图像质量和处理效率,在半色调和多色调应用中具有广泛的应用前景和竞争力。多色调是半色调的扩展,采用三个以上的色调级别来提高原始图像和转换图像之间的相似性。然而,条带效应表示只有一个色调级别的区域,会干扰视觉感知,从而严重降低图像质量。为了应对带状效应,本研究提出了音调替换策略。实验结果表明,该方法可以同时提供与原始图像相同的良好色调相似度和有希望的重建点分布。与以前的无带方法相比,明显的改进/特征表明,该方法可以成为多频应用的一个非常有竞争力的候选方法。
{"title":"Banding effect removal for digital multitoning","authors":"Jing-Ming Guo, Jia-Yu Chang, Yun-Fu Liu","doi":"10.1109/APSIPA.2014.7041661","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041661","url":null,"abstract":"Error diffusion is an efficient halftone method for mainly being applied on printers. The promising high image quality and processing efficiency endorse it as a popular and competitive candidate in halftoning and multitoning applications. The multitoning is an extension of halftoning, adopting more than three tone levels for improving the similarity between an original image and the converted image. Yet, the banding effect, indicating the areas with only one tone level, disturbs the visual perception, and thus seriously degrades image quality. To cope with the banding effect, the tone replacement strategy is proposed in this study. As documented in the experimental results, excellent tone-similarity as that of the original image and promising reconstructed dot-distribution can be provided simultaneously. Comparing with the former banding-free methods, the apparent improvements/features suggest that the proposed method can be a very competitive candidate for multitoning applications.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128106379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Region-based intra-frame rate-control scheme for High Efficiency Video Coding 基于区域的高效视频编码帧内速率控制方案
Mingliang Zhou, Hai-Miao Hu, Yongfei Zhang
In High Efficiency Video Coding (HEVC), the coding efficiency of infra-frames is lower than inter-frames, which will cause the flicker artifact and perceptual fluctuation among CTUs in low bitrates applications. Therefore, this paper proposes a region-based intra-frame rate-control scheme to improve the objective quality and to reduce PSNR fluctuation among CTUs. Firstly, the CTUs in intra-frame are classified into three regions according to their characteristics and complexity. And a region-based bit allocation is proposed to pre-determine bit among different regions. Secondly, a rate-complexity-quality model is proposed for infra-frame to adjust the QPs to achieve a smooth perceptual quality. The experimental results demonstrate that the proposed scheme can achieve higher coding performance and consistent visual quality when compared with the scheme adopted by HM12.0.
在高效视频编码(High Efficiency Video Coding, HEVC)中,基础帧的编码效率低于帧间的编码效率,在低比特率应用中会造成帧间的闪烁伪影和感知波动。因此,本文提出了一种基于区域的帧内速率控制方案,以提高目标质量,降低帧间的PSNR波动。首先,根据帧内cpu的特征和复杂度将其划分为三个区域。提出了一种基于区域的比特分配方法,在不同区域之间预先确定比特。其次,提出了一种速率-复杂度-质量模型,用于调整qp以获得平滑的感知质量。实验结果表明,与HM12.0所采用的编码方案相比,该方案具有更高的编码性能和一致的视觉质量。
{"title":"Region-based intra-frame rate-control scheme for High Efficiency Video Coding","authors":"Mingliang Zhou, Hai-Miao Hu, Yongfei Zhang","doi":"10.1109/APSIPA.2014.7041648","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041648","url":null,"abstract":"In High Efficiency Video Coding (HEVC), the coding efficiency of infra-frames is lower than inter-frames, which will cause the flicker artifact and perceptual fluctuation among CTUs in low bitrates applications. Therefore, this paper proposes a region-based intra-frame rate-control scheme to improve the objective quality and to reduce PSNR fluctuation among CTUs. Firstly, the CTUs in intra-frame are classified into three regions according to their characteristics and complexity. And a region-based bit allocation is proposed to pre-determine bit among different regions. Secondly, a rate-complexity-quality model is proposed for infra-frame to adjust the QPs to achieve a smooth perceptual quality. The experimental results demonstrate that the proposed scheme can achieve higher coding performance and consistent visual quality when compared with the scheme adopted by HM12.0.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126406148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1