{"title":"Digital Representation of Perceptual Criteria","authors":"J. Flanagan","doi":"10.1109/ASPAA.1991.634087","DOIUrl":null,"url":null,"abstract":"Information signals are typically intended for human consumption. Human perception therefore contributes directly to fidelity criteria for digital representation. As computational capabilities increase and costs diminish, coding algorithms are able to incorporate more of the constraints that characterize perception. The incentive is still-greater economy for digital transmission and storage. Sight and sound are sensory modes favored by the human for information exchange. These modes are presently most central to human/machine communications and multimedia systems. The intricacies of visual and auditory perception are therefore figuring more prominently in signal coding. For example, taking account of the eye's sensitivity to quantizing noise as a function of temporal and spatial frequencies leads to good-quality coding of color motion images at fractions of a bit per pixel. Similarly, the characteristics of auditory masking, in both time and frequency domains, provide leverage to identify signal components which are irrelevant to perception and which need not consume coding capacity. This discussion draws a perspective on recent coding advances and points up opportunities for increased sophistication in representing perceptually important factors. It also indicates relationships between economies gained by perceptual coding alone, and those where source coding can trade on signal-specific characteristics to achieve further reductions in bit rate. It concludes with brief consideration of other sensory modalities, such as the tactile dimension, that might contribute to naturalness and ease of use in interactive multimedia information systems.","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASPAA.1991.634087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0

Abstract

Information signals are typically intended for human consumption. Human perception therefore contributes directly to fidelity criteria for digital representation. As computational capabilities increase and costs diminish, coding algorithms are able to incorporate more of the constraints that characterize perception. The incentive is still-greater economy for digital transmission and storage. Sight and sound are sensory modes favored by the human for information exchange. These modes are presently most central to human/machine communications and multimedia systems. The intricacies of visual and auditory perception are therefore figuring more prominently in signal coding. For example, taking account of the eye's sensitivity to quantizing noise as a function of temporal and spatial frequencies leads to good-quality coding of color motion images at fractions of a bit per pixel. Similarly, the characteristics of auditory masking, in both time and frequency domains, provide leverage to identify signal components which are irrelevant to perception and which need not consume coding capacity. This discussion draws a perspective on recent coding advances and points up opportunities for increased sophistication in representing perceptually important factors. It also indicates relationships between economies gained by perceptual coding alone, and those where source coding can trade on signal-specific characteristics to achieve further reductions in bit rate. It concludes with brief consideration of other sensory modalities, such as the tactile dimension, that might contribute to naturalness and ease of use in interactive multimedia information systems.
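The abstract's point about auditory masking can be made concrete with a toy example. The sketch below is not the method of the paper; it is a deliberately simplified illustration, assuming a symmetric, linear-in-dB spreading of masking across neighboring frequency bands. The function names (`masking_threshold`, `allocate_bits`) and the parameter values (`offset_db`, `slope_db_per_band`, `floor_db`) are hypothetical and chosen only for illustration. The one standard ingredient is the rule of thumb that each bit of a uniform quantizer buys roughly 6.02 dB of signal-to-noise ratio.

```python
import math


def masking_threshold(levels_db, offset_db=12.0, slope_db_per_band=10.0, floor_db=0.0):
    """Toy per-band masking threshold in dB.

    Each band j masks band i at (level_j - offset_db - slope * |i - j|).
    The threshold for band i is the largest such contribution from any
    other band, but never below floor_db (a stand-in for the threshold
    in quiet). All parameter values are illustrative assumptions.
    """
    thresholds = []
    for i in range(len(levels_db)):
        spread = [
            level - offset_db - slope_db_per_band * abs(i - j)
            for j, level in enumerate(levels_db)
            if j != i
        ]
        thresholds.append(max([floor_db] + spread))
    return thresholds


def allocate_bits(levels_db, **kwargs):
    """Give each band just enough bits to push quantizing noise under its threshold.

    Bands whose level lies at or below the threshold are treated as
    perceptually irrelevant and receive zero bits.
    """
    thresholds = masking_threshold(levels_db, **kwargs)
    bits = []
    for level, thr in zip(levels_db, thresholds):
        smr = level - thr  # signal-to-mask ratio in dB
        bits.append(math.ceil(smr / 6.02) if smr > 0 else 0)
    return bits


# Hypothetical band levels in dB: two strong components and three weak ones.
levels = [60.0, 30.0, 20.0, 55.0, 5.0]
bits = allocate_bits(levels)
print(bits)  # [8, 0, 0, 7, 0]
```

In this toy case the three weak bands fall under the thresholds set by their strong neighbors, so they consume no coding capacity, while the two strong bands receive only as many bits as their signal-to-mask ratios require. Real coders use asymmetric spreading functions on a perceptual frequency scale and also account for temporal masking, which this sketch omits.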