Yingjie Meng, Xiaoyu Teng, Huiran Liu, Sanshuai Cui, Zhiyuan Wang
{"title":"A screening scheme based on energy for speech key-frame","authors":"Yingjie Meng, Xiaoyu Teng, Huiran Liu, Sanshuai Cui, Zhiyuan Wang","doi":"10.1109/ICCT.2017.8359908","DOIUrl":null,"url":null,"abstract":"The research of the existing screening algorithm for speech frame has a great deal of shortcomings, such as its applicability and complexity. Worse still, those frames which are screened by algorithm can't achieve the requirement of express and the screening process greatly damages the original signal. This paper presents a strategy for screening speech keyframe and designs a screening scheme for key-frames based on the strategy. This scheme refers to the speech's logarithm energy and the weighted-zero-crossing rate. The detail process of screening scheme: firstly, screening frames according to the logarithm energy of the speech signals. Meanwhile, combine the speech amplitude and zero-crossing rate for frames screening. Finally, calculate the similarity of the two screening results, and getting the key-frame set. In addition, the scheme has been analyzed and validated from those aspects like continuity, characterization and applicability, in order to verify the effectiveness and availability. 
The results illustrate that these frames which have been screened by this scheme have advantages of continuity, characterization etc.","PeriodicalId":199874,"journal":{"name":"2017 IEEE 17th International Conference on Communication Technology (ICCT)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 17th International Conference on Communication Technology (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCT.2017.8359908","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citation count: 1
Abstract
Existing screening algorithms for speech frames have significant shortcomings in applicability and complexity. Worse still, the frames they select cannot meet the requirement of expressing the speech, and the screening process substantially damages the original signal. This paper presents a strategy for screening speech key-frames and designs a key-frame screening scheme based on that strategy. The scheme relies on the logarithmic energy of the speech and the weighted zero-crossing rate. The screening proceeds as follows. First, frames are screened according to the logarithmic energy of the speech signal. In parallel, the speech amplitude and the zero-crossing rate are combined for a second screening of the frames. Finally, the similarity of the two screening results is calculated to obtain the key-frame set. The scheme is analyzed and validated in terms of continuity, characterization, and applicability to verify its effectiveness and availability. The results show that the frames selected by this scheme have advantages in continuity, characterization, and other respects.
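The three steps described in the abstract can be sketched in code. This is a minimal illustration, not the authors' implementation: the abstract does not give the frame length, the thresholds, the exact weighting of the zero-crossing rate, or the similarity measure. The sketch assumes an amplitude-weighted zero-crossing rate (mean absolute amplitude times crossing rate), a threshold placed at a fixed fraction of each feature's range, and set intersection plus a Jaccard ratio as the way to compare the two screening results. All function names and parameters (`split_frames`, `screen`, `ratio`, and so on) are hypothetical.

```python
import math


def split_frames(signal, frame_len=200, hop=100):
    """Split a sequence of samples into overlapping frames."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def log_energy(frame, eps=1e-10):
    """Logarithmic short-time energy of one frame (eps avoids log of zero)."""
    return math.log10(sum(x * x for x in frame) + eps)


def weighted_zcr(frame):
    """Zero-crossing rate scaled by mean absolute amplitude.

    The weighting is an assumption; the abstract only says that amplitude
    and zero-crossing rate are combined.
    """
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    zcr = crossings / (len(frame) - 1)
    mean_amp = sum(abs(x) for x in frame) / len(frame)
    return mean_amp * zcr


def screen(values, ratio):
    """Return indices of frames whose value reaches min + ratio * (max - min)."""
    lo, hi = min(values), max(values)
    threshold = lo + ratio * (hi - lo)
    return {i for i, v in enumerate(values) if v >= threshold}


def key_frames(signal, frame_len=200, hop=100, ratio=0.5):
    """Screen frames two ways and keep the frames both screenings agree on."""
    frames = split_frames(signal, frame_len, hop)
    by_energy = screen([log_energy(f) for f in frames], ratio)
    by_wzcr = screen([weighted_zcr(f) for f in frames], ratio)
    common = by_energy & by_wzcr
    union = by_energy | by_wzcr
    # Jaccard similarity of the two screening results (assumed measure).
    similarity = len(common) / len(union) if union else 1.0
    return sorted(common), similarity


if __name__ == "__main__":
    # Synthetic test signal: silence, a 440 Hz tone at 8 kHz, silence.
    tone = [0.5 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(1000)]
    signal = [0.0] * 1000 + tone + [0.0] * 1000
    indices, similarity = key_frames(signal)
    print("key-frame indices:", indices)
    print("similarity of the two screenings:", round(similarity, 3))
```

On the synthetic signal, the frames that survive both screenings fall inside the tone segment, while the silent frames are rejected. Real speech would need tuned thresholds and probably a windowing function, neither of which the abstract specifies.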