{"title":"Voice activity detection","authors":"Deepti Singh, F. Boland","doi":"10.1145/1315325.1315332","DOIUrl":null,"url":null,"abstract":"A method for detecting voice activity comprises receiving audio signals on a plurality of channels and processing the audio signals on the channels, e.g. by beamformers (200), to improve their signal-to-noise ratio. The processed audio signal on each channel is then fed to an associated voice activity detection algorithm (202) and further processed. A voice or silence determination is then rendered by decision logic (204) based on at least the output of the voice activity detection algorithms. The method is useful in talker localization systems, e.g. for teleconferencing.","PeriodicalId":429016,"journal":{"name":"ACM Crossroads","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"67","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Crossroads","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1315325.1315332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A method for detecting voice activity comprises receiving audio signals on a plurality of channels and processing the audio signals on the channels, e.g. by beamformers (200), to improve their signal-to-noise ratio. The processed audio signal on each channel is then fed to an associated voice activity detection algorithm (202) and further processed. A voice or silence determination is then rendered by decision logic (204) based on at least the output of the voice activity detection algorithms. The method is useful in talker localization systems, e.g. for teleconferencing.
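
The three stages described above (per-channel enhancement, per-channel voice activity detection, and a combining decision) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the source: each "channel" is modelled as one delay-and-sum beam steered with integer sample delays, the detector is a plain frame-energy threshold, the decision logic is a vote count, and the names `delay_and_sum`, `frame_energy_vad`, `decide`, `FRAME_LEN`, and `THRESHOLD` are hypothetical.

```python
import math
from typing import List, Sequence

FRAME_LEN = 40     # samples per analysis frame (illustrative value)
THRESHOLD = 0.1    # mean-square energy threshold (illustrative value)


def delay_and_sum(mics: Sequence[Sequence[float]], delays: Sequence[int]) -> List[float]:
    """Advance each microphone signal by its integer delay and average the result."""
    n = len(mics[0])
    out = []
    for t in range(n):
        acc = 0.0
        for sig, d in zip(mics, delays):
            idx = t + d
            if 0 <= idx < n:      # samples outside the buffer count as zero
                acc += sig[idx]
        out.append(acc / len(mics))
    return out


def frame_energy_vad(signal: Sequence[float], frame_len: int, threshold: float) -> List[bool]:
    """Return one flag per full frame: True when mean-square energy exceeds threshold."""
    flags = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        flags.append(energy > threshold)
    return flags


def decide(channel_flags: Sequence[Sequence[bool]], min_votes: int = 1) -> List[bool]:
    """Declare voice in a frame when at least min_votes channels flag it."""
    return [sum(frame) >= min_votes for frame in zip(*channel_flags)]


# Synthetic input: silence for 80 samples, then a tone with an 8-sample period.
speech = [math.sin(2 * math.pi * t / 8) if t >= 80 else 0.0 for t in range(160)]
mics = [speech, speech]          # two microphones, source arriving in phase

# Two beams: one steered at the source, one steered away (half-period offset,
# so the tone largely cancels in that beam).
beam_delays = [(0, 0), (0, 4)]
beams = [delay_and_sum(mics, d) for d in beam_delays]

# One detector per beam, then the combining decision.
flags = [frame_energy_vad(b, FRAME_LEN, THRESHOLD) for b in beams]
decision = decide(flags, min_votes=1)
print(decision)  # [False, False, True, True]
```

With `min_votes=1` a frame counts as voice as soon as any beam detects it, which suits talker localization because the index of the flagging beam also hints at the talker's direction. Raising `min_votes` trades missed detections for fewer false alarms.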