首页 > 最新文献

International Journal of Speech Technology最新文献

英文 中文
A computationally efficient speech emotion recognition system employing machine learning classifiers and ensemble learning 采用机器学习分类器和集合学习的计算高效语音情感识别系统
Q1 Arts and Humanities Pub Date : 2024-03-30 DOI: 10.1007/s10772-024-10095-8
N. Aishwarya, Kanwaljeet Kaur, Karthik Seemakurthy
{"title":"A computationally efficient speech emotion recognition system employing machine learning classifiers and ensemble learning","authors":"N. Aishwarya, Kanwaljeet Kaur, Karthik Seemakurthy","doi":"10.1007/s10772-024-10095-8","DOIUrl":"https://doi.org/10.1007/s10772-024-10095-8","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"23 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140364321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech recognition based on the transformer's multi-head attention in Arabic 基于变压器多头注意力的阿拉伯语语音识别
Q1 Arts and Humanities Pub Date : 2024-03-29 DOI: 10.1007/s10772-024-10092-x
Omayma Mahmoudi, Mouncef Filali-Bouami, Mohamed Benchat
{"title":"Speech recognition based on the transformer's multi-head attention in Arabic","authors":"Omayma Mahmoudi, Mouncef Filali-Bouami, Mohamed Benchat","doi":"10.1007/s10772-024-10092-x","DOIUrl":"https://doi.org/10.1007/s10772-024-10092-x","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"42 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140368252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature extraction using GTCC spectrogram and ResNet50 based classification for audio spoof detection 利用 GTCC 频谱和基于 ResNet50 的分类进行特征提取,用于音频欺骗检测
Q1 Arts and Humanities Pub Date : 2024-03-29 DOI: 10.1007/s10772-024-10093-w
N. Chakravarty, Mohit Dua
{"title":"Feature extraction using GTCC spectrogram and ResNet50 based classification for audio spoof detection","authors":"N. Chakravarty, Mohit Dua","doi":"10.1007/s10772-024-10093-w","DOIUrl":"https://doi.org/10.1007/s10772-024-10093-w","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"62 20","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140367823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conditional Denoising Diffusion Implicit Model for Speech Enhancement 用于语音增强的条件去噪扩散隐含模型
Q1 Arts and Humanities Pub Date : 2024-03-26 DOI: 10.1007/s10772-024-10091-y
Chengyong Yang, Xiukang Yu, Sheng Huang
{"title":"Conditional Denoising Diffusion Implicit Model for Speech Enhancement","authors":"Chengyong Yang, Xiukang Yu, Sheng Huang","doi":"10.1007/s10772-024-10091-y","DOIUrl":"https://doi.org/10.1007/s10772-024-10091-y","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140378747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stockwell-Transform based feature representation for detection and assessment of voice disorders 基于斯托克韦尔变换的特征表示法检测和评估嗓音疾病
Q1 Arts and Humanities Pub Date : 2024-02-29 DOI: 10.1007/s10772-024-10085-w
Purva Barche, K. Gurugubelli, A. Vuppala
{"title":"Stockwell-Transform based feature representation for detection and assessment of voice disorders","authors":"Purva Barche, K. Gurugubelli, A. Vuppala","doi":"10.1007/s10772-024-10085-w","DOIUrl":"https://doi.org/10.1007/s10772-024-10085-w","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"12 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140412538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Automated detection system for texture feature based classification on different image datasets using S-transform 更正为使用 S 变换对不同图像数据集进行基于纹理特征分类的自动检测系统
Q1 Arts and Humanities Pub Date : 2024-01-29 DOI: 10.1007/s10772-024-10083-y
O. Kesav, G. K. Rajini
{"title":"Correction to: Automated detection system for texture feature based classification on different image datasets using S-transform","authors":"O. Kesav, G. K. Rajini","doi":"10.1007/s10772-024-10083-y","DOIUrl":"https://doi.org/10.1007/s10772-024-10083-y","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"58 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140487019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review on speech emotion recognition for late deafened educators in online education 晚聋教育工作者在线教育语音情感识别综述
Q1 Arts and Humanities Pub Date : 2024-01-24 DOI: 10.1007/s10772-023-10064-7
Aparna Vyakaranam, Tomas Maul, Bavani Ramayah
{"title":"A review on speech emotion recognition for late deafened educators in online education","authors":"Aparna Vyakaranam, Tomas Maul, Bavani Ramayah","doi":"10.1007/s10772-023-10064-7","DOIUrl":"https://doi.org/10.1007/s10772-023-10064-7","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"60 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139601433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancements in encoded speech data by background noise suppression under uncontrolled environment 在不受控环境下抑制背景噪声,推动编码语音数据的发展
Q1 Arts and Humanities Pub Date : 2024-01-06 DOI: 10.1007/s10772-023-10078-1
B. G. Nagaraja, G. T. Yadava, Mohamed Anees
{"title":"Advancements in encoded speech data by background noise suppression under uncontrolled environment","authors":"B. G. Nagaraja, G. T. Yadava, Mohamed Anees","doi":"10.1007/s10772-023-10078-1","DOIUrl":"https://doi.org/10.1007/s10772-023-10078-1","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139380910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scene text visual question answering by using YOLO and STN 利用 YOLO 和 STN 进行场景文本可视化问题解答
Q1 Arts and Humanities Pub Date : 2024-01-03 DOI: 10.1007/s10772-023-10081-6
Kimiya Nourali, Elham Dolkhani
{"title":"Scene text visual question answering by using YOLO and STN","authors":"Kimiya Nourali, Elham Dolkhani","doi":"10.1007/s10772-023-10081-6","DOIUrl":"https://doi.org/10.1007/s10772-023-10081-6","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"8 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139389461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An optimized convolutional neural network for speech enhancement 用于语音增强的优化卷积神经网络
Q1 Arts and Humanities Pub Date : 2023-12-29 DOI: 10.1007/s10772-023-10073-6
A. Karthik, J. L. Mazher Iqbal
{"title":"An optimized convolutional neural network for speech enhancement","authors":"A. Karthik, J. L. Mazher Iqbal","doi":"10.1007/s10772-023-10073-6","DOIUrl":"https://doi.org/10.1007/s10772-023-10073-6","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":" 46","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139144647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Speech Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1