S. Islam, Riyad Ahsan Auntor, Minhajul Islam, Mohammad Yousuf Hossain Anik, A. Islam, Jannatun Noor
{"title":"Note: Towards Devising an Efficient VQA in the Bengali Language","authors":"S. Islam, Riyad Ahsan Auntor, Minhajul Islam, Mohammad Yousuf Hossain Anik, A. Islam, Jannatun Noor","doi":"10.1145/3530190.3534837","DOIUrl":null,"url":null,"abstract":"Designing and implementing visual question answering tasks using Bengali datasets and native VQA based smart systems are important, as a huge number of people speak in Bengali who are relatively less advanced to technology adoption due to the language barrier. The important designing and implementing tasks are little explored in the literature. Therefore, we attempt to investigate the tasks in depth in this study. To do so, we follow a step-by-step procedure for overcoming different barriers encountered while adopting datasets as well as creating our own Bengali CLEVR and Bengali VQA. We perform different sets of experiments to demonstrate the efficacy of our proposed approach. Various VQA-based smart systems for Bengali speakers covering virtual doctors, navigation systems, smart glasses for the visually-impaired people, and so on can be benefited from this study through making the applications usable and understandable to those who are not fluent in a foreign language such as English.","PeriodicalId":257424,"journal":{"name":"ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS)","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3530190.3534837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Designing and implementing visual question answering tasks using Bengali datasets and native VQA based smart systems are important, as a huge number of people speak in Bengali who are relatively less advanced to technology adoption due to the language barrier. The important designing and implementing tasks are little explored in the literature. Therefore, we attempt to investigate the tasks in depth in this study. To do so, we follow a step-by-step procedure for overcoming different barriers encountered while adopting datasets as well as creating our own Bengali CLEVR and Bengali VQA. We perform different sets of experiments to demonstrate the efficacy of our proposed approach. Various VQA-based smart systems for Bengali speakers covering virtual doctors, navigation systems, smart glasses for the visually-impaired people, and so on can be benefited from this study through making the applications usable and understandable to those who are not fluent in a foreign language such as English.