Machine Interpretation of Medical Images Using Deep Learning
Vidhi Chhatbar, Mihir Gondhalekar, Shruti Pimple, R. Pawar
2021 2nd Global Conference for Advancement in Technology (GCAT), published 2021-10-01
DOI: 10.1109/GCAT52182.2021.9587518 (https://doi.org/10.1109/GCAT52182.2021.9587518)
Citations: 0
Abstract
Biomedical images often come without any accompanying description, which makes them difficult to interpret. Image captioning is the process of generating a textual description of an image based on the objects and actions it contains. Building on advances in deep learning, we will build models that generate captions for biomedical images. Such a model can help accelerate diagnosis by describing the abnormalities present in an image. The model will be based on an encoder-decoder framework with an attention mechanism: the encoder will use a deep CNN to extract image features, and the decoder will use transformers to generate captions. Caption generation involves several stages: collecting the data set, training the model, validating the model, preparing the trained model to test an image, detecting the image contents, and generating the captions.
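The attention step that links the encoder to the decoder can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes scaled dot-product attention, the operation commonly used in transformer decoders, and it simplifies by using the image-region features directly as both keys and values, with no learned projections. The function names, feature vectors, and query below are hypothetical values chosen for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(query, region_features):
    """Scaled dot-product attention of one decoder query over image regions.

    Assumption: region features act as both keys and values; a real
    transformer layer would first apply learned linear projections.
    """
    dim = len(query)
    scores = [
        sum(q * k for q, k in zip(query, feat)) / math.sqrt(dim)
        for feat in region_features
    ]
    weights = softmax(scores)
    # Context vector: attention-weighted sum of the region features.
    context = [
        sum(w * feat[i] for w, feat in zip(weights, region_features))
        for i in range(dim)
    ]
    return weights, context


# Hypothetical 4-dimensional CNN features for three image regions.
region_features = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]
# Hypothetical decoder query that aligns with the first region.
query = [4.0, 0.0, 0.0, 0.0]

weights, context = cross_attention(query, region_features)
print("attention weights:", weights)
print("context vector:", context)
```

In this toy setup the first region receives the largest weight because the query aligns with it. In a captioning model, the decoder would compute such a context vector at each step to decide which part of the image informs the next word.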