{"title":"An effective image annotation using self-attention based stacked bidirectional capsule network","authors":"Vikas Palekar, Sathish Kumar L","doi":"10.1016/j.csi.2025.103973","DOIUrl":null,"url":null,"abstract":"<div><div>This paper provides an advanced hybrid deep learning (DL) system for accurate image annotation. The first step consists of input images being pre-processed using three techniques: i) cross-guided bilateral filtering, ii) image resizing and iii) colour conversion to the green channel. After pre-processing, key features such as shape, wavelet and texture are extracted using three models: the modified Walsh-Hadamard transform, the extended discrete wavelet transform and the grayscale run length matrix (GLRLM). Once the feature is extracted, the optimal features are selected using the Chaotic Coati Optimization (CCO) algorithm due to feature dimensionality issues. After selecting optimal features, image annotation is performed using the self-awareness-based Stacked Bidirectional Capsule Network (SA_SBiCapNet) model. The stacking Bidirectional long short term memory model (BiLSTM) approach with a capsule network is applied to annotate the given images. The accuracy rate of the proposed method is 0.99 %. Therefore, the proposed method uses a hybrid DL model to perform effective image annotation.</div></div>","PeriodicalId":50635,"journal":{"name":"Computer Standards & Interfaces","volume":"93 ","pages":"Article 103973"},"PeriodicalIF":4.1000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Standards & Interfaces","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0920548925000029","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
This paper provides an advanced hybrid deep learning (DL) system for accurate image annotation. The first step consists of input images being pre-processed using three techniques: i) cross-guided bilateral filtering, ii) image resizing and iii) colour conversion to the green channel. After pre-processing, key features such as shape, wavelet and texture are extracted using three models: the modified Walsh-Hadamard transform, the extended discrete wavelet transform and the grayscale run length matrix (GLRLM). Once the feature is extracted, the optimal features are selected using the Chaotic Coati Optimization (CCO) algorithm due to feature dimensionality issues. After selecting optimal features, image annotation is performed using the self-awareness-based Stacked Bidirectional Capsule Network (SA_SBiCapNet) model. The stacking Bidirectional long short term memory model (BiLSTM) approach with a capsule network is applied to annotate the given images. The accuracy rate of the proposed method is 0.99 %. Therefore, the proposed method uses a hybrid DL model to perform effective image annotation.
期刊介绍:
The quality of software, well-defined interfaces (hardware and software), the process of digitalisation, and accepted standards in these fields are essential for building and exploiting complex computing, communication, multimedia and measuring systems. Standards can simplify the design and construction of individual hardware and software components and help to ensure satisfactory interworking.
Computer Standards & Interfaces is an international journal dealing specifically with these topics.
The journal
• Provides information about activities and progress on the definition of computer standards, software quality, interfaces and methods, at national, European and international levels
• Publishes critical comments on standards and standards activities
• Disseminates user''s experiences and case studies in the application and exploitation of established or emerging standards, interfaces and methods
• Offers a forum for discussion on actual projects, standards, interfaces and methods by recognised experts
• Stimulates relevant research by providing a specialised refereed medium.