{"title":"基于迁移学习的图像识别多模态特征表征模型","authors":"Nupoor Yawale, Neeraj Sahu, Nikkoo Khalsa","doi":"10.1007/s40009-024-01402-7","DOIUrl":null,"url":null,"abstract":"<p>Digital image classification assists in distinguishing natural and synthetic images to detect computer-generated objects. However, CGI improvements make it difficult to discern synthetic photos from genuine ones. Researchers suggest multiple deep learning strategies to differentiate these photo sets utilizing thorough feature analysis. These models are either complex or do not handle image sub-components, decreasing efficiency in large-scale applications. These models fail categorically. To address these issues, this work proposes a novel high-density bio-inspired feature analysis deep learning model for natural and synthetic image sub-classification. A YoLo model initially recognizes objects in input image sets. Processed separately, a hybrid LSTM/GRU model predicts high-density feature sets, which are processed by Elephant Herding Optimization (EHO) Models to identify high inter-class variance feature sets. A customized 1D CNN model is used to categorize the desired features into natural and synthetic components. These classification results establish whether the input image is natural, synthetic, or both. In real-time scenarios, the proposed model is able to improve standard classification models with 8.7% greater accuracy, 10.9% higher precision, 3.2% higher recall, and 8.4% higher AUC.</p>","PeriodicalId":717,"journal":{"name":"National Academy Science Letters","volume":"73 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Multimodal Feature Representation Model for Transfer-Learning-Based Identification of Images\",\"authors\":\"Nupoor Yawale, Neeraj Sahu, Nikkoo Khalsa\",\"doi\":\"10.1007/s40009-024-01402-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Digital image classification assists in distinguishing natural and synthetic images to detect computer-generated objects. However, CGI improvements make it difficult to discern synthetic photos from genuine ones. Researchers suggest multiple deep learning strategies to differentiate these photo sets utilizing thorough feature analysis. These models are either complex or do not handle image sub-components, decreasing efficiency in large-scale applications. These models fail categorically. To address these issues, this work proposes a novel high-density bio-inspired feature analysis deep learning model for natural and synthetic image sub-classification. A YoLo model initially recognizes objects in input image sets. Processed separately, a hybrid LSTM/GRU model predicts high-density feature sets, which are processed by Elephant Herding Optimization (EHO) Models to identify high inter-class variance feature sets. A customized 1D CNN model is used to categorize the desired features into natural and synthetic components. These classification results establish whether the input image is natural, synthetic, or both. In real-time scenarios, the proposed model is able to improve standard classification models with 8.7% greater accuracy, 10.9% higher precision, 3.2% higher recall, and 8.4% higher AUC.</p>\",\"PeriodicalId\":717,\"journal\":{\"name\":\"National Academy Science Letters\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"National Academy Science Letters\",\"FirstCategoryId\":\"4\",\"ListUrlMain\":\"https://doi.org/10.1007/s40009-024-01402-7\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"National Academy Science Letters","FirstCategoryId":"4","ListUrlMain":"https://doi.org/10.1007/s40009-024-01402-7","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A Multimodal Feature Representation Model for Transfer-Learning-Based Identification of Images
Digital image classification assists in distinguishing natural and synthetic images to detect computer-generated objects. However, CGI improvements make it difficult to discern synthetic photos from genuine ones. Researchers suggest multiple deep learning strategies to differentiate these photo sets utilizing thorough feature analysis. These models are either complex or do not handle image sub-components, decreasing efficiency in large-scale applications. These models fail categorically. To address these issues, this work proposes a novel high-density bio-inspired feature analysis deep learning model for natural and synthetic image sub-classification. A YoLo model initially recognizes objects in input image sets. Processed separately, a hybrid LSTM/GRU model predicts high-density feature sets, which are processed by Elephant Herding Optimization (EHO) Models to identify high inter-class variance feature sets. A customized 1D CNN model is used to categorize the desired features into natural and synthetic components. These classification results establish whether the input image is natural, synthetic, or both. In real-time scenarios, the proposed model is able to improve standard classification models with 8.7% greater accuracy, 10.9% higher precision, 3.2% higher recall, and 8.4% higher AUC.
期刊介绍:
The National Academy Science Letters is published by the National Academy of Sciences, India, since 1978. The publication of this unique journal was started with a view to give quick and wide publicity to the innovations in all fields of science