Jinxun Li, Tingjun Wang, Chao Ma, Yunxuan Lin, Qing Yan
{"title":"A file archival integrity check method based on the BiLSTM + CNN model and deep learning","authors":"Jinxun Li, Tingjun Wang, Chao Ma, Yunxuan Lin, Qing Yan","doi":"10.1016/j.eij.2024.100597","DOIUrl":null,"url":null,"abstract":"<div><div>Validating and integrity-checking archives ensures that files are authentic, trustworthy, and usable. In the age of digital technology, historical records must be genuine. Researching in archives raises ethical issues while having little to do with individuals. Traditional archive integrity solutions have scaling issues, real-time monitoring issues, and missed opportunities. An updated Archive File Integrity Check Method (AFICM) may solve these issues, and the paper explains it. Deep learning allows the combination of a Bidirectional Long-Short Term Memory (Bi-LSTM) with adaptive gating and an adaptive Temporal Convolutional Neural Network (TCNN) with multi-scale temporal attention. This method protects archived material against manipulation, which is crucial. The recommended method extracts complex sequential patterns and variants using adaptive TCNN trained on file data. Next, it analyzes these features using a Bi-LSTM network and attenuation method. It allows it to highlight significant temporal correlations while downplaying irrelevant data selectively. The hybrid model outperforms checksums in accuracy and dependability. It uses adaptive TCNNs for time-related feature extraction and attenuated Bi-LSTM for refinement. The F1 score, recall, accuracy, precision, and AU-ROC are critical measures for model evaluation. The AICM performed well overall, with 97.32% precision and 98.95% accuracy. This integrity check method outperforms others with an F1 score of 97.58, an AU-ROC of 0.983, and a recall rate of 98.18%. The findings set a new standard for archiving system integrity testing by showing the model’s dependability and security in several use scenarios.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"29 ","pages":"Article 100597"},"PeriodicalIF":5.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866524001609","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Validating and integrity-checking archives ensures that files are authentic, trustworthy, and usable. In the age of digital technology, historical records must be genuine. Researching in archives raises ethical issues while having little to do with individuals. Traditional archive integrity solutions have scaling issues, real-time monitoring issues, and missed opportunities. An updated Archive File Integrity Check Method (AFICM) may solve these issues, and the paper explains it. Deep learning allows the combination of a Bidirectional Long-Short Term Memory (Bi-LSTM) with adaptive gating and an adaptive Temporal Convolutional Neural Network (TCNN) with multi-scale temporal attention. This method protects archived material against manipulation, which is crucial. The recommended method extracts complex sequential patterns and variants using adaptive TCNN trained on file data. Next, it analyzes these features using a Bi-LSTM network and attenuation method. It allows it to highlight significant temporal correlations while downplaying irrelevant data selectively. The hybrid model outperforms checksums in accuracy and dependability. It uses adaptive TCNNs for time-related feature extraction and attenuated Bi-LSTM for refinement. The F1 score, recall, accuracy, precision, and AU-ROC are critical measures for model evaluation. The AICM performed well overall, with 97.32% precision and 98.95% accuracy. This integrity check method outperforms others with an F1 score of 97.58, an AU-ROC of 0.983, and a recall rate of 98.18%. The findings set a new standard for archiving system integrity testing by showing the model’s dependability and security in several use scenarios.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.