Using expert-reviewed CSAM to train CNNs and its anthropological analysis

IF 1.2 4区 医学 Q3 MEDICINE, LEGAL Journal of forensic and legal medicine Pub Date : 2023-11-17 DOI:10.1016/j.jflm.2023.102619
Wojciech Oronowicz-Jaśkowiak , Tomasz Kozłowski , Marta Polańska , Jerzy Wojciechowski , Piotr Wasilewski , Dominik Ślęzak , Mirosław Kowaluk
{"title":"Using expert-reviewed CSAM to train CNNs and its anthropological analysis","authors":"Wojciech Oronowicz-Jaśkowiak ,&nbsp;Tomasz Kozłowski ,&nbsp;Marta Polańska ,&nbsp;Jerzy Wojciechowski ,&nbsp;Piotr Wasilewski ,&nbsp;Dominik Ślęzak ,&nbsp;Mirosław Kowaluk","doi":"10.1016/j.jflm.2023.102619","DOIUrl":null,"url":null,"abstract":"<div><p>Machine learning methods for the identification of child sexual abuse materials (CSAM) have been previously studied, however, they have serious limitations. Firstly, the training sets used to train the appropriate machine learning algorithms were not previously annotated by a forensic expert in anthropology. Secondly, previously presented solutions have rarely used models trained using real pornographic content involving children. Thirdly, previous studies have not presented a detailed justification for the classification decisions made, which is important due to the recent guidelines of the European Commission (Artificial Intelligence Act).</p><p>The aim of the study was to train convolution neural networks (CNNs) using expert-labelled CSAM images and thereby identify the elements of the body and/or the environment that are critical for classifications by the neural network. To train and evaluate machine learning models, we used 60,000 images equally divided into four classes (CSAM images, images displaying sexual activity involving adults, images of people without sexual activity, and images not containing people). We used four neural network architectures: MobileNet, ResNet152, xResNet152 and its modification ResNet-s, designed for the purpose of research.</p><p>The trained models provided high accuracy of classifying CSAM images: xResNet152 (F1 = 0.93, 92,8%), xResNet-s (F1 = 0.93, 93,1%), ResNet152 (F1 = 0.90, 91,39%), MobileNet (F1 ranged from 0.85 to 0.87, accuracy ranged from 86% to 87%).</p><p>The results of the conducted research suggest that using expert knowledge (in sexology and anthropology) significantly improved the accuracy of the models. In regard to further anthropological analysis, the results indicate that the breasts, face and torso are crucial areas for the classification of pornographic content with children's participation. Results suggests that the ResNet-s neural network may be a reliable tool for clinical work and to support the work of experts witnesses in the field of anthropology.</p><p>The study design received a positive opinion of the Ethics Committee of the Faculty of Mathematics, Informatics and Mechanics of the University of Warsaw. The clinical material was used for research purposes with the consent of the relevant prosecutor's offices. Authors provided free version of Windows application to classify CSAM for forensic experts, policemen and prosecutors at the OSF repository (DOI: <span>10.17605/OSF.IO/RU7JX</span><svg><path></path></svg>).</p></div>","PeriodicalId":16098,"journal":{"name":"Journal of forensic and legal medicine","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1752928X23001373/pdfft?md5=e9ed5388f1812d832a99cac0f9500712&pid=1-s2.0-S1752928X23001373-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of forensic and legal medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1752928X23001373","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning methods for the identification of child sexual abuse materials (CSAM) have been previously studied, however, they have serious limitations. Firstly, the training sets used to train the appropriate machine learning algorithms were not previously annotated by a forensic expert in anthropology. Secondly, previously presented solutions have rarely used models trained using real pornographic content involving children. Thirdly, previous studies have not presented a detailed justification for the classification decisions made, which is important due to the recent guidelines of the European Commission (Artificial Intelligence Act).

The aim of the study was to train convolution neural networks (CNNs) using expert-labelled CSAM images and thereby identify the elements of the body and/or the environment that are critical for classifications by the neural network. To train and evaluate machine learning models, we used 60,000 images equally divided into four classes (CSAM images, images displaying sexual activity involving adults, images of people without sexual activity, and images not containing people). We used four neural network architectures: MobileNet, ResNet152, xResNet152 and its modification ResNet-s, designed for the purpose of research.

The trained models provided high accuracy of classifying CSAM images: xResNet152 (F1 = 0.93, 92,8%), xResNet-s (F1 = 0.93, 93,1%), ResNet152 (F1 = 0.90, 91,39%), MobileNet (F1 ranged from 0.85 to 0.87, accuracy ranged from 86% to 87%).

The results of the conducted research suggest that using expert knowledge (in sexology and anthropology) significantly improved the accuracy of the models. In regard to further anthropological analysis, the results indicate that the breasts, face and torso are crucial areas for the classification of pornographic content with children's participation. Results suggests that the ResNet-s neural network may be a reliable tool for clinical work and to support the work of experts witnesses in the field of anthropology.

The study design received a positive opinion of the Ethics Committee of the Faculty of Mathematics, Informatics and Mechanics of the University of Warsaw. The clinical material was used for research purposes with the consent of the relevant prosecutor's offices. Authors provided free version of Windows application to classify CSAM for forensic experts, policemen and prosecutors at the OSF repository (DOI: 10.17605/OSF.IO/RU7JX).

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用专家评审的CSAM训练cnn及其人类学分析。
以前已经研究过用于识别儿童性虐待材料(CSAM)的机器学习方法,然而,它们有严重的局限性。首先,用于训练适当机器学习算法的训练集以前没有由人类学法医专家注释。其次,以前提出的解决方案很少使用使用涉及儿童的真实色情内容训练的模型。第三,之前的研究并没有为所做的分类决策提供详细的理由,这一点很重要,因为欧盟委员会(人工智能法案)最近的指导方针。该研究的目的是使用专家标记的CSAM图像来训练卷积神经网络(cnn),从而识别对神经网络分类至关重要的身体和/或环境元素。为了训练和评估机器学习模型,我们使用了6万张图像,平均分为四类(CSAM图像,显示涉及成年人的性活动的图像,没有性活动的人的图像和不包含人的图像)。我们使用了四种神经网络架构:MobileNet, ResNet152, xResNet152及其修改版ResNet-s,旨在进行研究。训练的模型对CSAM图像的分类精度较高:xResNet152 (F1 = 0.93, 92,8%)、xResNet-s (F1 = 0.93, 93,1%)、ResNet152 (F1 = 0.90, 91,39%)、MobileNet (F1范围为0.85 ~ 0.87,准确率为86% ~ 87%)。所进行的研究结果表明,使用专家知识(性学和人类学)显著提高了模型的准确性。关于进一步的人类学分析,结果表明,乳房,面部和躯干是儿童参与的色情内容分类的关键区域。结果表明,ResNet-s神经网络可能是临床工作和支持人类学领域专家证人工作的可靠工具。该研究设计得到了华沙大学数学、信息学和力学学院伦理委员会的积极评价。临床材料经有关检察官办公室同意用于研究目的。作者在OSF存储库(DOI: 10.17605/OSF. io /RU7JX)中为法医专家,警察和检察官提供了免费版本的Windows应用程序来分类CSAM。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.70
自引率
6.70%
发文量
106
审稿时长
57 days
期刊介绍: The Journal of Forensic and Legal Medicine publishes topical articles on aspects of forensic and legal medicine. Specifically the Journal supports research that explores the medical principles of care and forensic assessment of individuals, whether adult or child, in contact with the judicial system. It is a fully peer-review hybrid journal with a broad international perspective. The Journal accepts submissions of original research, review articles, and pertinent case studies, editorials, and commentaries in relevant areas of Forensic and Legal Medicine, Context of Practice, and Education and Training. The Journal adheres to strict publication ethical guidelines, and actively supports a culture of inclusive and representative publication.
期刊最新文献
Transfer and persistence of microbiota markers from the human hand to the knife: A preliminary study Analysis of non-metric dental traits for ethnicity identification in a mixed population of Uttar Pradesh An evaluation of potential challenges in forensic psychiatric assessment in Parkinson's disease Femicide in Ireland 2012–2023 A critical assessment of the diatom test of rib bone marrow as a supporting procedure in the case of drowning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1