Contextual Fine-Tuning of Language Models with Classifier-Driven Content Moderation for Text Generation.

IF 2 3区物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY Entropy Pub Date : 2024-12-20 DOI:10.3390/e26121114

Matan Punnaivanam, Palani Velvizhy

{"title":"Contextual Fine-Tuning of Language Models with Classifier-Driven Content Moderation for Text Generation.","authors":"Matan Punnaivanam, Palani Velvizhy","doi":"10.3390/e26121114","DOIUrl":null,"url":null,"abstract":"<p><p>In today's digital age, ensuring the appropriateness of content for children is crucial for their cognitive and emotional development. The rise of automated text generation technologies, such as Large Language Models like LLaMA, Mistral, and Zephyr, has created a pressing need for effective tools to filter and classify suitable content. However, the existing methods often fail to effectively address the intricate details and unique characteristics of children's literature. This study aims to bridge this gap by developing a robust framework that utilizes fine-tuned language models, classification techniques, and contextual story generation to generate and classify children's stories based on their suitability. Employing a combination of fine-tuning techniques on models such as LLaMA, Mistral, and Zephyr, alongside a BERT-based classifier, we evaluated the generated stories against established metrics like ROUGE, METEOR, and BERT Scores. The fine-tuned Mistral-7B model achieved a ROUGE-1 score of 0.4785, significantly higher than the base model's 0.3185, while Zephyr-7B-Beta achieved a METEOR score of 0.4154 compared to its base counterpart's score of 0.3602. The results indicated that the fine-tuned models outperformed base models, generating content more aligned with human standards. Moreover, the BERT Classifier exhibited high precision (0.95) and recall (0.97) for identifying unsuitable content, further enhancing the reliability of content classification. These findings highlight the potential of advanced language models in generating age-appropriate stories and enhancing content moderation strategies. This research has broader implications for educational technology, content curation, and parental control systems, offering a scalable approach to ensuring children's exposure to safe and enriching narratives.</p>","PeriodicalId":11694,"journal":{"name":"Entropy","volume":"26 12","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11675295/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Entropy","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.3390/e26121114","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

In today's digital age, ensuring the appropriateness of content for children is crucial for their cognitive and emotional development. The rise of automated text generation technologies, such as Large Language Models like LLaMA, Mistral, and Zephyr, has created a pressing need for effective tools to filter and classify suitable content. However, the existing methods often fail to effectively address the intricate details and unique characteristics of children's literature. This study aims to bridge this gap by developing a robust framework that utilizes fine-tuned language models, classification techniques, and contextual story generation to generate and classify children's stories based on their suitability. Employing a combination of fine-tuning techniques on models such as LLaMA, Mistral, and Zephyr, alongside a BERT-based classifier, we evaluated the generated stories against established metrics like ROUGE, METEOR, and BERT Scores. The fine-tuned Mistral-7B model achieved a ROUGE-1 score of 0.4785, significantly higher than the base model's 0.3185, while Zephyr-7B-Beta achieved a METEOR score of 0.4154 compared to its base counterpart's score of 0.3602. The results indicated that the fine-tuned models outperformed base models, generating content more aligned with human standards. Moreover, the BERT Classifier exhibited high precision (0.95) and recall (0.97) for identifying unsuitable content, further enhancing the reliability of content classification. These findings highlight the potential of advanced language models in generating age-appropriate stories and enhancing content moderation strategies. This research has broader implications for educational technology, content curation, and parental control systems, offering a scalable approach to ensuring children's exposure to safe and enriching narratives.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于分类器驱动的文本生成内容调节的语言模型语境微调。

在今天的数字时代，确保儿童内容的适当性对他们的认知和情感发展至关重要。自动文本生成技术的兴起，如大型语言模型（如LLaMA、Mistral和Zephyr），产生了对过滤和分类合适内容的有效工具的迫切需求。然而，现有的方法往往不能有效地处理儿童文学的复杂细节和独特性。本研究旨在通过开发一个强大的框架来弥合这一差距，该框架利用微调的语言模型、分类技术和上下文故事生成来根据儿童故事的适用性生成和分类。在LLaMA、Mistral和Zephyr等模型上结合使用微调技术，以及基于BERT的分类器，我们根据ROUGE、METEOR和BERT Scores等既定指标评估生成的故事。微调后的Mistral-7B模型获得了0.4785的ROUGE-1得分，显著高于基础模型的0.3185，而Zephyr-7B-Beta模型获得了0.4154的METEOR得分，而基础模型的得分为0.3602。结果表明，经过微调的模型优于基本模型，生成的内容更符合人类标准。此外，BERT分类器在识别不合适内容方面具有较高的准确率（0.95）和召回率（0.97），进一步提高了内容分类的可靠性。这些发现强调了高级语言模型在生成适合年龄的故事和增强内容节制策略方面的潜力。这项研究对教育技术、内容管理和家长控制系统有更广泛的影响，提供了一种可扩展的方法来确保儿童接触安全和丰富的叙事。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Entropy PHYSICS, MULTIDISCIPLINARY-

CiteScore

4.90

自引率

11.10%

发文量

1580

审稿时长

21.05 days

期刊介绍： Entropy (ISSN 1099-4300), an international and interdisciplinary journal of entropy and information studies, publishes reviews, regular research papers and short notes. Our aim is to encourage scientists to publish as much as possible their theoretical and experimental details. There is no restriction on the length of the papers. If there are computation and the experiment, the details must be provided so that the results can be reproduced.