{"title":"Zero-Shot Machine-Generated Text Detection Using Mixture of Large Language Models","authors":"Matthieu Dubois, François Yvon, Pablo Piantanida","doi":"arxiv-2409.07615","DOIUrl":null,"url":null,"abstract":"The dissemination of Large Language Models (LLMs), trained at scale, and\nendowed with powerful text-generating abilities has vastly increased the\nthreats posed by generative AI technologies by reducing the cost of producing\nharmful, toxic, faked or forged content. In response, various proposals have\nbeen made to automatically discriminate artificially generated from\nhuman-written texts, typically framing the problem as a classification problem.\nMost approaches evaluate an input document by a well-chosen detector LLM,\nassuming that low-perplexity scores reliably signal machine-made content. As\nusing one single detector can induce brittleness of performance, we instead\nconsider several and derive a new, theoretically grounded approach to combine\ntheir respective strengths. Our experiments, using a variety of generator LLMs,\nsuggest that our method effectively increases the robustness of detection.","PeriodicalId":501030,"journal":{"name":"arXiv - CS - Computation and Language","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computation and Language","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
The dissemination of Large Language Models (LLMs), trained at scale and endowed with powerful text-generating abilities, has vastly increased the threats posed by generative AI technologies by reducing the cost of producing harmful, toxic, faked or forged content. In response, various proposals have been made to automatically discriminate artificially generated texts from human-written ones, typically framing the task as a classification problem. Most approaches evaluate an input document with a well-chosen detector LLM, assuming that low perplexity scores reliably signal machine-made content. Since relying on a single detector can make performance brittle, we instead consider several detectors and derive a new, theoretically grounded approach to combining their respective strengths. Our experiments, using a variety of generator LLMs, suggest that our method effectively increases the robustness of detection.
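
To make the perplexity-based detection setting concrete, the sketch below scores a document with several causal LMs and combines the per-token losses with a plain average. This is only an illustration of the general idea, not the paper's method: the detector model names are arbitrary placeholders, and the naive mean stands in for the theoretically grounded combination rule derived in the paper.

```python
# Hypothetical sketch: score a text with several detector LLMs via perplexity
# and combine the scores. The simple average below is NOT the paper's rule;
# model names are arbitrary assumptions for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

DETECTORS = ["gpt2", "distilgpt2"]  # assumed detector LLMs, not from the paper


def mean_neg_log_likelihood(text: str, model, tokenizer) -> float:
    """Average negative log-likelihood per token of `text` under `model`."""
    enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        out = model(**enc, labels=enc["input_ids"])
    return out.loss.item()  # mean cross-entropy per token


def mixture_score(text: str) -> float:
    """Lower scores suggest machine-generated text under the perplexity heuristic."""
    scores = []
    for name in DETECTORS:
        tokenizer = AutoTokenizer.from_pretrained(name)
        model = AutoModelForCausalLM.from_pretrained(name).eval()
        scores.append(mean_neg_log_likelihood(text, model, tokenizer))
    # Naive average of detector scores; the paper derives a principled combination.
    return sum(scores) / len(scores)


if __name__ == "__main__":
    print(mixture_score("The quick brown fox jumps over the lazy dog."))
```

A downstream classifier would then threshold or calibrate this combined score to decide between human-written and machine-generated; the threshold choice is left open here, as the abstract does not specify it.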