评估 ChatGPT 生成的文本摘要的多指标方法

Q1 Business, Management and Accounting IEEE Engineering Management Review Pub Date : 2024-06-01 DOI:10.1109/EMR.2024.3381176

Jonas Benedikt Arnold;Dominik Hörauf

{"title":"评估 ChatGPT 生成的文本摘要的多指标方法","authors":"Jonas Benedikt Arnold;Dominik Hörauf","doi":"10.1109/EMR.2024.3381176","DOIUrl":null,"url":null,"abstract":"This article investigates the summarization capabilities of ChatGPT, a language model seen as effectively shortening texts, employing a hypothesis-generating and explorative approach. Using a specific prompt, the study examines the expected lengths of generated summaries across various input word counts (IWC). A shortening ratio is introduced to describe these relationships, with identified dependencies on IWCs between 100 and 400 words. The study also explores coherence comparisons, highlighting that the ChatGPT-generated text is often evaluated as more coherent than the original. The article introduces a multimetric approach for the evaluation and discusses dependencies of best case summaries on different input word counts, providing insights into the model's performance characteristics.","PeriodicalId":35585,"journal":{"name":"IEEE Engineering Management Review","volume":"52 3","pages":"43-53"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Multimetric Approach for Evaluation of ChatGPT-Generated Text Summaries\",\"authors\":\"Jonas Benedikt Arnold;Dominik Hörauf\",\"doi\":\"10.1109/EMR.2024.3381176\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article investigates the summarization capabilities of ChatGPT, a language model seen as effectively shortening texts, employing a hypothesis-generating and explorative approach. Using a specific prompt, the study examines the expected lengths of generated summaries across various input word counts (IWC). A shortening ratio is introduced to describe these relationships, with identified dependencies on IWCs between 100 and 400 words. The study also explores coherence comparisons, highlighting that the ChatGPT-generated text is often evaluated as more coherent than the original. The article introduces a multimetric approach for the evaluation and discusses dependencies of best case summaries on different input word counts, providing insights into the model's performance characteristics.\",\"PeriodicalId\":35585,\"journal\":{\"name\":\"IEEE Engineering Management Review\",\"volume\":\"52 3\",\"pages\":\"43-53\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Engineering Management Review\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10601574/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Business, Management and Accounting\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Engineering Management Review","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10601574/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Business, Management and Accounting","Score":null,"Total":0}

引用次数: 0

摘要

ChatGPT 是一种被视为能有效缩短文本的语言模型，本文采用假设生成和探索的方法，对 ChatGPT 的摘要能力进行了研究。该研究利用一个特定的提示，考察了不同输入字数（IWC）下生成摘要的预期长度。研究引入了缩短比率来描述这些关系，并确定了 100 到 400 字之间的 IWC 的依赖关系。研究还探讨了连贯性比较，强调 ChatGPT 生成的文本通常被评价为比原文更连贯。文章介绍了一种多指标评估方法，并讨论了最佳案例摘要对不同输入字数的依赖性，从而深入了解了该模型的性能特点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Multimetric Approach for Evaluation of ChatGPT-Generated Text Summaries

This article investigates the summarization capabilities of ChatGPT, a language model seen as effectively shortening texts, employing a hypothesis-generating and explorative approach. Using a specific prompt, the study examines the expected lengths of generated summaries across various input word counts (IWC). A shortening ratio is introduced to describe these relationships, with identified dependencies on IWCs between 100 and 400 words. The study also explores coherence comparisons, highlighting that the ChatGPT-generated text is often evaluated as more coherent than the original. The article introduces a multimetric approach for the evaluation and discusses dependencies of best case summaries on different input word counts, providing insights into the model's performance characteristics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Engineering Management Review Business, Management and Accounting-Management of Technology and Innovation

CiteScore

7.40

自引率

0.00%

发文量

期刊介绍： Reprints articles from other publications of significant interest to members. The papers are aimed at those engaged in managing research, development, or engineering activities. Reprints make it possible for the readers to receive the best of today"s literature without having to subscribe to and read other periodicals.

期刊最新文献

TechRxiv: Share Your Preprint Research With the World! Call for Papers for IEEE Engineering Management Review Call for Papers TEMSCON Global Montreal 2026 Call for Papers for IEEE Engineering Management Review Call for Papers TEMSCON Global Montreal 2026