WorkloadGPT: A Large Language Model Approach to Real-Time Detection of Pilot Workload

Q1 Mathematics Applied Sciences Pub Date : 2024-09-13 DOI:10.3390/app14188274

Yijing Gao, Lishengsa Yue, Jiahang Sun, Xiaonian Shan, Yihan Liu, Xuerui Wu

{"title":"WorkloadGPT: A Large Language Model Approach to Real-Time Detection of Pilot Workload","authors":"Yijing Gao, Lishengsa Yue, Jiahang Sun, Xiaonian Shan, Yihan Liu, Xuerui Wu","doi":"10.3390/app14188274","DOIUrl":null,"url":null,"abstract":"The occurrence of flight risks and accidents is closely related to pilot workload. Effective detection of pilot workload has been a key research area in the aviation industry. However, traditional methods for detecting pilot workload have several shortcomings: firstly, the collection of metrics via contact-based devices can interfere with pilots; secondly, real-time detection of pilot workload is challenging, making it difficult to capture sudden increases in workload; thirdly, the detection accuracy of these models is limited; fourthly, the models lack cross-pilot generalization. To address these challenges, this study proposes a large language model, WorkloadGPT, which utilizes low-interference indicators: eye movement and seat pressure. Specifically, features are extracted in 10 s time windows and input into WorkloadGPT for classification into low, medium, and high workload categories. Additionally, this article presents the design of an appropriate text template to serialize the tabular feature dataset into natural language, incorporating individual difference prompts during instance construction to enhance cross-pilot generalization. Finally, the LoRA algorithm was used to fine-tune the pre-trained large language model ChatGLM3-6B, resulting in WorkloadGPT. During the training process of WorkloadGPT, the GAN-Ensemble algorithm was employed to augment the experimental raw data, constructing a realistic and robust extended dataset for model training. The results show that WorkloadGPT achieved a classification accuracy of 87.3%, with a cross-pilot standard deviation of only 2.1% and a response time of just 1.76 s, overall outperforming existing studies in terms of accuracy, real-time performance, and cross-pilot generalization capability, thereby providing a solid foundation for enhancing flight safety.","PeriodicalId":8224,"journal":{"name":"Applied Sciences","volume":"21 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/app14188274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

The occurrence of flight risks and accidents is closely related to pilot workload. Effective detection of pilot workload has been a key research area in the aviation industry. However, traditional methods for detecting pilot workload have several shortcomings: firstly, the collection of metrics via contact-based devices can interfere with pilots; secondly, real-time detection of pilot workload is challenging, making it difficult to capture sudden increases in workload; thirdly, the detection accuracy of these models is limited; fourthly, the models lack cross-pilot generalization. To address these challenges, this study proposes a large language model, WorkloadGPT, which utilizes low-interference indicators: eye movement and seat pressure. Specifically, features are extracted in 10 s time windows and input into WorkloadGPT for classification into low, medium, and high workload categories. Additionally, this article presents the design of an appropriate text template to serialize the tabular feature dataset into natural language, incorporating individual difference prompts during instance construction to enhance cross-pilot generalization. Finally, the LoRA algorithm was used to fine-tune the pre-trained large language model ChatGLM3-6B, resulting in WorkloadGPT. During the training process of WorkloadGPT, the GAN-Ensemble algorithm was employed to augment the experimental raw data, constructing a realistic and robust extended dataset for model training. The results show that WorkloadGPT achieved a classification accuracy of 87.3%, with a cross-pilot standard deviation of only 2.1% and a response time of just 1.76 s, overall outperforming existing studies in terms of accuracy, real-time performance, and cross-pilot generalization capability, thereby providing a solid foundation for enhancing flight safety.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

WorkloadGPT：实时检测试点工作量的大型语言模型方法

飞行风险和事故的发生与飞行员的工作量密切相关。有效检测飞行员工作量一直是航空业的重点研究领域。然而，传统的飞行员工作量检测方法存在一些不足：首先，通过接触式设备收集指标会干扰飞行员；其次，飞行员工作量的实时检测具有挑战性，难以捕捉突然增加的工作量；第三，这些模型的检测精度有限；第四，模型缺乏跨飞行员的泛化。为了应对这些挑战，本研究提出了一种大型语言模型 WorkloadGPT，它利用了低干扰指标：眼球运动和座椅压力。具体来说，在 10 秒的时间窗口中提取特征并输入 WorkloadGPT，以便将其分为低、中和高工作量类别。此外，本文还介绍了如何设计适当的文本模板，将表格特征数据集序列化为自然语言，并在实例构建过程中纳入个体差异提示，以增强跨飞行员泛化能力。最后，使用 LoRA 算法对预先训练好的大型语言模型 ChatGLM3-6B 进行微调，最终形成 WorkloadGPT。在 WorkloadGPT 的训练过程中，采用了 GAN-Ensemble 算法来增强实验原始数据，为模型训练构建了一个真实、稳健的扩展数据集。结果表明，WorkloadGPT 的分类准确率达到 87.3%，跨飞行员标准偏差仅为 2.1%，响应时间仅为 1.76 秒，在准确率、实时性和跨飞行员泛化能力方面全面超越了现有研究，从而为提高飞行安全奠定了坚实的基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Applied Sciences Mathematics-Applied Mathematics

CiteScore

6.40

自引率

0.00%

发文量

审稿时长

11 weeks

期刊介绍： APPS is an international journal. APPS covers a wide spectrum of pure and applied mathematics in science and technology, promoting especially papers presented at Carpato-Balkan meetings. The Editorial Board of APPS takes a very active role in selecting and refereeing papers, ensuring the best quality of contemporary mathematics and its applications. APPS is abstracted in Zentralblatt für Mathematik. The APPS journal uses Double blind peer review.