X K Ai, W Zheng, M Zhang, Y H Ding, D L Chen, Z Y Chen, C S Shen, B H Guo, N C Wang, Z J Yang, Z P Chen, Y Pan, B Shen, B J Xiao, J-TEXT Team1
{"title":"Cross-tokamak deployment study of plasma disruption predictors based on convolutional autoencoder","authors":"X K Ai, W Zheng, M Zhang, Y H Ding, D L Chen, Z Y Chen, C S Shen, B H Guo, N C Wang, Z J Yang, Z P Chen, Y Pan, B Shen, B J Xiao, J-TEXT Team1","doi":"10.1088/1361-6587/ad5934","DOIUrl":null,"url":null,"abstract":"In the initial stages of operation for future tokamak, facing limited data availability, deploying data-driven disruption predictors requires optimal performance with minimal use of new device data. This paper studies the issue of data utilization in data-driven disruption predictor during cross tokamak deployment. Current predictors primarily employ supervised learning methods and require a large number of disruption and non-disruption shots for training. However, the scarcity and high cost of obtaining disruption shots for future tokamaks result in imbalanced training datasets, reducing the performance of supervised learning predictors. To solve this problem, we propose the Enhanced Convolutional Autoencoder Anomaly Detection (E-CAAD) predictor. E-CAAD can be trained only by non-disruption samples and can also be trained by disruption precursor samples when disruption shots occur. This model not only overcomes the sample imbalance in supervised learning predictors, but also overcomes the inefficient dataset utilization faced by traditional anomaly detection predictors that cannot use disruption precursor samples for training, making it more suitable for the unpredictable datasets of future tokamaks. Compared to traditional anomaly detection predictors, the E-CAAD predictor performs better in disruption prediction and is deployed faster on new devices. Additionally, we explore strategies to accelerate the deployment of the E-CAAD predictor on the new device by using data from existing devices. Two deployment strategies are presented: mixing data from existing devices and fine-tuning the predictor trained on existing devices. Our comparisons indicate that the data from existing device can accelerate the deployment of predictor on new device. Notably, the fine-tuning strategy yields the fastest deployment on new device among the designed strategies.","PeriodicalId":20239,"journal":{"name":"Plasma Physics and Controlled Fusion","volume":"42 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plasma Physics and Controlled Fusion","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1088/1361-6587/ad5934","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, FLUIDS & PLASMAS","Score":null,"Total":0}
引用次数: 0
Abstract
In the initial stages of operation for future tokamak, facing limited data availability, deploying data-driven disruption predictors requires optimal performance with minimal use of new device data. This paper studies the issue of data utilization in data-driven disruption predictor during cross tokamak deployment. Current predictors primarily employ supervised learning methods and require a large number of disruption and non-disruption shots for training. However, the scarcity and high cost of obtaining disruption shots for future tokamaks result in imbalanced training datasets, reducing the performance of supervised learning predictors. To solve this problem, we propose the Enhanced Convolutional Autoencoder Anomaly Detection (E-CAAD) predictor. E-CAAD can be trained only by non-disruption samples and can also be trained by disruption precursor samples when disruption shots occur. This model not only overcomes the sample imbalance in supervised learning predictors, but also overcomes the inefficient dataset utilization faced by traditional anomaly detection predictors that cannot use disruption precursor samples for training, making it more suitable for the unpredictable datasets of future tokamaks. Compared to traditional anomaly detection predictors, the E-CAAD predictor performs better in disruption prediction and is deployed faster on new devices. Additionally, we explore strategies to accelerate the deployment of the E-CAAD predictor on the new device by using data from existing devices. Two deployment strategies are presented: mixing data from existing devices and fine-tuning the predictor trained on existing devices. Our comparisons indicate that the data from existing device can accelerate the deployment of predictor on new device. Notably, the fine-tuning strategy yields the fastest deployment on new device among the designed strategies.
期刊介绍:
Plasma Physics and Controlled Fusion covers all aspects of the physics of hot, highly ionised plasmas. This includes results of current experimental and theoretical research on all aspects of the physics of high-temperature plasmas and of controlled nuclear fusion, including the basic phenomena in highly-ionised gases in the laboratory, in the ionosphere and in space, in magnetic-confinement and inertial-confinement fusion as well as related diagnostic methods.
Papers with a technological emphasis, for example in such topics as plasma control, fusion technology and diagnostics, are welcomed when the plasma physics is an integral part of the paper or when the technology is unique to plasma applications or new to the field of plasma physics. Papers on dusty plasma physics are welcome when there is a clear relevance to fusion.