{"title":"基于聚类思想的科学仪器数据文件格式分析方法","authors":"Jianhui Zhou, Feng Sun, Hao Shi, Shuai Shen","doi":"10.1117/12.2653568","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of low analytical efficiency in the current analysis methods of data file format of scientific instruments, a data file format analysis method based on clustering was proposed to improve the efficiency of file format analysis. According to the file storage structure and the characteristics of cluster distribution, the selection principle of file samples in cluster analysis is formulated. At the same time, the corresponding format analysis auxiliary tool software is developed, which can automatically judge the rationality of the selected files and automatically group them, simplifying the corresponding format analysis process. The method and the developed tool are used to analyze the format of MS data generated by a mass spectrometry model. The experimental results show that the format of MS data obtained by this method is accurate and the efficiency is significantly improved. This method can effectively promote the sharing of data resources of large-scale scientific instruments and improve the utilization rate of data resources.","PeriodicalId":32903,"journal":{"name":"JITeCS Journal of Information Technology and Computer Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis method of scientific instrument data file format based on clustering idea\",\"authors\":\"Jianhui Zhou, Feng Sun, Hao Shi, Shuai Shen\",\"doi\":\"10.1117/12.2653568\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem of low analytical efficiency in the current analysis methods of data file format of scientific instruments, a data file format analysis method based on clustering was proposed to improve the efficiency of file format analysis. According to the file storage structure and the characteristics of cluster distribution, the selection principle of file samples in cluster analysis is formulated. At the same time, the corresponding format analysis auxiliary tool software is developed, which can automatically judge the rationality of the selected files and automatically group them, simplifying the corresponding format analysis process. The method and the developed tool are used to analyze the format of MS data generated by a mass spectrometry model. The experimental results show that the format of MS data obtained by this method is accurate and the efficiency is significantly improved. This method can effectively promote the sharing of data resources of large-scale scientific instruments and improve the utilization rate of data resources.\",\"PeriodicalId\":32903,\"journal\":{\"name\":\"JITeCS Journal of Information Technology and Computer Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JITeCS Journal of Information Technology and Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2653568\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JITeCS Journal of Information Technology and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2653568","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis method of scientific instrument data file format based on clustering idea
Aiming at the problem of low analytical efficiency in the current analysis methods of data file format of scientific instruments, a data file format analysis method based on clustering was proposed to improve the efficiency of file format analysis. According to the file storage structure and the characteristics of cluster distribution, the selection principle of file samples in cluster analysis is formulated. At the same time, the corresponding format analysis auxiliary tool software is developed, which can automatically judge the rationality of the selected files and automatically group them, simplifying the corresponding format analysis process. The method and the developed tool are used to analyze the format of MS data generated by a mass spectrometry model. The experimental results show that the format of MS data obtained by this method is accurate and the efficiency is significantly improved. This method can effectively promote the sharing of data resources of large-scale scientific instruments and improve the utilization rate of data resources.