{"title":"Qualifying data on railroad track vibrations: a hybrid data preprocessing flow of statistical and machine learning approaches","authors":"Chih-Chiang Lin, Zheng-Yun Zhuang","doi":"10.1080/02533839.2023.2262718","DOIUrl":null,"url":null,"abstract":"ABSTRACT With the growing trend for increased train speed, steel rails may suffer from quality problems due to both overloading and/or the high speed of moving trains. However, before any further analysis can be performed to gain in-depth knowledge, the relevant vibration data sets must be curated, cleansed, preprocessed, and filtered very carefully after they are recorded and collected by the installed sensor equipment. This study proposes a systematic methodological flow to obtain data sets ready for subsequent analysis from messy source data. It hybridized several statistical and unsupervised machine learning methods, with the final aim to establish meaningful rules to determine suitable data sets by referring to domain knowledge. This flow was verified using a relatively large database of records of physical vibrations measured in 2019 at specific locations along a curve of an actual railroad track. As the flow can be used to qualify empirical data sets required in practice, further analysis is provided for the effectiveness of each rule, differences in determination between the rules, and the effects of combining more than one rule.","PeriodicalId":17313,"journal":{"name":"Journal of the Chinese Institute of Engineers","volume":"74 1","pages":"0"},"PeriodicalIF":1.0000,"publicationDate":"2023-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Chinese Institute of Engineers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/02533839.2023.2262718","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
ABSTRACT With the growing trend for increased train speed, steel rails may suffer from quality problems due to both overloading and/or the high speed of moving trains. However, before any further analysis can be performed to gain in-depth knowledge, the relevant vibration data sets must be curated, cleansed, preprocessed, and filtered very carefully after they are recorded and collected by the installed sensor equipment. This study proposes a systematic methodological flow to obtain data sets ready for subsequent analysis from messy source data. It hybridized several statistical and unsupervised machine learning methods, with the final aim to establish meaningful rules to determine suitable data sets by referring to domain knowledge. This flow was verified using a relatively large database of records of physical vibrations measured in 2019 at specific locations along a curve of an actual railroad track. As the flow can be used to qualify empirical data sets required in practice, further analysis is provided for the effectiveness of each rule, differences in determination between the rules, and the effects of combining more than one rule.
期刊介绍:
Encompassing a wide range of engineering disciplines and industrial applications, JCIE includes the following topics:
1.Chemical engineering
2.Civil engineering
3.Computer engineering
4.Electrical engineering
5.Electronics
6.Mechanical engineering
and fields related to the above.