Vitaly Stepanyants, Mantsa Andzhusheva, A. Romanov
{"title":"A Pipeline for Traffic Accident Dataset Development","authors":"Vitaly Stepanyants, Mantsa Andzhusheva, A. Romanov","doi":"10.1109/SmartIndustryCon57312.2023.10110794","DOIUrl":null,"url":null,"abstract":"Many traffic accidents happen on the roads every day and a lot of them are captured on traffic or dashboard cameras. This data could be used to train machine learning models to predict dangerous situations so that they can be prevented. For that, it should be organized into datasets. Nowadays a limited amount of traffic accident datasets is available and those are not as well annotated as, for example, driving datasets with no accidents, used for training automated vehicles. Following this, our paper presents a review of existing traffic accident datasets. The search was carried out to provide a list of video datasets relevant to the analysis of dangerous situations involving vehicles. For each dataset under consideration, a brief description of the process of data collection and annotation is presented, and the structure and format of videos are analyzed. In addition, the sources of the video, their amount, and the method of splitting the video into fragments are indicated. Where possible, software tools used for video processing are listed. Further, the paper explores existing solutions for dataset development and annotation. Based on the performed analysis, we propose a pipeline for traffic accident dataset development.","PeriodicalId":157877,"journal":{"name":"2023 International Russian Smart Industry Conference (SmartIndustryCon)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Russian Smart Industry Conference (SmartIndustryCon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartIndustryCon57312.2023.10110794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Many traffic accidents happen on the roads every day and a lot of them are captured on traffic or dashboard cameras. This data could be used to train machine learning models to predict dangerous situations so that they can be prevented. For that, it should be organized into datasets. Nowadays a limited amount of traffic accident datasets is available and those are not as well annotated as, for example, driving datasets with no accidents, used for training automated vehicles. Following this, our paper presents a review of existing traffic accident datasets. The search was carried out to provide a list of video datasets relevant to the analysis of dangerous situations involving vehicles. For each dataset under consideration, a brief description of the process of data collection and annotation is presented, and the structure and format of videos are analyzed. In addition, the sources of the video, their amount, and the method of splitting the video into fragments are indicated. Where possible, software tools used for video processing are listed. Further, the paper explores existing solutions for dataset development and annotation. Based on the performed analysis, we propose a pipeline for traffic accident dataset development.