Boyu Zhao, Zhicheng Dong, Jie Li, Bin Zhao, Pengfei Li
{"title":"DetachMix: Color Channels Fusion for Training Object Detection Neural Networks","authors":"Boyu Zhao, Zhicheng Dong, Jie Li, Bin Zhao, Pengfei Li","doi":"10.1109/ICESIT53460.2021.9696486","DOIUrl":null,"url":null,"abstract":"Pre-training strategies greatly improve various image classification model's accuracy. They have proved to be effective for guiding the model to attend on objects in complex samples, thereby letting the network perform better without adding extra inference cost. However, the training strategies and pipelines dramatically vary among different models, and they only change the complexity of the samples, but do not really combine the targets with the training process of the network. We therefore propose the DetachMix augmentation strategy, and it is divided into two steps: the first step is to segment the picture according to the color channels and train them separately on the network.The second step is to merge the first convolutional layer of the obtained weight files to replace the one obtained during normal training. The network model gained through DetachMix can not only combine images with network training, but also can be used in combination with other methods of processing samples (such as cutmix [1], Mixup [2], etc.). The method we proposed improves network performance by improving the network's ability to extract original semantic information from picture, and it is applicable to all models with the same reasoning time. We conducted a test on the collected Person Head State dataset and compared our method with the latest data processing methods. DetachMix can improve up to at most 8.3% precision compared to state-of-the-art baselines,reached 91.43 %.","PeriodicalId":164745,"journal":{"name":"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICESIT53460.2021.9696486","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pre-training strategies greatly improve various image classification model's accuracy. They have proved to be effective for guiding the model to attend on objects in complex samples, thereby letting the network perform better without adding extra inference cost. However, the training strategies and pipelines dramatically vary among different models, and they only change the complexity of the samples, but do not really combine the targets with the training process of the network. We therefore propose the DetachMix augmentation strategy, and it is divided into two steps: the first step is to segment the picture according to the color channels and train them separately on the network.The second step is to merge the first convolutional layer of the obtained weight files to replace the one obtained during normal training. The network model gained through DetachMix can not only combine images with network training, but also can be used in combination with other methods of processing samples (such as cutmix [1], Mixup [2], etc.). The method we proposed improves network performance by improving the network's ability to extract original semantic information from picture, and it is applicable to all models with the same reasoning time. We conducted a test on the collected Person Head State dataset and compared our method with the latest data processing methods. DetachMix can improve up to at most 8.3% precision compared to state-of-the-art baselines,reached 91.43 %.