首页 > 最新文献

Proceedings of the 6th International Conference on Digital Signal Processing最新文献

英文 中文
Joint Power and Bandwidth Allocation for 3D Video SoftCast 3D视频软播的联合功率和带宽分配
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529600
Saqr Khalil Saeed Thabet, Emmanuel Osei-Mensah, Lei Luo, Ce Zhu
Unlike digital video transmission systems, SoftCast avoids the cliff effect and achieves a linear video quality transition that is commensurate with the wireless channel conditions. When SoftCast is applied to three-Dimension Video (3DV) transmission, resource allocation issues arise: 1) allocating the limited power budget to texture and depth to achieve the optimal overall quality. 2) distributing the suitable number of texture and depth chunks to adapt to bandwidth constraints. This work aims to efficiently solve the optimal joint power and bandwidth allocation problem. First, a power-distortion optimization problem is formulated to calculate the optimal Power Allocation Ratio (PAR) between texture and depth, then mapped to an unconstrained problem and solved using the Lagrangian multiplier. Finally, based on the closed-form of the optimal solution, an iterative algorithm is proposed to choose the suitable number of texture/depth chunks for a given bandwidth constraint. The proposed method achieves better performance than its counterpart default fixed-ratio power allocation between texture/depth. Further, we observe a graceful video quality transition with the improvement of channel conditions under bandwidth constraints.
与数字视频传输系统不同,SoftCast避免了悬崖效应,实现了与无线信道条件相称的线性视频质量过渡。当SoftCast应用于3DV (three- dimensional Video)传输时,出现了资源分配问题:1)将有限的功率预算分配给纹理和深度,以实现最佳的整体质量。2)分配适当数量的纹理和深度块,以适应带宽限制。该工作旨在有效地解决联合功率和带宽的最优分配问题。首先,建立功率失真优化问题,计算纹理和深度之间的最优功率分配比(PAR),然后将其映射为无约束问题,利用拉格朗日乘子进行求解。最后,基于最优解的封闭形式,提出了一种迭代算法,在给定带宽约束下选择合适数量的纹理/深度块。该方法比默认的纹理/深度之间的固定比例功率分配方法具有更好的性能。此外,在带宽限制下,随着信道条件的改善,我们观察到优美的视频质量过渡。
{"title":"Joint Power and Bandwidth Allocation for 3D Video SoftCast","authors":"Saqr Khalil Saeed Thabet, Emmanuel Osei-Mensah, Lei Luo, Ce Zhu","doi":"10.1145/3529570.3529600","DOIUrl":"https://doi.org/10.1145/3529570.3529600","url":null,"abstract":"Unlike digital video transmission systems, SoftCast avoids the cliff effect and achieves a linear video quality transition that is commensurate with the wireless channel conditions. When SoftCast is applied to three-Dimension Video (3DV) transmission, resource allocation issues arise: 1) allocating the limited power budget to texture and depth to achieve the optimal overall quality. 2) distributing the suitable number of texture and depth chunks to adapt to bandwidth constraints. This work aims to efficiently solve the optimal joint power and bandwidth allocation problem. First, a power-distortion optimization problem is formulated to calculate the optimal Power Allocation Ratio (PAR) between texture and depth, then mapped to an unconstrained problem and solved using the Lagrangian multiplier. Finally, based on the closed-form of the optimal solution, an iterative algorithm is proposed to choose the suitable number of texture/depth chunks for a given bandwidth constraint. The proposed method achieves better performance than its counterpart default fixed-ratio power allocation between texture/depth. Further, we observe a graceful video quality transition with the improvement of channel conditions under bandwidth constraints.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115649013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kalman Filter Using SOV Model with Maximum Versoria Criterion for Short-Term Traffic Flow Forecasting 基于最大Versoria准则的SOV模型卡尔曼滤波短期交通流预测
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529579
Tingting Jiang, Zhao Zhang
This paper proposes a prediction method by combining second-order Volterra (SOV) model and Kalman filter to further improve prediction accuracy of the traditional Kalman model in short-term traffic flow forecasting. Nonlinear relationship may exist in traffic flow data, but the traditional Kalman model cannot deal with this problem. Due to the second-order Volterra (SOV) filter can deal with a general class of nonlinear systems, the traditional Kalman combines with second-order Volterra model, named SOV-KF model, is presented. Furthermore, since the Gaussian assumption is not always be fulfilled in the traffic flow data and traditional minimum mean square error (MMSE) criterion do not perform well under non-Gaussian noises. By introducing maximum Versoria criterion, another prediction method called SOV-MVKF model is also proposed. Simulation results show that the SOV-KF model and SOV-MVKF model provide higher prediction accuracy compared to traditional Kalman model.
本文提出了一种二阶Volterra (SOV)模型与卡尔曼滤波相结合的预测方法,进一步提高了传统卡尔曼模型在短期交通流预测中的预测精度。交通流数据中可能存在非线性关系,而传统的卡尔曼模型无法处理这一问题。由于二阶Volterra (SOV)滤波器可以处理一般的非线性系统,提出了传统的Kalman与二阶Volterra模型相结合的SOV- kf模型。此外,由于交通流数据并不总是满足高斯假设,传统的最小均方误差(MMSE)准则在非高斯噪声下表现不佳。通过引入最大Versoria准则,提出了另一种预测方法SOV-MVKF模型。仿真结果表明,与传统的卡尔曼模型相比,SOV-KF模型和SOV-MVKF模型具有更高的预测精度。
{"title":"Kalman Filter Using SOV Model with Maximum Versoria Criterion for Short-Term Traffic Flow Forecasting","authors":"Tingting Jiang, Zhao Zhang","doi":"10.1145/3529570.3529579","DOIUrl":"https://doi.org/10.1145/3529570.3529579","url":null,"abstract":"This paper proposes a prediction method by combining second-order Volterra (SOV) model and Kalman filter to further improve prediction accuracy of the traditional Kalman model in short-term traffic flow forecasting. Nonlinear relationship may exist in traffic flow data, but the traditional Kalman model cannot deal with this problem. Due to the second-order Volterra (SOV) filter can deal with a general class of nonlinear systems, the traditional Kalman combines with second-order Volterra model, named SOV-KF model, is presented. Furthermore, since the Gaussian assumption is not always be fulfilled in the traffic flow data and traditional minimum mean square error (MMSE) criterion do not perform well under non-Gaussian noises. By introducing maximum Versoria criterion, another prediction method called SOV-MVKF model is also proposed. Simulation results show that the SOV-KF model and SOV-MVKF model provide higher prediction accuracy compared to traditional Kalman model.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124376267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameter Estimation of LFM Signal Based on Improved Fractional Fourier Transform 基于改进分数阶傅里叶变换的LFM信号参数估计
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529607
Guanyu Qiao, D. Dai, Caikun Zhang
Given the current computational complexity and the unsatisfactory anti-noise performance of the linear frequency modulated (LFM) signal parameter estimation method, this paper proposes a rather innovative and efficient method based on improved fractional Fourier transform (FrFT). This method first obtains the energy distribution of time-frequency domain (TFD) through short-time Fourier transform (STFT), which is used to determine the order search range of FrFT. On the other hand, incoherent accumulation is also employed to improve the anti-noise performance under low signal-to-noise-ratio (SNR) environments. Extensive computer simulations verified that the algorithm displays a good anti-noise performance while reducing the amount of calculation.
针对目前线性调频(LFM)信号参数估计方法计算量大、抗噪性能不理想的问题,提出了一种基于改进分数阶傅立叶变换(FrFT)的新颖高效的参数估计方法。该方法首先通过短时傅里叶变换(STFT)得到时频域(TFD)的能量分布,并以此确定FrFT的阶数搜索范围;另一方面,为了提高低信噪比环境下的抗噪声性能,还采用了非相干积累技术。大量的计算机仿真验证了该算法在减少计算量的同时具有良好的抗噪声性能。
{"title":"Parameter Estimation of LFM Signal Based on Improved Fractional Fourier Transform","authors":"Guanyu Qiao, D. Dai, Caikun Zhang","doi":"10.1145/3529570.3529607","DOIUrl":"https://doi.org/10.1145/3529570.3529607","url":null,"abstract":"Given the current computational complexity and the unsatisfactory anti-noise performance of the linear frequency modulated (LFM) signal parameter estimation method, this paper proposes a rather innovative and efficient method based on improved fractional Fourier transform (FrFT). This method first obtains the energy distribution of time-frequency domain (TFD) through short-time Fourier transform (STFT), which is used to determine the order search range of FrFT. On the other hand, incoherent accumulation is also employed to improve the anti-noise performance under low signal-to-noise-ratio (SNR) environments. Extensive computer simulations verified that the algorithm displays a good anti-noise performance while reducing the amount of calculation.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124819071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A SAR Image Denoising Method for Target Shadow Tracking Task 一种用于目标阴影跟踪的SAR图像去噪方法
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529598
Yankun Huang, Guangcai Sun, M. Xing
The interpretation of Synthetic Aperture Radar (SAR) image is considered to be a challenging task, especially when tracking the target shadow in Video SAR (ViSAR), the speckle noise needs to be considered. Based on this, this paper proposes a SAR image denoising algorithm based on the improved wavelet threshold function. Different from the existing denoising methods, this algorithm combines the characteristics of hard threshold function and soft threshold function in traditional wavelet transform denoising, constructs a new threshold function, and improves the equivalent number of looks (ENL) of denoised SAR image. When the denoised image is applied to the tracking task, the target features are enhanced by k-means algorithm and binarization method, so as to improve the tracking accuracy. Experimental results show that the algorithm improves the tracking accuracy on the basis of ensuring the real-time performance of tracking and makes the tracking task highly robust to the noise of SAR image.
合成孔径雷达(SAR)图像的解译是一项具有挑战性的任务,特别是在视频SAR (ViSAR)中,在跟踪目标阴影时,需要考虑散斑噪声。在此基础上,提出了一种基于改进小波阈值函数的SAR图像去噪算法。与现有的去噪方法不同,该算法结合了传统小波变换去噪中硬阈值函数和软阈值函数的特点,构建了新的阈值函数,提高了去噪后SAR图像的等效外观数(ENL)。将去噪后的图像应用于跟踪任务时,通过k-means算法和二值化方法对目标特征进行增强,从而提高跟踪精度。实验结果表明,该算法在保证跟踪实时性的基础上提高了跟踪精度,并使跟踪任务对SAR图像噪声具有较强的鲁棒性。
{"title":"A SAR Image Denoising Method for Target Shadow Tracking Task","authors":"Yankun Huang, Guangcai Sun, M. Xing","doi":"10.1145/3529570.3529598","DOIUrl":"https://doi.org/10.1145/3529570.3529598","url":null,"abstract":"The interpretation of Synthetic Aperture Radar (SAR) image is considered to be a challenging task, especially when tracking the target shadow in Video SAR (ViSAR), the speckle noise needs to be considered. Based on this, this paper proposes a SAR image denoising algorithm based on the improved wavelet threshold function. Different from the existing denoising methods, this algorithm combines the characteristics of hard threshold function and soft threshold function in traditional wavelet transform denoising, constructs a new threshold function, and improves the equivalent number of looks (ENL) of denoised SAR image. When the denoised image is applied to the tracking task, the target features are enhanced by k-means algorithm and binarization method, so as to improve the tracking accuracy. Experimental results show that the algorithm improves the tracking accuracy on the basis of ensuring the real-time performance of tracking and makes the tracking task highly robust to the noise of SAR image.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114507615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Efficient Cable Surface Defect Detection with Deep Learning 基于深度学习的高效电缆表面缺陷检测
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529595
Guo-Chung Chen, Feng Xu, Guihua Liu, Yanjie Chen, Zhiqiang Liang
Efficient detection of cable surface defects can prevent and reduce the potential dangers in the process of high voltage transmission. In order to achieve efficient detection of cable surface defects and solve the problem of low detection accuracy of small and unobvious defects on cable surface, we propose an efficient cable surface defect detection model with deep learning. Firstly, the lightweight backbone feature extraction network is used to extract the preliminary defect features. Secondly, the parallel convolution module and serial convolution module are designed to obtain abundant defect features and reduce the number of model parameters. Then, the feature fusion module is designed to fuse the shallow features with deep features to enhance the features of small and unobvious defects. Finally, the obtained features are put into the corresponding detection head to get the final prediction results. The experimental results on local cable dataset show that our method achieves favorable trade-off between the accuracy, speed and model size of the cable surface defect detection, which meets the requirements of high accuracy, high speed and small model in industrial application.
对电缆表面缺陷进行有效的检测,可以预防和减少高压输电过程中的潜在危险。为了实现对电缆表面缺陷的高效检测,解决电缆表面小而不明显缺陷检测精度低的问题,我们提出了一种基于深度学习的高效电缆表面缺陷检测模型。首先,利用轻量级骨干特征提取网络提取初步缺陷特征;其次,设计并行卷积模块和串行卷积模块,获取丰富的缺陷特征,减少模型参数数量;然后,设计特征融合模块,将浅特征与深特征融合,增强缺陷小而不明显的特征;最后,将得到的特征输入到相应的检测头中,得到最终的预测结果。在电缆局部数据集上的实验结果表明,该方法在电缆表面缺陷检测的精度、速度和模型尺寸之间取得了较好的平衡,满足了工业应用中高精度、高速度和小模型的要求。
{"title":"Efficient Cable Surface Defect Detection with Deep Learning","authors":"Guo-Chung Chen, Feng Xu, Guihua Liu, Yanjie Chen, Zhiqiang Liang","doi":"10.1145/3529570.3529595","DOIUrl":"https://doi.org/10.1145/3529570.3529595","url":null,"abstract":"Efficient detection of cable surface defects can prevent and reduce the potential dangers in the process of high voltage transmission. In order to achieve efficient detection of cable surface defects and solve the problem of low detection accuracy of small and unobvious defects on cable surface, we propose an efficient cable surface defect detection model with deep learning. Firstly, the lightweight backbone feature extraction network is used to extract the preliminary defect features. Secondly, the parallel convolution module and serial convolution module are designed to obtain abundant defect features and reduce the number of model parameters. Then, the feature fusion module is designed to fuse the shallow features with deep features to enhance the features of small and unobvious defects. Finally, the obtained features are put into the corresponding detection head to get the final prediction results. The experimental results on local cable dataset show that our method achieves favorable trade-off between the accuracy, speed and model size of the cable surface defect detection, which meets the requirements of high accuracy, high speed and small model in industrial application.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125598021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Some Evaluations on Spectrogram Art Communications Exchanging Secret Visual Messages 对交换秘密视觉信息的谱图艺术传播的若干评价
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529583
N. Aoki, Kenichi Ikeda, H. Yasuda, Ying Shen, Jin Hou
This study investigates the possibility of visual communications using spectrogram arts drawn on sound signals. This approach attempts to increase the readability of the communications so that their messages are directly understandable by human users without any advanced decoders. This paper describes some pilot studies of our spectrogram art communication system that exchanges text messages in some chatting services. The experimental results indicate that the proposed technique may have certain appropriateness. It may be employed as a sort of steganography techniques enabling sub-channel communications in a covert manner.
本研究探讨了利用声音信号绘制的频谱图艺术进行视觉交流的可能性。这种方法试图增加通信的可读性,这样它们的消息就可以被人类用户直接理解,而不需要任何高级解码器。本文介绍了我们的谱图艺术通信系统在一些聊天服务中交换文本信息的一些初步研究。实验结果表明,该方法具有一定的适用性。它可以作为一种隐写技术,以隐蔽的方式实现子信道通信。
{"title":"Some Evaluations on Spectrogram Art Communications Exchanging Secret Visual Messages","authors":"N. Aoki, Kenichi Ikeda, H. Yasuda, Ying Shen, Jin Hou","doi":"10.1145/3529570.3529583","DOIUrl":"https://doi.org/10.1145/3529570.3529583","url":null,"abstract":"This study investigates the possibility of visual communications using spectrogram arts drawn on sound signals. This approach attempts to increase the readability of the communications so that their messages are directly understandable by human users without any advanced decoders. This paper describes some pilot studies of our spectrogram art communication system that exchanges text messages in some chatting services. The experimental results indicate that the proposed technique may have certain appropriateness. It may be employed as a sort of steganography techniques enabling sub-channel communications in a covert manner.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"155 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116373675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion Constrained Least Mean M-estimate Algorithm for Adaptive Networks 自适应网络的扩散约束最小均值m估计算法
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529601
Wenjing Xu, Haiquan Zhao
Distributed adaptive networks are widely used in many fields. Most of the existing distributed adaptive algorithms are designed to solve the problem of network optimization under unconstrained conditions. However, in actual situations, there exist some network optimization problem under constrained conditions need to be solved, and considering that the distributed network is usually interfered by impulsive noise, a novel diffusion algorithm called diffusion constrained least mean M-estimate (D-CLMM) is proposed by using the modified Huber (MH) function, which can provide robust learning ability when network is disturbed by impulsive interference. Finally, the performance of the proposed algorithm is verified under different non-Gaussian noise environments. Simulation results show that the D-CLMM algorithm performs better than the diffusion-constrained least mean square algorithm (D-CLMS) based on mean square error (MSE) criterion.
分布式自适应网络广泛应用于许多领域。现有的分布式自适应算法大多是为了解决无约束条件下的网络优化问题。然而,在实际情况中,存在一些约束条件下的网络优化问题需要解决,并且考虑到分布式网络经常受到脉冲噪声的干扰,利用改进的Huber (MH)函数提出了一种新的扩散约束最小均值m估计(D-CLMM)算法,该算法在网络受到脉冲干扰时能够提供鲁棒的学习能力。最后,在不同的非高斯噪声环境下验证了该算法的性能。仿真结果表明,D-CLMM算法优于基于均方误差(MSE)准则的扩散约束最小均方算法(D-CLMS)。
{"title":"Diffusion Constrained Least Mean M-estimate Algorithm for Adaptive Networks","authors":"Wenjing Xu, Haiquan Zhao","doi":"10.1145/3529570.3529601","DOIUrl":"https://doi.org/10.1145/3529570.3529601","url":null,"abstract":"Distributed adaptive networks are widely used in many fields. Most of the existing distributed adaptive algorithms are designed to solve the problem of network optimization under unconstrained conditions. However, in actual situations, there exist some network optimization problem under constrained conditions need to be solved, and considering that the distributed network is usually interfered by impulsive noise, a novel diffusion algorithm called diffusion constrained least mean M-estimate (D-CLMM) is proposed by using the modified Huber (MH) function, which can provide robust learning ability when network is disturbed by impulsive interference. Finally, the performance of the proposed algorithm is verified under different non-Gaussian noise environments. Simulation results show that the D-CLMM algorithm performs better than the diffusion-constrained least mean square algorithm (D-CLMS) based on mean square error (MSE) criterion.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131038824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
CCTV Latent Representations for Reducing Accident Response Times 减少事故响应时间的CCTV潜在表征
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529582
Shafinul Haque
Emergency Medical Services’ response times to accidents are crucial to saving lives in vehicle accidents. Using deep learning to instantly detect accidents in public cameras and automatically alerting authorities could help this issue. However, this would require a large set of data on public cameras to train on, but this type of data hardly exists in a usable form. Current deep learning approaches to vehicle accidents typically use first-person cameras, which are not helpful for reducing response time as we do not have access to these cameras at all times. Also, public cameras such as closed-circuit television (CCTV) pick up a much larger amount of street activity than private cameras. Thus, we create a video dataset from live closed-circuit television, so we have access to the cameras at all times. We annotate the videos with metadata to help with future trend prediction as well as give further information for each video, as they are unlabeled. We create an unsupervised learning model to train on this video dataset, and visualize latent space representations of this data in order to cluster different types of street activity and pinpoint vehicle accidents. 1
紧急医疗服务对事故的反应时间对于挽救交通事故中的生命至关重要。利用深度学习技术即时检测公共摄像头中的事故,并自动向当局发出警报,可能有助于解决这一问题。然而,这将需要公共摄像机上的大量数据来进行训练,但这类数据几乎不存在可用的形式。目前处理交通事故的深度学习方法通常使用第一人称摄像头,这对减少响应时间没有帮助,因为我们并不是随时都能使用这些摄像头。此外,闭路电视(CCTV)等公共摄像头比私人摄像头捕捉到更多的街头活动。因此,我们从现场闭路电视中创建了一个视频数据集,这样我们就可以随时访问摄像机。我们用元数据注释视频,以帮助预测未来的趋势,并为每个视频提供进一步的信息,因为它们是未标记的。我们创建了一个无监督学习模型来训练这个视频数据集,并将这些数据的潜在空间表示可视化,以便聚类不同类型的街道活动并查明车辆事故。1
{"title":"CCTV Latent Representations for Reducing Accident Response Times","authors":"Shafinul Haque","doi":"10.1145/3529570.3529582","DOIUrl":"https://doi.org/10.1145/3529570.3529582","url":null,"abstract":"Emergency Medical Services’ response times to accidents are crucial to saving lives in vehicle accidents. Using deep learning to instantly detect accidents in public cameras and automatically alerting authorities could help this issue. However, this would require a large set of data on public cameras to train on, but this type of data hardly exists in a usable form. Current deep learning approaches to vehicle accidents typically use first-person cameras, which are not helpful for reducing response time as we do not have access to these cameras at all times. Also, public cameras such as closed-circuit television (CCTV) pick up a much larger amount of street activity than private cameras. Thus, we create a video dataset from live closed-circuit television, so we have access to the cameras at all times. We annotate the videos with metadata to help with future trend prediction as well as give further information for each video, as they are unlabeled. We create an unsupervised learning model to train on this video dataset, and visualize latent space representations of this data in order to cluster different types of street activity and pinpoint vehicle accidents. 1","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132323442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Quantization Matrices for Image Coding by Machine Learning 基于机器学习的图像编码量化矩阵改进
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529590
Wei Ke, Ka‐Hou Chan
We investigate the generation of quantization matrices for image coding in the scenario to balance compression ratio and quality. We make use of machine learning to train and determine those quantization matrices that can achieve the best compression ratio while reaching the quality settings. By introducing the trainable parameters and considering the impact of the quantization module on task performance and compression ratio, the DCT and quantization modules are jointly optimized to minimize the total coding cost. We evaluate the well-trained quantization matrices under various quality settings of JPEG. The results indicate that the proposed scheme can be combined with quality settings to consistently achieve better compression performance.
我们研究了图像编码场景中量化矩阵的生成,以平衡压缩比和质量。我们利用机器学习来训练和确定那些量化矩阵,这些量化矩阵可以在达到质量设置的同时获得最佳压缩比。通过引入可训练参数,考虑量化模块对任务性能和压缩比的影响,对DCT和量化模块进行联合优化,使总编码成本最小。我们在不同的JPEG质量设置下评估训练良好的量化矩阵。结果表明,该方案可以与质量设置相结合,以一致地获得更好的压缩性能。
{"title":"Improving Quantization Matrices for Image Coding by Machine Learning","authors":"Wei Ke, Ka‐Hou Chan","doi":"10.1145/3529570.3529590","DOIUrl":"https://doi.org/10.1145/3529570.3529590","url":null,"abstract":"We investigate the generation of quantization matrices for image coding in the scenario to balance compression ratio and quality. We make use of machine learning to train and determine those quantization matrices that can achieve the best compression ratio while reaching the quality settings. By introducing the trainable parameters and considering the impact of the quantization module on task performance and compression ratio, the DCT and quantization modules are jointly optimized to minimize the total coding cost. We evaluate the well-trained quantization matrices under various quality settings of JPEG. The results indicate that the proposed scheme can be combined with quality settings to consistently achieve better compression performance.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125549514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Interpretable Analysis and Pruning of Modulation Recognition Network Based on Deep Learning 基于深度学习的调制识别网络可解释性分析与剪枝
Pub Date : 2022-02-25 DOI: 10.1145/3529570.3529577
Fan Ni, Min Luo
Concerning poor interpretability and complexity of deep model in modulation recognition (MR) based on deep learning, an interpretable analysis and pruning framework of MR network based on Gradient-weighted Class Activation Mapping (Grad-CAM) is accordingly proposed in this paper. The framework first extracts the amplitude, phase and spectrum from the original modulated signal, and it uses the Smoothed Pseudo Wigner-Ville Distribution (SPWVD) to obtain the two-dimensional time-frequency spectrum of the modulated signal. Then, the key features in the deep model are visualized from the perspective of one-dimensional features and two-dimensional features at the input respectively. The framework visually displays and compares the differences and commonalities of the depth features of hidden layer with different models, extract the values of different filters of each layer in the deep neural network (DNN), and prune the network according to the values. The experiment results show that the interpretable and pruning framework of MR network based on Grad-CAM in this paper can achieve effective explanation and analysis on the MR network, and can greatly reduce the redundancy of the network. The running speed of the pruned network is 3.83 times higher than that of the original network. The size of the pruned network is 72% lower than that of the original network. Besides, the accuracy of the pruned network is 0.3% higher than that of the original network.
针对基于深度学习的调制识别(MR)中深度模型可解释性差、复杂性大的问题,提出了一种基于梯度加权类激活映射(Grad-CAM)的MR网络可解释性分析与剪枝框架。该框架首先从原始调制信号中提取幅值、相位和频谱,然后利用平滑伪Wigner-Ville分布(SPWVD)得到调制信号的二维时频频谱。然后,分别从输入的一维特征和二维特征的角度对深度模型中的关键特征进行可视化。该框架可视化地显示和比较不同模型下隐藏层深度特征的差异和共性,提取深度神经网络(DNN)中每层不同过滤器的值,并根据这些值对网络进行修剪。实验结果表明,本文提出的基于Grad-CAM的核磁共振网络可解释和剪枝框架能够对核磁共振网络进行有效的解释和分析,并能大大降低网络的冗余度。修剪后的网络运行速度是原始网络运行速度的3.83倍。修剪后的网络比原来的网络小72%。此外,修剪后的网络比原始网络的准确率提高了0.3%。
{"title":"Interpretable Analysis and Pruning of Modulation Recognition Network Based on Deep Learning","authors":"Fan Ni, Min Luo","doi":"10.1145/3529570.3529577","DOIUrl":"https://doi.org/10.1145/3529570.3529577","url":null,"abstract":"Concerning poor interpretability and complexity of deep model in modulation recognition (MR) based on deep learning, an interpretable analysis and pruning framework of MR network based on Gradient-weighted Class Activation Mapping (Grad-CAM) is accordingly proposed in this paper. The framework first extracts the amplitude, phase and spectrum from the original modulated signal, and it uses the Smoothed Pseudo Wigner-Ville Distribution (SPWVD) to obtain the two-dimensional time-frequency spectrum of the modulated signal. Then, the key features in the deep model are visualized from the perspective of one-dimensional features and two-dimensional features at the input respectively. The framework visually displays and compares the differences and commonalities of the depth features of hidden layer with different models, extract the values of different filters of each layer in the deep neural network (DNN), and prune the network according to the values. The experiment results show that the interpretable and pruning framework of MR network based on Grad-CAM in this paper can achieve effective explanation and analysis on the MR network, and can greatly reduce the redundancy of the network. The running speed of the pruned network is 3.83 times higher than that of the original network. The size of the pruned network is 72% lower than that of the original network. Besides, the accuracy of the pruned network is 0.3% higher than that of the original network.","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121274068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 6th International Conference on Digital Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1