首页 > 最新文献

Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing最新文献

英文 中文
Research on License Plate Recognition Based on Deep Learning in Complex Scenarios 复杂场景下基于深度学习的车牌识别研究
Yinqing Tang, Benguo Yu, Fengning Liu, Anran Wang
The license plate angle is unfixed, the vehicle position is ununiform, and the picture is not sufficiently illuminated which leads to the decrease of license plate recognition accuracy. In order to improve the accuracy of license plate recognition, a deep learning-based license plate recognition method is proposed. For license plate location problem, YOLOv3 algorithm is used. The algorithm is more capable of recognizing small targets and is suitable for license plate location recognition that requires precise positioning. For the problem of license plate character recognition, the CNN plus multitask recognition method is proposed for recognition. The experimental results show that the accuracy of the license plate recognition method proposed in this paper reaches 96%, and the intelligent license plate recognition is realized.
车牌角度不固定,车辆位置不均匀,图像光照不足,导致车牌识别精度下降。为了提高车牌识别的准确率,提出了一种基于深度学习的车牌识别方法。车牌定位问题采用YOLOv3算法。该算法具有较强的小目标识别能力,适用于需要精确定位的车牌定位识别。针对车牌字符识别问题,提出了CNN +多任务识别方法进行识别。实验结果表明,本文提出的车牌识别方法准确率达到96%,实现了智能车牌识别。
{"title":"Research on License Plate Recognition Based on Deep Learning in Complex Scenarios","authors":"Yinqing Tang, Benguo Yu, Fengning Liu, Anran Wang","doi":"10.1145/3583788.3583805","DOIUrl":"https://doi.org/10.1145/3583788.3583805","url":null,"abstract":"The license plate angle is unfixed, the vehicle position is ununiform, and the picture is not sufficiently illuminated which leads to the decrease of license plate recognition accuracy. In order to improve the accuracy of license plate recognition, a deep learning-based license plate recognition method is proposed. For license plate location problem, YOLOv3 algorithm is used. The algorithm is more capable of recognizing small targets and is suitable for license plate location recognition that requires precise positioning. For the problem of license plate character recognition, the CNN plus multitask recognition method is proposed for recognition. The experimental results show that the accuracy of the license plate recognition method proposed in this paper reaches 96%, and the intelligent license plate recognition is realized.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128262309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Indoor Fingerprint Positioning Method Based on Real 5G Signals 基于5G真实信号的室内指纹定位方法
Changhao Wang, Jin Xi, Changqing Xia, Chi Xu, Yong Duan
Indoor positioning services are being used more and more widely. However, existing indoor positioning techniques cannot simultaneously take into account low cost, ease of use, high precision, and seamless switching between indoor and outdoor positioning. With the maturity of 5G techniques, 5G-based indoor positioning is gradually being paid attention to. 5G-based indoor positioning does not require additional equipment, and supports flexible indoor and outdoor switching under the same system. However, the 5G-related information used in existing research on 5G indoor positioning is not open to users. Therefore, in this paper, we propose an indoor fingerprint positioning method based on measured 5G signals. This method first collects 5G signals in the positioning area, and processes them to form a fingerprint database. Then, a machine learning algorithm is used to match the signal to be located with the fingerprint database to obtain the positioning result. Finally, we conduct experiments in real field, and the experimental result demonstrates that the positioning accuracy of our proposed method can reach 96%.
室内定位服务的应用越来越广泛。然而,现有的室内定位技术无法同时兼顾低成本、易用性、高精度以及室内外定位的无缝切换。随着5G技术的成熟,基于5G的室内定位逐渐受到重视。基于5g的室内定位不需要额外的设备,在同一系统下支持灵活的室内外切换。然而,现有的5G室内定位研究中使用的5G相关信息并没有向用户开放。因此,本文提出了一种基于实测5G信号的室内指纹定位方法。该方法首先采集定位区域内的5G信号,并对其进行处理形成指纹数据库。然后,利用机器学习算法将待定位信号与指纹库进行匹配,得到定位结果。最后,我们进行了实际的现场实验,实验结果表明,我们提出的方法的定位精度可以达到96%。
{"title":"Indoor Fingerprint Positioning Method Based on Real 5G Signals","authors":"Changhao Wang, Jin Xi, Changqing Xia, Chi Xu, Yong Duan","doi":"10.1145/3583788.3583819","DOIUrl":"https://doi.org/10.1145/3583788.3583819","url":null,"abstract":"Indoor positioning services are being used more and more widely. However, existing indoor positioning techniques cannot simultaneously take into account low cost, ease of use, high precision, and seamless switching between indoor and outdoor positioning. With the maturity of 5G techniques, 5G-based indoor positioning is gradually being paid attention to. 5G-based indoor positioning does not require additional equipment, and supports flexible indoor and outdoor switching under the same system. However, the 5G-related information used in existing research on 5G indoor positioning is not open to users. Therefore, in this paper, we propose an indoor fingerprint positioning method based on measured 5G signals. This method first collects 5G signals in the positioning area, and processes them to form a fingerprint database. Then, a machine learning algorithm is used to match the signal to be located with the fingerprint database to obtain the positioning result. Finally, we conduct experiments in real field, and the experimental result demonstrates that the positioning accuracy of our proposed method can reach 96%.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129176854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Neural Network Optimization Objective Vector Representation based on Genetic Algorithm and Its Multi-objective Optimization Method 基于遗传算法的神经网络优化目标向量表示及其多目标优化方法
Yunke Xiong, Qun Hou, Xin Liu
Deep learning algorithms mostly have network parameters that can affect their training results, and the combination of neural network architectures also has a significant impact on the algorithm performance. The performance of deep learning algorithms is usually proportional to the overall number of network parameters, leading to excessive resource consumption for exploring neural network architectures with a large number of hyper-parameters. To solve this problem, a vector representation is proposed which for neural network architectures, and a multi-objective optimization model is established based on genetic algorithms in this paper, and it is short for “NNOO Vector Representation based on GA and Its Optimization Method”. The multi-objective optimization model can automatically optimize the neural network architecture and hyper-parameters in the network, improve the network accuracy, and reduce the overall number of network parameters. It is shown in the test results with the MNIST data set, and the accuracy is 95.61% for the traditional empirical setting network, and the average accuracy is 86.2% for the network optimized by TensorFlow’s optimization algorithm. While the network accuracy is improved to 96.86% with the proposed optimization method in this paper and the network parameters are reduced by 32.6% compared with the traditional empirical network, and the network parameters are reduced by13.2% compared with the network by TensorFlow’s optimization algorithm. Therefore, the method is presented which has obvious practical application value in neural network optimization problems and provides a new way of thinking for large and deep network optimization problems.
深度学习算法大多具有影响其训练结果的网络参数,神经网络架构的组合对算法性能也有显著影响。深度学习算法的性能通常与网络参数的总体数量成正比,这导致在探索具有大量超参数的神经网络架构时资源消耗过多。为了解决这一问题,本文提出了一种面向神经网络架构的向量表示,并建立了一种基于遗传算法的多目标优化模型,简称“基于遗传算法的NNOO向量表示及其优化方法”。多目标优化模型可以自动优化网络中的神经网络结构和超参数,提高网络精度,减少网络参数总数。在MNIST数据集上的测试结果表明,传统经验设置网络的准确率为95.61%,TensorFlow优化算法优化后的网络平均准确率为86.2%。而本文提出的优化方法将网络准确率提高到96.86%,与传统经验网络相比,网络参数降低了32.6%,与使用TensorFlow优化算法的网络相比,网络参数降低了13.2%。因此,该方法在神经网络优化问题中具有明显的实际应用价值,为大型深度网络优化问题提供了一种新的思路。
{"title":"Neural Network Optimization Objective Vector Representation based on Genetic Algorithm and Its Multi-objective Optimization Method","authors":"Yunke Xiong, Qun Hou, Xin Liu","doi":"10.1145/3583788.3583796","DOIUrl":"https://doi.org/10.1145/3583788.3583796","url":null,"abstract":"Deep learning algorithms mostly have network parameters that can affect their training results, and the combination of neural network architectures also has a significant impact on the algorithm performance. The performance of deep learning algorithms is usually proportional to the overall number of network parameters, leading to excessive resource consumption for exploring neural network architectures with a large number of hyper-parameters. To solve this problem, a vector representation is proposed which for neural network architectures, and a multi-objective optimization model is established based on genetic algorithms in this paper, and it is short for “NNOO Vector Representation based on GA and Its Optimization Method”. The multi-objective optimization model can automatically optimize the neural network architecture and hyper-parameters in the network, improve the network accuracy, and reduce the overall number of network parameters. It is shown in the test results with the MNIST data set, and the accuracy is 95.61% for the traditional empirical setting network, and the average accuracy is 86.2% for the network optimized by TensorFlow’s optimization algorithm. While the network accuracy is improved to 96.86% with the proposed optimization method in this paper and the network parameters are reduced by 32.6% compared with the traditional empirical network, and the network parameters are reduced by13.2% compared with the network by TensorFlow’s optimization algorithm. Therefore, the method is presented which has obvious practical application value in neural network optimization problems and provides a new way of thinking for large and deep network optimization problems.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"503 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115221176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resolving Context Contradictions in the Neural Dialogue System based on Sentiment Information 基于情感信息的神经对话系统语境矛盾消解
Shingo Hanahira, Xin Kang
Chatbots trained on large corpus generate fluent responses, but often suffer from the problem of generating responses that contradict past utterances. Recent research treats dialogue contradiction detection as a task of natural language inference (NLI), and a method to remove contradiction from responses has been proposed and has shown high performance. However, these datasets do not provide explicit information about emotions, and these models cannot capture changes in emotions. In this work, we create a new dataset by explicitly labeling emotional information on an existing contradiction detection dataset and use this dataset to train an NLI model. Furthermore, we train the NLI model on the original dataset as well and compare the accuracy of both in dialogue contradiction detection.
在大型语料库上训练的聊天机器人可以产生流利的反应,但经常遇到与过去的话语相矛盾的问题。近年来的研究将对话矛盾检测作为一项自然语言推理任务,提出了一种消除对话矛盾的方法,并取得了良好的效果。然而,这些数据集不能提供关于情绪的明确信息,而且这些模型不能捕捉情绪的变化。在这项工作中,我们通过在现有的矛盾检测数据集上明确标记情感信息来创建一个新的数据集,并使用该数据集来训练NLI模型。此外,我们还在原始数据集上训练了NLI模型,并比较了两者在对话矛盾检测中的准确性。
{"title":"Resolving Context Contradictions in the Neural Dialogue System based on Sentiment Information","authors":"Shingo Hanahira, Xin Kang","doi":"10.1145/3583788.3583816","DOIUrl":"https://doi.org/10.1145/3583788.3583816","url":null,"abstract":"Chatbots trained on large corpus generate fluent responses, but often suffer from the problem of generating responses that contradict past utterances. Recent research treats dialogue contradiction detection as a task of natural language inference (NLI), and a method to remove contradiction from responses has been proposed and has shown high performance. However, these datasets do not provide explicit information about emotions, and these models cannot capture changes in emotions. In this work, we create a new dataset by explicitly labeling emotional information on an existing contradiction detection dataset and use this dataset to train an NLI model. Furthermore, we train the NLI model on the original dataset as well and compare the accuracy of both in dialogue contradiction detection.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128107619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nighttime Semantic Segmentation with Instance-level Data Augmentation: a Case Study of the Dark Zurich Benchmark 夜间语义分割与实例级数据增强:黑暗苏黎世基准的案例研究
Alex Liu, Zhifeng Xiao
Semantic segmentation has been a core learning task in the autonomous driving technology stack. However, current deep learning-based models do not perform well at nighttime due to the low illumination. In this study, we present an instance-level data augmentation method to increase the quantity and diversity for the low-resource classes to feed more instances of these classes to the training algorithm, with an aim to encourage the model to learn more features and patterns to better distinguish the low-resource classes presented in the original training set. We validate the method on the Dark Zurich dataset, a typical dataset that contains driving scene images taking at daytime e, twilight, and nighttime. We take the ``person'' class as an example and apply the instance-level data augmentation method. Experimental results have shown significant improvement compared to the SOTA, lifting the IoU by 4.52%. The results demonstrate the efficacy of the proposed method, indicating that the augmenting low-resource classes at the instance level is a promising strategy and can be an effective complement alongside other performance boosting methods.
语义分割一直是自动驾驶技术栈中的核心学习任务。然而,目前基于深度学习的模型在夜间由于光照不足而表现不佳。在本研究中,我们提出了一种实例级的数据增强方法,增加低资源类的数量和多样性,从而为训练算法提供更多的低资源类的实例,从而鼓励模型学习更多的特征和模式,从而更好地区分原始训练集中呈现的低资源类。我们在Dark Zurich数据集上验证了该方法,这是一个典型的数据集,包含白天、黄昏和夜间拍摄的驾驶场景图像。以“person”类为例,应用实例级数据增强方法。实验结果表明,与SOTA相比,IoU提高了4.52%。结果证明了所提出方法的有效性,表明在实例级增加低资源类是一种有前途的策略,可以与其他性能提升方法一起有效补充。
{"title":"Nighttime Semantic Segmentation with Instance-level Data Augmentation: a Case Study of the Dark Zurich Benchmark","authors":"Alex Liu, Zhifeng Xiao","doi":"10.1145/3583788.3583814","DOIUrl":"https://doi.org/10.1145/3583788.3583814","url":null,"abstract":"Semantic segmentation has been a core learning task in the autonomous driving technology stack. However, current deep learning-based models do not perform well at nighttime due to the low illumination. In this study, we present an instance-level data augmentation method to increase the quantity and diversity for the low-resource classes to feed more instances of these classes to the training algorithm, with an aim to encourage the model to learn more features and patterns to better distinguish the low-resource classes presented in the original training set. We validate the method on the Dark Zurich dataset, a typical dataset that contains driving scene images taking at daytime e, twilight, and nighttime. We take the ``person'' class as an example and apply the instance-level data augmentation method. Experimental results have shown significant improvement compared to the SOTA, lifting the IoU by 4.52%. The results demonstrate the efficacy of the proposed method, indicating that the augmenting low-resource classes at the instance level is a promising strategy and can be an effective complement alongside other performance boosting methods.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133376453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Cursive Font Recognition Based on Improved Hash Algorithm 基于改进哈希算法的草书字体识别研究
Benguo Yu, Yinqing Tang, Yang Yang
This paper mainly explores the method of Chinese cursive character recognition, establishes the Standard Cursive database in the process of research, and puts forward the similarity distance to measure the similarity between the cursive font to be recognized and the character set, and improves the calculation method of the similarity distance. Through experimental comparison, pH algorithm performs best in cursive character recognition.
本文主要探讨了中国草书字符识别方法,在研究过程中建立了标准草书数据库,提出了相似距离来衡量待识别草书字体与字符集之间的相似度,并对相似距离的计算方法进行了改进。通过实验比较,pH算法在草书字符识别中表现最好。
{"title":"Research on Cursive Font Recognition Based on Improved Hash Algorithm","authors":"Benguo Yu, Yinqing Tang, Yang Yang","doi":"10.1145/3583788.3583808","DOIUrl":"https://doi.org/10.1145/3583788.3583808","url":null,"abstract":"This paper mainly explores the method of Chinese cursive character recognition, establishes the Standard Cursive database in the process of research, and puts forward the similarity distance to measure the similarity between the cursive font to be recognized and the character set, and improves the calculation method of the similarity distance. Through experimental comparison, pH algorithm performs best in cursive character recognition.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127462746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CascadeTransformer: Multi-label Classification with Transformer in Chronic Disease Prediction CascadeTransformer:多标签分类与Transformer在慢性疾病预测中的应用
Bo Zeng, Donghai Zhai, Bo Peng, Y. Yao
Chronic diseases are serious threats to human safety and major public health problems worldwide. Many chronic diseases tend to have co-morbidities. Most machine learning techniques nowadays tend to focus on predicting a single disease while ignoring the study of co-morbidities. It is urgent to develop an artificial intelligence-based multi-label classification model based on patients' physical data, which is useful for the early detection and treatment of patients' diseases. In this study, we proposed a layer-by-layer processing structure, termed CascadeTransformer, that applies the Transformer architecture as weak classifiers, to solve the multi-label prediction problem of chronic diseases. We built a chronic diseases dataset using real-world data from West China Hospital, which consists of 1174 anonymous instances and 131 features. Systematic experiments show that our method shows better experimental performance compared to other methods on our chronic disease dataset.
慢性疾病是对人类安全的严重威胁,也是世界范围内重大的公共卫生问题。许多慢性病往往有合并症。如今,大多数机器学习技术倾向于专注于预测单一疾病,而忽略了对合并症的研究。迫切需要开发一种基于患者身体数据的基于人工智能的多标签分类模型,这有助于患者疾病的早期发现和治疗。在这项研究中,我们提出了一种分层处理结构,称为CascadeTransformer,它将Transformer架构作为弱分类器来解决慢性病的多标签预测问题。我们利用华西医院的真实数据建立了一个慢性病数据集,该数据集由1174个匿名实例和131个特征组成。系统实验表明,在我们的慢性病数据集上,与其他方法相比,我们的方法具有更好的实验性能。
{"title":"CascadeTransformer: Multi-label Classification with Transformer in Chronic Disease Prediction","authors":"Bo Zeng, Donghai Zhai, Bo Peng, Y. Yao","doi":"10.1145/3583788.3583817","DOIUrl":"https://doi.org/10.1145/3583788.3583817","url":null,"abstract":"Chronic diseases are serious threats to human safety and major public health problems worldwide. Many chronic diseases tend to have co-morbidities. Most machine learning techniques nowadays tend to focus on predicting a single disease while ignoring the study of co-morbidities. It is urgent to develop an artificial intelligence-based multi-label classification model based on patients' physical data, which is useful for the early detection and treatment of patients' diseases. In this study, we proposed a layer-by-layer processing structure, termed CascadeTransformer, that applies the Transformer architecture as weak classifiers, to solve the multi-label prediction problem of chronic diseases. We built a chronic diseases dataset using real-world data from West China Hospital, which consists of 1174 anonymous instances and 131 features. Systematic experiments show that our method shows better experimental performance compared to other methods on our chronic disease dataset.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"182 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117170056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning on Spectral Data from Miniature Devices for Food Quality Analysis - A Case Study 用于食品质量分析的微型设备光谱数据的机器学习-一个案例研究
Fayas Asharindavida, O. Nibouche, J. Uhomoibhi, Jun Liu, Hui Wang
Food quality analysis can be carried out by spectral data acquired from spectrometers with its advantage of non-destructive way of testing. Portable and miniature spectroscopy can be a suitable solution when it meets the specifications such as portability, cost, and short processing time requirements, to enable ordinary citizens to use such a device in the fight against food fraud. Compared to more expensive, bulky, and non-portable devices, the data collected using miniature and portable spectrometers is of a lower quality and thus adversely affect the quality of the analysis. Research have been carried out to use machine learning (ML) classifiers on spectral data analysis for food quality assessment. The present work focuses on two aspects: firstly, preliminary exploratory statistical analysis is conducted on the real spectral data on different food products including oils, fruits and spices acquired from such miniature devices, which aims to evaluate and illustrate the distinctive characteristics of such spectral data, data distribution and difference in the spectra across multiple data acquisitions etc. along with a summary of the key challenges to face and explore. Secondly, a case study for the differentiation of extra virgin olive from adulterated with vegetable oil is provided to analyze and evaluate how some commonly used ML classifiers can be used for classification, while the impact of different preprocessing methods to improve the accuracy and efficiency is also provided. The case study demonstrates the good potential of using data analytics for spectral data from miniature device, although the overall performance of those ML classifiers is not exceptional (the classification rates of up to 83.32%) which is partially due to the quality of data, and partially due to limiting to only some classifiers. More elaborate data pre-processing and cleaning methods can be used to address the key challenges of the spectral data from miniature device, and other types of classifiers can be also explored further in future work.
利用光谱仪采集的光谱数据进行食品质量分析,具有无损检测的优点。便携式和微型光谱仪在满足便携性、成本和处理时间短等规格要求时,可以成为一种合适的解决方案,使普通公民能够使用这种设备来打击食品欺诈。与更昂贵、体积更大、不可携带的设备相比,使用微型和便携式光谱仪收集的数据质量较低,从而对分析质量产生不利影响。已经开展了使用机器学习(ML)分类器对光谱数据分析进行食品质量评估的研究。本工作主要集中在两个方面:首先,对该微型装置获取的油脂、水果、香料等不同食品的真实光谱数据进行初步探索性统计分析,评价和说明该光谱数据的鲜明特征、数据分布、多次数据采集的光谱差异等,总结需要面临和探索的关键挑战;其次,以特级初榨橄榄油与掺假植物油的鉴别为例,分析和评价了几种常用的ML分类器如何进行分类,以及不同预处理方法对提高准确率和效率的影响。该案例研究显示了对来自微型设备的光谱数据使用数据分析的良好潜力,尽管这些ML分类器的总体性能并不出色(分类率高达83.32%),这部分是由于数据质量,部分是由于仅限于某些分类器。更精细的数据预处理和清洗方法可以用来解决微型设备光谱数据的关键挑战,其他类型的分类器也可以在未来的工作中进一步探索。
{"title":"Machine Learning on Spectral Data from Miniature Devices for Food Quality Analysis - A Case Study","authors":"Fayas Asharindavida, O. Nibouche, J. Uhomoibhi, Jun Liu, Hui Wang","doi":"10.1145/3583788.3583801","DOIUrl":"https://doi.org/10.1145/3583788.3583801","url":null,"abstract":"Food quality analysis can be carried out by spectral data acquired from spectrometers with its advantage of non-destructive way of testing. Portable and miniature spectroscopy can be a suitable solution when it meets the specifications such as portability, cost, and short processing time requirements, to enable ordinary citizens to use such a device in the fight against food fraud. Compared to more expensive, bulky, and non-portable devices, the data collected using miniature and portable spectrometers is of a lower quality and thus adversely affect the quality of the analysis. Research have been carried out to use machine learning (ML) classifiers on spectral data analysis for food quality assessment. The present work focuses on two aspects: firstly, preliminary exploratory statistical analysis is conducted on the real spectral data on different food products including oils, fruits and spices acquired from such miniature devices, which aims to evaluate and illustrate the distinctive characteristics of such spectral data, data distribution and difference in the spectra across multiple data acquisitions etc. along with a summary of the key challenges to face and explore. Secondly, a case study for the differentiation of extra virgin olive from adulterated with vegetable oil is provided to analyze and evaluate how some commonly used ML classifiers can be used for classification, while the impact of different preprocessing methods to improve the accuracy and efficiency is also provided. The case study demonstrates the good potential of using data analytics for spectral data from miniature device, although the overall performance of those ML classifiers is not exceptional (the classification rates of up to 83.32%) which is partially due to the quality of data, and partially due to limiting to only some classifiers. More elaborate data pre-processing and cleaning methods can be used to address the key challenges of the spectral data from miniature device, and other types of classifiers can be also explored further in future work.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128145241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LDRC: Long-tail Distantly Supervised Relation Extraction via Contrastive Learning 基于对比学习的长尾远程监督关系提取
Tingwei Li, Zhi Wang
Long-tail problem is one of the major challenges in distantly supervised relation extraction. Some recent works on the long-tail problem attempt to transfer knowledge from data-rich and semantically similar head classes to data-poor tail classes using a relation hierarchical tree. These methods, however, are based on the assumption that long-tail and head relations have a strong correlation, which does not always hold true, and the model’s ability to learn long-tail relations is essentially not improved. In this paper, a novel joint learning framework that combines relation extraction and contrastive learning is proposed, allowing the model to directly learn the subtle differences between different categories to improve long-tail relation extraction. Experimental results show that our proposed model outperforms the current state-of-the-art (SOTA) model on various mainstream datasets.
长尾问题是远程监督关系抽取的主要挑战之一。最近关于长尾问题的一些研究尝试使用关系层次树将知识从数据丰富且语义相似的头部类转移到数据贫乏的尾部类。然而,这些方法是基于长尾和头部关系有很强相关性的假设,这并不总是正确的,模型学习长尾关系的能力本质上没有提高。本文提出了一种结合关系提取和对比学习的新型联合学习框架,使模型能够直接学习不同类别之间的细微差异,从而提高长尾关系提取。实验结果表明,我们提出的模型在各种主流数据集上都优于当前最先进的SOTA模型。
{"title":"LDRC: Long-tail Distantly Supervised Relation Extraction via Contrastive Learning","authors":"Tingwei Li, Zhi Wang","doi":"10.1145/3583788.3583804","DOIUrl":"https://doi.org/10.1145/3583788.3583804","url":null,"abstract":"Long-tail problem is one of the major challenges in distantly supervised relation extraction. Some recent works on the long-tail problem attempt to transfer knowledge from data-rich and semantically similar head classes to data-poor tail classes using a relation hierarchical tree. These methods, however, are based on the assumption that long-tail and head relations have a strong correlation, which does not always hold true, and the model’s ability to learn long-tail relations is essentially not improved. In this paper, a novel joint learning framework that combines relation extraction and contrastive learning is proposed, allowing the model to directly learn the subtle differences between different categories to improve long-tail relation extraction. Experimental results show that our proposed model outperforms the current state-of-the-art (SOTA) model on various mainstream datasets.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124765204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-Temporal Deep Fusion Graph Convolutional Networks for Crime Prediction 犯罪预测的时空深度融合图卷积网络
Bingbin Chen, Yong Liao
Effective crime prediction plays a key role in sustaining the stability of society. In recent years, researchers have proposed a number of prediction methods that extract spatial and temporal features separately and fuse afterward. However, the strict distinction between spatial feature extraction and temporal feature extraction can result in the loss of useful information. To this end, we propose a spatio-temporal deep fusion graph convolution network (STDGCN), which embodies the intra-region spatio-temporal features and the inter-region spatio-temporal associations on a single graph. STDGCN performs the convolution without distinguishing between space and time to simultaneously extract spatio-temporal features. Our evaluations of two real-world datasets demonstrate the effectiveness of STDGCN.
有效的犯罪预测对维持社会稳定起着关键作用。近年来,研究人员提出了许多分别提取时空特征并进行融合的预测方法。然而,空间特征提取和时间特征提取的严格区分会导致有用信息的丢失。为此,我们提出了一种时空深度融合图卷积网络(STDGCN),该网络将区域内的时空特征和区域间的时空关联集中在一张图上。STDGCN不区分空间和时间进行卷积,同时提取时空特征。我们对两个真实世界数据集的评估证明了STDGCN的有效性。
{"title":"Spatio-Temporal Deep Fusion Graph Convolutional Networks for Crime Prediction","authors":"Bingbin Chen, Yong Liao","doi":"10.1145/3583788.3583799","DOIUrl":"https://doi.org/10.1145/3583788.3583799","url":null,"abstract":"Effective crime prediction plays a key role in sustaining the stability of society. In recent years, researchers have proposed a number of prediction methods that extract spatial and temporal features separately and fuse afterward. However, the strict distinction between spatial feature extraction and temporal feature extraction can result in the loss of useful information. To this end, we propose a spatio-temporal deep fusion graph convolution network (STDGCN), which embodies the intra-region spatio-temporal features and the inter-region spatio-temporal associations on a single graph. STDGCN performs the convolution without distinguishing between space and time to simultaneously extract spatio-temporal features. Our evaluations of two real-world datasets demonstrate the effectiveness of STDGCN.","PeriodicalId":292167,"journal":{"name":"Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127222486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1