首页 > 最新文献

Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence最新文献

英文 中文
Automatic segmentation for meniscus magnetic resonance images of knee joint based on Mask region-based convolution neural network 基于掩模区域卷积神经网络的膝关节半月板磁共振图像自动分割
Liyan Zhang, Hao Zhou, Juan Wang, Lei Wang, Cheng-yi Xia
Over the past two decades, magnetic resonance imaging (MRI) has been widely applied into the diagnosis of knee joint diseases. Due to the complexity and diversity of MRI data, traditional feature extraction requires manual searching for features to segment meniscus, and the final segmentation results still need to be further filtered. Therefore, it is necessary to design a novel method to automatically extract features directly from images. In this study, we develop a framework to implement this goal by using a mask region-based convolution neural network (Mask R-CNN) without manual intervention. In order to highlight the proportion of meniscus, we first preprocess the original image data so that it is reduced to about 1/8 of the original size, and then input the preprocessed image data into the trained Mask R-CNN. Afterwards, transfer learning is used to generate the weight of our network. By testing 1000 images, the mean intersection over union (IOU) and dice similarity coefficient (DSC) are up to 83.68% and 91.13%, respectively. The current results demonstrate that our approach is feasible and has a potential significance in clinical practice.
近二十年来,磁共振成像(MRI)已广泛应用于膝关节疾病的诊断。由于MRI数据的复杂性和多样性,传统的特征提取需要人工搜索特征来分割半月板,最终的分割结果还需要进一步滤波。因此,有必要设计一种新的方法来直接从图像中自动提取特征。在本研究中,我们开发了一个框架,通过使用基于掩模区域的卷积神经网络(mask R-CNN)来实现这一目标,而无需人工干预。为了突出半月板的比例,我们首先对原始图像数据进行预处理,使其减小到原始尺寸的1/8左右,然后将预处理后的图像数据输入到训练好的Mask R-CNN中。然后,使用迁移学习来生成我们网络的权值。通过对1000幅图像的测试,平均交联(IOU)和骰子相似系数(DSC)分别达到83.68%和91.13%。目前的结果表明,我们的方法是可行的,在临床实践中具有潜在的意义。
{"title":"Automatic segmentation for meniscus magnetic resonance images of knee joint based on Mask region-based convolution neural network","authors":"Liyan Zhang, Hao Zhou, Juan Wang, Lei Wang, Cheng-yi Xia","doi":"10.1145/3507548.3507556","DOIUrl":"https://doi.org/10.1145/3507548.3507556","url":null,"abstract":"Over the past two decades, magnetic resonance imaging (MRI) has been widely applied into the diagnosis of knee joint diseases. Due to the complexity and diversity of MRI data, traditional feature extraction requires manual searching for features to segment meniscus, and the final segmentation results still need to be further filtered. Therefore, it is necessary to design a novel method to automatically extract features directly from images. In this study, we develop a framework to implement this goal by using a mask region-based convolution neural network (Mask R-CNN) without manual intervention. In order to highlight the proportion of meniscus, we first preprocess the original image data so that it is reduced to about 1/8 of the original size, and then input the preprocessed image data into the trained Mask R-CNN. Afterwards, transfer learning is used to generate the weight of our network. By testing 1000 images, the mean intersection over union (IOU) and dice similarity coefficient (DSC) are up to 83.68% and 91.13%, respectively. The current results demonstrate that our approach is feasible and has a potential significance in clinical practice.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124078459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Margin Ranking for Supervised Cross-modal Retrieval 监督跨模态检索的自适应余量排序
Tianyuan Xu, Xueliang Liu
Cross-modal retrieval is to achieve flexible query between different modalities. Many approaches solve the problem by learning a common feature space under to separate the multimodal instances from different categories. But it is challenge to design an effective projecting function. In this paper, we propose a novel cross-modal retrieval method, called Adaptive Margin Ranking for Supervised Cross-modal Retrieval (AMRS). In the solution, we design a neural network as the nonlinear mapping function. To maximize the discrimination of multimodal feature in common representation space, we keep away the samples with different semantic by an adaptive margin, and jointly force the modality invariance to eliminate cross-modal discrepancy. Experimental results on widely used benchmark datasets show that the proposed method is effective in cross-modal learning.
跨模态检索是为了实现不同模态之间的灵活查询。许多方法通过学习一个公共特征空间来分离不同类别的多模态实例来解决这个问题。但如何设计有效的投影功能是一个挑战。在本文中,我们提出了一种新的跨模态检索方法,称为自适应边际排序的监督跨模态检索(AMRS)。在解决方案中,我们设计了一个神经网络作为非线性映射函数。为了最大限度地提高公共表示空间中多模态特征的识别率,我们通过自适应边界将不同语义的样本隔离,并共同强制模态不变性以消除跨模态差异。在广泛使用的基准数据集上的实验结果表明,该方法在跨模态学习中是有效的。
{"title":"Adaptive Margin Ranking for Supervised Cross-modal Retrieval","authors":"Tianyuan Xu, Xueliang Liu","doi":"10.1145/3507548.3507599","DOIUrl":"https://doi.org/10.1145/3507548.3507599","url":null,"abstract":"Cross-modal retrieval is to achieve flexible query between different modalities. Many approaches solve the problem by learning a common feature space under to separate the multimodal instances from different categories. But it is challenge to design an effective projecting function. In this paper, we propose a novel cross-modal retrieval method, called Adaptive Margin Ranking for Supervised Cross-modal Retrieval (AMRS). In the solution, we design a neural network as the nonlinear mapping function. To maximize the discrimination of multimodal feature in common representation space, we keep away the samples with different semantic by an adaptive margin, and jointly force the modality invariance to eliminate cross-modal discrepancy. Experimental results on widely used benchmark datasets show that the proposed method is effective in cross-modal learning.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121385688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Random Polygon Cover for Oracle Bone Character Recognition 随机多边形覆盖的甲骨文字符识别
Liu Dazheng
Deep Convolutional neural networks are widely used in computer vision research because of their good feature extraction ability, which can often show good performance in related tasks. Performance of deep convolution network models is not only related to its own architecture design, but also closely related to training data. When training model with large dataset and the images are clear and noise in images is less, it can get good result. But in the case of small dataset and low image quality, it is easy to appear that the model can fit the data in the training process and perform badly in testing, that is, overfitting problem. Our work proposes random polygon cover algorithm to simulate the possible damage object and partial content loss in training dataset, which is also a data augmentation technique. We'll use experiments to prove the effectiveness of this approach, while trying to reveal how data augmentation works and how our method differs from dropout.
深度卷积神经网络由于其良好的特征提取能力,在计算机视觉研究中得到了广泛的应用,在相关任务中往往能表现出良好的性能。深度卷积网络模型的性能不仅与其自身的架构设计有关,而且与训练数据密切相关。当训练模型数据量大、图像清晰、图像噪声少时,可以得到较好的训练效果。但在数据集小、图像质量低的情况下,很容易出现模型在训练过程中能够拟合数据而在测试中表现不佳的情况,即过拟合问题。我们的工作提出了随机多边形覆盖算法来模拟训练数据集中可能的损坏对象和部分内容丢失,这也是一种数据增强技术。我们将使用实验来证明这种方法的有效性,同时试图揭示数据增强是如何工作的,以及我们的方法与dropout有何不同。
{"title":"Random Polygon Cover for Oracle Bone Character Recognition","authors":"Liu Dazheng","doi":"10.1145/3507548.3507569","DOIUrl":"https://doi.org/10.1145/3507548.3507569","url":null,"abstract":"Deep Convolutional neural networks are widely used in computer vision research because of their good feature extraction ability, which can often show good performance in related tasks. Performance of deep convolution network models is not only related to its own architecture design, but also closely related to training data. When training model with large dataset and the images are clear and noise in images is less, it can get good result. But in the case of small dataset and low image quality, it is easy to appear that the model can fit the data in the training process and perform badly in testing, that is, overfitting problem. Our work proposes random polygon cover algorithm to simulate the possible damage object and partial content loss in training dataset, which is also a data augmentation technique. We'll use experiments to prove the effectiveness of this approach, while trying to reveal how data augmentation works and how our method differs from dropout.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127217877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning Static and Dynamic Movie Attributes for Box Office Prediction 用于票房预测的深度学习静态和动态电影属性
Linxi Chen
The daily audience data and static movie attributes are both important factors that influence the movie's succeeding box offices. This paper proposes the first framework that utilizes both daily audience data and static movie attributes to accurately the performance of movies’ box-office prediction. To use the daily audience data dynamics, we utilized the recent proposed rank pooling strategy to encode multi-scale audience data dynamics. Meanwhile, we also consider 15 static movie attributes. Both static and dynamic features are combined in a multi-stream residual network for box-office prediction. The experiments conducted on the dataset that contains 120 movies’ daily audience data show that the proposed multi-scale dynamic encoding achieved promising results in prediction the next one- or two-days’ box office while the static-dynamic fusion model achieved the best performance under all conditions
日常的观众数据和静态的电影属性都是影响电影后续票房的重要因素。本文提出了第一个利用日常观众数据和静态电影属性来准确预测电影票房的框架。为了利用日常受众数据动态,我们利用最近提出的秩池策略对多尺度受众数据动态进行编码。同时,我们还考虑了15个静态电影属性。在多流残差网络中结合静态和动态特征进行票房预测。在包含120部电影每日观众数据的数据集上进行的实验表明,所提出的多尺度动态编码在预测未来一天或两天的票房方面取得了很好的效果,而静态-动态融合模型在所有条件下都取得了最好的性能
{"title":"Deep Learning Static and Dynamic Movie Attributes for Box Office Prediction","authors":"Linxi Chen","doi":"10.1145/3507548.3507610","DOIUrl":"https://doi.org/10.1145/3507548.3507610","url":null,"abstract":"The daily audience data and static movie attributes are both important factors that influence the movie's succeeding box offices. This paper proposes the first framework that utilizes both daily audience data and static movie attributes to accurately the performance of movies’ box-office prediction. To use the daily audience data dynamics, we utilized the recent proposed rank pooling strategy to encode multi-scale audience data dynamics. Meanwhile, we also consider 15 static movie attributes. Both static and dynamic features are combined in a multi-stream residual network for box-office prediction. The experiments conducted on the dataset that contains 120 movies’ daily audience data show that the proposed multi-scale dynamic encoding achieved promising results in prediction the next one- or two-days’ box office while the static-dynamic fusion model achieved the best performance under all conditions","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126437496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and implementation of CTD profile observation data accumulation system based on MySQL 基于MySQL的CTD剖面观测数据积累系统的设计与实现
Xing-min Li, Tao Dong, Xin-peng Wang, Li-Shan Ma
In order to realize the accumulation of sea temperature, conductivity, pressure, depth, salinity, density and sound profile observation data, a ocean observation data storage system based on database is designed. For efficiently realizing accumulation of ocean observation data, the system makes full use of the advantages of MYSQL database management platform, and orderly stores tens of thousands ofConductivity-Temperature-Depth(CTD) profile observation data. At the same time, the data quality control will be applied to the received hydrological observation data, which improves the quality of the data stored in the database and enhances the data usability. After the system was developed, a simulation environment was set up to test the system. The results show that the system hasrational function design and stable operation, which can well realize the accumulation of CTD profile observation data. The realization of ocean hydrologic profile observation data accumulation provides reliable data source for the subsequent in-depth data-mining and utilization of ocean hydrologic observation data.
为了实现海温、电导率、压力、深度、盐度、密度和声廓线观测数据的积累,设计了基于数据库的海洋观测数据存储系统。为高效实现海洋观测数据的积累,系统充分利用MYSQL数据库管理平台的优势,有序存储数万条CTD剖面观测数据。同时对接收到的水文观测数据进行数据质量控制,提高了数据库存储数据的质量,增强了数据的可用性。系统开发完成后,搭建了仿真环境对系统进行了测试。结果表明,该系统功能设计合理,运行稳定,能够很好地实现CTD剖面观测数据的积累。海洋水文剖面观测数据积累的实现,为后续海洋水文观测数据的深度挖掘和利用提供了可靠的数据源。
{"title":"Design and implementation of CTD profile observation data accumulation system based on MySQL","authors":"Xing-min Li, Tao Dong, Xin-peng Wang, Li-Shan Ma","doi":"10.1145/3507548.3507590","DOIUrl":"https://doi.org/10.1145/3507548.3507590","url":null,"abstract":"In order to realize the accumulation of sea temperature, conductivity, pressure, depth, salinity, density and sound profile observation data, a ocean observation data storage system based on database is designed. For efficiently realizing accumulation of ocean observation data, the system makes full use of the advantages of MYSQL database management platform, and orderly stores tens of thousands ofConductivity-Temperature-Depth(CTD) profile observation data. At the same time, the data quality control will be applied to the received hydrological observation data, which improves the quality of the data stored in the database and enhances the data usability. After the system was developed, a simulation environment was set up to test the system. The results show that the system hasrational function design and stable operation, which can well realize the accumulation of CTD profile observation data. The realization of ocean hydrologic profile observation data accumulation provides reliable data source for the subsequent in-depth data-mining and utilization of ocean hydrologic observation data.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126479519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Text Recommendation Algorithm Fused with BERT Semantic Information 融合BERT语义信息的文本推荐算法
Xingyun Xie, Zifeng Ren, Yuming Gu, Chengwen Zhang
Faced with the problem of text recommendation with massive data on the Internet, the use of a recommendation method based on deep learning combined with semantic information will improve the accuracy of the recommendation results. Therefore, we propose a HyReB (Hybrid Recommendation algorithm combining BERT and CNN network). The algorithm HyReB uses the BERT word vector as the input of the CNN network and incorporates external semantic information in features extraction and topic classification. Then we combine BERT and TextRank algorithms to extract text keywords and calculate the BERT word vector similarity of topic word. Finally, we do the weighted calculation of the label proportion of the recommended text and the similarity of the topic word vector to make the text top-N recommendation. The HyReB algorithm makes user interest extraction more refined and incorporates BERT semantic information into the text recommendation. Experiments show that the feature extraction of HyReB is more accurate and has a better recommendation effect when performing small-scale accurate text recommendation.
面对互联网上海量数据的文本推荐问题,采用基于深度学习与语义信息相结合的推荐方法,将会提高推荐结果的准确性。因此,我们提出了一种HyReB(结合BERT和CNN网络的混合推荐算法)。HyReB算法使用BERT词向量作为CNN网络的输入,并在特征提取和主题分类中加入外部语义信息。然后结合BERT和TextRank算法提取文本关键词,计算主题词的BERT词向量相似度。最后,对被推荐文本的标签比例和主题词向量的相似度进行加权计算,得到top-N的推荐文本。HyReB算法使用户兴趣提取更加精细,并将BERT语义信息融入到文本推荐中。实验表明,在进行小规模的精确文本推荐时,HyReB的特征提取更加准确,推荐效果更好。
{"title":"Text Recommendation Algorithm Fused with BERT Semantic Information","authors":"Xingyun Xie, Zifeng Ren, Yuming Gu, Chengwen Zhang","doi":"10.1145/3507548.3507582","DOIUrl":"https://doi.org/10.1145/3507548.3507582","url":null,"abstract":"Faced with the problem of text recommendation with massive data on the Internet, the use of a recommendation method based on deep learning combined with semantic information will improve the accuracy of the recommendation results. Therefore, we propose a HyReB (Hybrid Recommendation algorithm combining BERT and CNN network). The algorithm HyReB uses the BERT word vector as the input of the CNN network and incorporates external semantic information in features extraction and topic classification. Then we combine BERT and TextRank algorithms to extract text keywords and calculate the BERT word vector similarity of topic word. Finally, we do the weighted calculation of the label proportion of the recommended text and the similarity of the topic word vector to make the text top-N recommendation. The HyReB algorithm makes user interest extraction more refined and incorporates BERT semantic information into the text recommendation. Experiments show that the feature extraction of HyReB is more accurate and has a better recommendation effect when performing small-scale accurate text recommendation.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116542942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review of deep learning network 深度学习网络综述
Liming Chen, Bin Xie, YingChun Chen
∗Deep learning is a technology that uses the hierarchical structure of neural network to learn features. It allows computer models with multiple processing layers to learn and represent data like the brain’s perception and understanding of multimodal information, so as to implicitly capture complex large-scale data. The whole system of deep learning network forms a hierarchical and powerful feature representation structure, which enables it to analyze and extract useful knowledge from a large amount of data. This paper mainly introduces the development and application of supervised convolution neural network, unsupervised convolution neural network and generative countermeasure network, and analyzes the research status and challenges of deep learning network. Through the review and introduction of important papers on deep learning network, it provides researchers with accessible scientific research materials.
*深度学习是一种利用神经网络的层次结构来学习特征的技术。它允许具有多个处理层的计算机模型像大脑对多模态信息的感知和理解一样学习和表示数据,从而隐式捕获复杂的大规模数据。整个深度学习网络系统形成了层次化、功能强大的特征表示结构,使其能够从大量数据中分析和提取有用的知识。本文主要介绍了有监督卷积神经网络、无监督卷积神经网络和生成对策网络的发展和应用,分析了深度学习网络的研究现状和面临的挑战。通过对深度学习网络相关重要论文的综述和介绍,为研究人员提供可获取的科研资料。
{"title":"Review of deep learning network","authors":"Liming Chen, Bin Xie, YingChun Chen","doi":"10.1145/3507548.3507601","DOIUrl":"https://doi.org/10.1145/3507548.3507601","url":null,"abstract":"∗Deep learning is a technology that uses the hierarchical structure of neural network to learn features. It allows computer models with multiple processing layers to learn and represent data like the brain’s perception and understanding of multimodal information, so as to implicitly capture complex large-scale data. The whole system of deep learning network forms a hierarchical and powerful feature representation structure, which enables it to analyze and extract useful knowledge from a large amount of data. This paper mainly introduces the development and application of supervised convolution neural network, unsupervised convolution neural network and generative countermeasure network, and analyzes the research status and challenges of deep learning network. Through the review and introduction of important papers on deep learning network, it provides researchers with accessible scientific research materials.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129841004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Theoretically Accurate Regularization Technique for Matrix Factorization based Recommender Systems 基于矩阵分解的推荐系统的理论精确正则化技术
Hao Wang
Regularization is a popular technique to solve the overfitting problem of machine learning algorithms. Most regularization technique relies on parameter selection of the regularization coefficient. Plug-in method and cross-validation approach are two most common parameter selection approaches for regression methods such as Ridge Regression, Lasso Regression and Kernel Regression. Matrix factorization based recommendation system also has heavy reliance on the regularization technique. Most people select a single scalar value to regularize the user feature vector and item feature vector independently or collectively. In this paper, we prove that such approach of selecting regularization coefficient is invalid, and we provide a theoretically accurate method that outperforms the most widely used approach in both accuracy and fairness metrics.
正则化是解决机器学习算法过拟合问题的常用技术。大多数正则化技术依赖于正则化系数的参数选择。插件法和交叉验证法是岭回归、Lasso回归和核回归等回归方法中最常用的两种参数选择方法。基于矩阵分解的推荐系统也严重依赖正则化技术。大多数人选择单个标量值单独或共同正则化用户特征向量和项目特征向量。在本文中,我们证明了这种选择正则化系数的方法是无效的,并且我们提供了一个理论上准确的方法,在准确性和公平性指标上都优于最广泛使用的方法。
{"title":"Theoretically Accurate Regularization Technique for Matrix Factorization based Recommender Systems","authors":"Hao Wang","doi":"10.1145/3507548.3507587","DOIUrl":"https://doi.org/10.1145/3507548.3507587","url":null,"abstract":"Regularization is a popular technique to solve the overfitting problem of machine learning algorithms. Most regularization technique relies on parameter selection of the regularization coefficient. Plug-in method and cross-validation approach are two most common parameter selection approaches for regression methods such as Ridge Regression, Lasso Regression and Kernel Regression. Matrix factorization based recommendation system also has heavy reliance on the regularization technique. Most people select a single scalar value to regularize the user feature vector and item feature vector independently or collectively. In this paper, we prove that such approach of selecting regularization coefficient is invalid, and we provide a theoretically accurate method that outperforms the most widely used approach in both accuracy and fairness metrics.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127684504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BERT-Based Detection of Sexual Harassment in Dialogues 基于bert的对话中性骚扰检测
Mingrui Yan, Xudong Luo
It tends to become a trend of booking transportation through network equipment with the further integration of the Internet into people's life, and online car-hailing platforms have sprung up. However, a new social crisis has also come with it. Because of the need for platform expansion, most online ride-hailing drivers have not undergone strict professional ethics reviews, which increases the risk of passengers taking the car. Primarily, female users are more susceptible to abuse and harassment and even persecution by drivers due to their disadvantaged position. Unfortunately, this phenomenon is happening every day and even getting worse. Regarding this aspect of supervision, it is difficult for relevant departments to have a more direct management plan. However, it is difficult for relevant departments to give a more natural and effective management plan. Therefore, ensuring the safety of passengers (predominantly female passengers) using online car-hailing becomes particularly important. In the Chinese field, few people try to improve this problem from the perspective of natural language. This work expects to use natural language technology to evaluate the driver's language and determine the degree of potential danger and criminal tendency, thus protecting the passenger and providing evidence for the judicial authorities. We first collected many dialogues between drivers and passengers, then used back translation to expand the corpus. Finally, we adopted various BERT-based model methods to compare and analyze the performance of different variants.
随着互联网进一步融入人们的生活,通过网络设备预约出行将成为一种趋势,网约车平台如雨后春笋般涌现。然而,一个新的社会危机也随之而来。由于平台扩张的需要,大多数网约车司机没有经过严格的职业道德审查,这增加了乘客乘车的风险。首先,由于女性用户的弱势地位,她们更容易受到司机的虐待和骚扰,甚至迫害。不幸的是,这种现象每天都在发生,甚至越来越严重。对于这方面的监管,相关部门很难有更直接的管理方案。然而,相关部门很难给出一个更自然有效的管理方案。因此,确保使用网约车的乘客(主要是女性乘客)的安全变得尤为重要。在汉语领域,很少有人尝试从自然语言的角度来改善这一问题。本工作期望利用自然语言技术对驾驶员的语言进行评估,确定潜在危险程度和犯罪倾向,从而保护乘客,为司法机关提供证据。我们首先收集了许多司机和乘客之间的对话,然后使用反向翻译来扩展语料库。最后,我们采用了各种基于bert的模型方法来比较和分析不同变体的性能。
{"title":"BERT-Based Detection of Sexual Harassment in Dialogues","authors":"Mingrui Yan, Xudong Luo","doi":"10.1145/3507548.3507603","DOIUrl":"https://doi.org/10.1145/3507548.3507603","url":null,"abstract":"It tends to become a trend of booking transportation through network equipment with the further integration of the Internet into people's life, and online car-hailing platforms have sprung up. However, a new social crisis has also come with it. Because of the need for platform expansion, most online ride-hailing drivers have not undergone strict professional ethics reviews, which increases the risk of passengers taking the car. Primarily, female users are more susceptible to abuse and harassment and even persecution by drivers due to their disadvantaged position. Unfortunately, this phenomenon is happening every day and even getting worse. Regarding this aspect of supervision, it is difficult for relevant departments to have a more direct management plan. However, it is difficult for relevant departments to give a more natural and effective management plan. Therefore, ensuring the safety of passengers (predominantly female passengers) using online car-hailing becomes particularly important. In the Chinese field, few people try to improve this problem from the perspective of natural language. This work expects to use natural language technology to evaluate the driver's language and determine the degree of potential danger and criminal tendency, thus protecting the passenger and providing evidence for the judicial authorities. We first collected many dialogues between drivers and passengers, then used back translation to expand the corpus. Finally, we adopted various BERT-based model methods to compare and analyze the performance of different variants.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127844424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Novel Sine Cosine Algorithm for Global Optimization 一种新的正弦余弦全局优化算法
Yuan xia Shen, Chuan hua Zeng, Xiao yan Wang
Sine cosine algorithm (SCA) has a fast convergence speed and is easy to implement. In order to overcome the evolutionary stagnation of swarm, this paper presents a novel SCA (NSCA) in which three learning strategies are used to update individuals and a selection mechanism is developed to guide each individual to choose a proper updating strategy. The selection mechanism is designed by the credit assignment method and Upper Confidence Bound (UCB). The proposed algorithm has been experimentally validated on 18 benchmark functions. Compared with SCA variants and other swarm intelligence algorithms, experimental results show NSCA is competitive in solving most functions.
正弦余弦算法(SCA)收敛速度快,易于实现。为了克服群体进化停滞的问题,本文提出了一种新的集群学习策略(NSCA),该策略使用三种学习策略来更新个体,并建立了一种选择机制来指导每个个体选择合适的更新策略。选择机制采用信用分配法和上置信区间(UCB)设计。该算法在18个基准函数上进行了实验验证。实验结果表明,与SCA变体和其他群智能算法相比,NSCA在求解大多数函数方面具有竞争力。
{"title":"A Novel Sine Cosine Algorithm for Global Optimization","authors":"Yuan xia Shen, Chuan hua Zeng, Xiao yan Wang","doi":"10.1145/3507548.3507579","DOIUrl":"https://doi.org/10.1145/3507548.3507579","url":null,"abstract":"Sine cosine algorithm (SCA) has a fast convergence speed and is easy to implement. In order to overcome the evolutionary stagnation of swarm, this paper presents a novel SCA (NSCA) in which three learning strategies are used to update individuals and a selection mechanism is developed to guide each individual to choose a proper updating strategy. The selection mechanism is designed by the credit assignment method and Upper Confidence Bound (UCB). The proposed algorithm has been experimentally validated on 18 benchmark functions. Compared with SCA variants and other swarm intelligence algorithms, experimental results show NSCA is competitive in solving most functions.","PeriodicalId":414908,"journal":{"name":"Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence","volume":"5 6part2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120843631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1