首页 > 最新文献

Icon最新文献

英文 中文
A Framework for Early Detection of Cyberbullying in Chinese-English Code-Mixed Social Media Text Using Natural Language Processing and Machine Learning 基于自然语言处理和机器学习的中英文码混合社交媒体文本网络欺凌早期检测框架
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00061
Carlin Chun-Fai Chu, Raymond So, Simon Siu-Wai Li, Ernest Kan-Lam Kwong, Chun-Hung Chiu
This study develops a new expert system framework to address the issue of early detection of cyberbullying incidents in Chinese-English code-mixed language on social media networks. The framework covers the crawling of session-based social media texts with potential cyberbullying messages with a crowdsourcing web application to systematically retrieve and manually annotate a cyberbullying dataset, and most importantly establishes an explainable artificial intelligence model based on natural language processing algorithm for identification of targeted emotional colloquial slang phrases and machine learning method using Shapley value and transfer learning approach for automatic early detection of cyberbullying incidents in Chinese-English codemixed language.
本研究开发了一个新的专家系统框架,以解决社交媒体网络中中英文码混合语言网络欺凌事件的早期检测问题。该框架涵盖了基于会话的社交媒体文本的爬行,其中包含潜在的网络欺凌信息,通过众包网络应用程序系统地检索和手动注释网络欺凌数据集。最重要的是建立了一个基于自然语言处理算法的可解释的人工智能模型,用于识别有针对性的情感俗语俚语短语,以及基于Shapley值和迁移学习方法的机器学习方法用于自动早期检测汉英码混合语言中的网络欺凌事件。
{"title":"A Framework for Early Detection of Cyberbullying in Chinese-English Code-Mixed Social Media Text Using Natural Language Processing and Machine Learning","authors":"Carlin Chun-Fai Chu, Raymond So, Simon Siu-Wai Li, Ernest Kan-Lam Kwong, Chun-Hung Chiu","doi":"10.1109/ICNLP58431.2023.00061","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00061","url":null,"abstract":"This study develops a new expert system framework to address the issue of early detection of cyberbullying incidents in Chinese-English code-mixed language on social media networks. The framework covers the crawling of session-based social media texts with potential cyberbullying messages with a crowdsourcing web application to systematically retrieve and manually annotate a cyberbullying dataset, and most importantly establishes an explainable artificial intelligence model based on natural language processing algorithm for identification of targeted emotional colloquial slang phrases and machine learning method using Shapley value and transfer learning approach for automatic early detection of cyberbullying incidents in Chinese-English codemixed language.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76823823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ASSA-Net: Semantic Segmentation Network for Point Clouds Based on Adaptive Sampling and Self-Attention 基于自适应采样和自关注的点云语义分割网络
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00018
Da Ai, Ce Xu, Xiaoyang Zhang, Yu Ai, Yansong Bai, Y. Liu
Point cloud semantic segmentation is widely used in scene analysis. We propose a point cloud semantic segmentation network based on adaptive random sampling and self-attention. The network extracts local centroids using random sampling, enriches feature information of the centroids using the proposed adaptive optimization module, and then learns correlations and differences between feature vectors using a feature aggregation module based on the self-attentiveness mechanism to make feature cross-fertilization more adequate, which effectively improves the performance of semantic segmentation. Experimental results on S3DIS show that the network consumes less computing time, but improves the Mean Intersection over Union (mIou) by 14.4% and overall accuracy (oAcc) by 6.4% over the baseline network PointNet++.
点云语义分割在场景分析中有着广泛的应用。提出了一种基于自适应随机采样和自关注的点云语义分割网络。该网络采用随机采样的方法提取局部质心,利用提出的自适应优化模块丰富质心的特征信息,然后利用基于自关注机制的特征聚合模块学习特征向量之间的相关性和差异性,使特征相互作用更加充分,有效地提高了语义分割的性能。在S3DIS上的实验结果表明,该网络消耗的计算时间更少,但比基线网络PointNet++提高了14.4%的平均交联率(mIou)和6.4%的总精度(oAcc)。
{"title":"ASSA-Net: Semantic Segmentation Network for Point Clouds Based on Adaptive Sampling and Self-Attention","authors":"Da Ai, Ce Xu, Xiaoyang Zhang, Yu Ai, Yansong Bai, Y. Liu","doi":"10.1109/ICNLP58431.2023.00018","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00018","url":null,"abstract":"Point cloud semantic segmentation is widely used in scene analysis. We propose a point cloud semantic segmentation network based on adaptive random sampling and self-attention. The network extracts local centroids using random sampling, enriches feature information of the centroids using the proposed adaptive optimization module, and then learns correlations and differences between feature vectors using a feature aggregation module based on the self-attentiveness mechanism to make feature cross-fertilization more adequate, which effectively improves the performance of semantic segmentation. Experimental results on S3DIS show that the network consumes less computing time, but improves the Mean Intersection over Union (mIou) by 14.4% and overall accuracy (oAcc) by 6.4% over the baseline network PointNet++.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84327513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on I-PageRank algorithm model of Process knowledge graph based on K-Shell decomposition algorithm 基于K-Shell分解算法的过程知识图谱I-PageRank算法模型研究
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00082
Yanwei Huo, Hongyu Cheng
PageRank algorithm in the calculation of nodes is equally distributed to the node chain of all nodes, but in the actual production of manufacturing enterprises, the importance of process knowledge in process documents is different, if according to the PageRank algorithm PR value equal transfer to calculate the importance of the artifact, efficiency and accuracy is generally low, so the importance of PR value transfer difference should be considered. Therefore, this paper introduces K-Shell decomposition algorithm in PageRank algorithm, constructs a new I-PageRank algorithm model, adding the importance of each node in the linked network to the PageRank algorithm, which improves the efficiency and accuracy of PageRank algorithm in identifying key nodes.
PageRank算法在计算节点时是均匀分布到节点链的所有节点上,但在实际生产制造企业中,工艺知识在工艺文档中的重要性是不同的,如果按照PageRank算法的PR值相等转移来计算工件的重要性,效率和准确性一般较低,因此应考虑PR值转移的重要性差异。因此,本文在PageRank算法中引入K-Shell分解算法,构建新的I-PageRank算法模型,将链接网络中每个节点的重要性加入到PageRank算法中,提高了PageRank算法识别关键节点的效率和准确性。
{"title":"Research on I-PageRank algorithm model of Process knowledge graph based on K-Shell decomposition algorithm","authors":"Yanwei Huo, Hongyu Cheng","doi":"10.1109/ICNLP58431.2023.00082","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00082","url":null,"abstract":"PageRank algorithm in the calculation of nodes is equally distributed to the node chain of all nodes, but in the actual production of manufacturing enterprises, the importance of process knowledge in process documents is different, if according to the PageRank algorithm PR value equal transfer to calculate the importance of the artifact, efficiency and accuracy is generally low, so the importance of PR value transfer difference should be considered. Therefore, this paper introduces K-Shell decomposition algorithm in PageRank algorithm, constructs a new I-PageRank algorithm model, adding the importance of each node in the linked network to the PageRank algorithm, which improves the efficiency and accuracy of PageRank algorithm in identifying key nodes.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89226952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Mode Detection Based on DRM System 基于DRM系统的鲁棒模式检测
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/icnlp58431.2023.00077
Yani Qiao, Bo Li, Wen Cui, Yuji Li
This paper proposes a mode detection algorithm based on digital radio (DRM) system. DRM system is a digital broadcasting standard including long wave, medium wave and short wave, which is applicable to digital broadcasting below 30dB. By selecting different transmission modes, signals can be effectively transmitted in different channels and under different conditions. DRM system standard defines four different transmission modes (also known as robust mode). The main differences between these modes are the number of subcarriers, subcarrier spacing, pilot and other structures of OFDM symbols. Therefore, on the basis of completing the time synchronization corresponding to the four transmission modes, this paper proposes a mode detection algorithm based on synchronization analysis, which can effectively identify the transmission mode used by the transmission signal when obtaining the transmission signal, so as to better complete the follow-up signal processing work such as synchronization and channel estimation.
提出了一种基于数字无线电(DRM)系统的模式检测算法。DRM系统是一种包含长波、中波和短波的数字广播标准,适用于30dB以下的数字广播。通过选择不同的传输方式,可以在不同的信道、不同的条件下有效地传输信号。DRM系统标准定义了四种不同的传输模式(也称为鲁棒模式)。这些模式之间的主要区别在于OFDM符号的子载波数、子载波间距、导频和其他结构。因此,本文在完成四种传输模式对应的时间同步的基础上,提出了一种基于同步分析的模式检测算法,该算法在获取传输信号时,能够有效识别传输信号所使用的传输模式,从而更好地完成同步、信道估计等后续信号处理工作。
{"title":"Robust Mode Detection Based on DRM System","authors":"Yani Qiao, Bo Li, Wen Cui, Yuji Li","doi":"10.1109/icnlp58431.2023.00077","DOIUrl":"https://doi.org/10.1109/icnlp58431.2023.00077","url":null,"abstract":"This paper proposes a mode detection algorithm based on digital radio (DRM) system. DRM system is a digital broadcasting standard including long wave, medium wave and short wave, which is applicable to digital broadcasting below 30dB. By selecting different transmission modes, signals can be effectively transmitted in different channels and under different conditions. DRM system standard defines four different transmission modes (also known as robust mode). The main differences between these modes are the number of subcarriers, subcarrier spacing, pilot and other structures of OFDM symbols. Therefore, on the basis of completing the time synchronization corresponding to the four transmission modes, this paper proposes a mode detection algorithm based on synchronization analysis, which can effectively identify the transmission mode used by the transmission signal when obtaining the transmission signal, so as to better complete the follow-up signal processing work such as synchronization and channel estimation.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89516420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A study of Chinese Text Classification based on a new type of BERT pre-training 基于新型BERT预训练的中文文本分类研究
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00062
Youyao Liu, Haimei Huang, Jialei Gao, Shihao Gai
Chinese Text Classification (TC) is the process of mapping text to a pre-given topics category. In recent years, it has been found that TC is mainly based on RNN and BERT, so the development of different novel pre-training applied to Chinese TC is described as based on BERT pre-training. BERT combined with convolutional neural network is proposed to extend the BERT-CNN model for the problem of lack of semantic knowledge of BERT to derive a good classification effect. The second RoBERTa model performs feature extraction and fusion to obtain the feature word vector as the text output vector, which can solve the problem of insufficient BERT extracted features. The BERT-BiGRU model, on the other hand, mainly avoids the increase in the number of texts leading to long training time and overfitting, and uses a simpler GRU bi-word network as the main network to fully extract the contextual information of Chinese texts and finally complete the classification of Chinese texts; at the same time, it makes an outlook and conclusion on the new pre-training model for Chinese TC.
中文文本分类(TC)是将文本映射到预先给定的主题类别的过程。近年来,人们发现机器学习主要是基于RNN和BERT,因此我们将各种新的预训练方法应用于中文机器学习的发展描述为基于BERT预训练。针对BERT缺乏语义知识的问题,提出BERT结合卷积神经网络对BERT- cnn模型进行扩展,以获得较好的分类效果。第二个RoBERTa模型进行特征提取和融合,得到特征词向量作为文本输出向量,解决了BERT提取特征不足的问题。而BERT-BiGRU模型则主要避免了文本数量增加导致训练时间过长和过拟合的问题,使用更简单的GRU双词网络作为主网络,充分提取中文文本的语境信息,最终完成中文文本的分类;同时,对新的汉语翻译预训练模型进行了展望和总结。
{"title":"A study of Chinese Text Classification based on a new type of BERT pre-training","authors":"Youyao Liu, Haimei Huang, Jialei Gao, Shihao Gai","doi":"10.1109/ICNLP58431.2023.00062","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00062","url":null,"abstract":"Chinese Text Classification (TC) is the process of mapping text to a pre-given topics category. In recent years, it has been found that TC is mainly based on RNN and BERT, so the development of different novel pre-training applied to Chinese TC is described as based on BERT pre-training. BERT combined with convolutional neural network is proposed to extend the BERT-CNN model for the problem of lack of semantic knowledge of BERT to derive a good classification effect. The second RoBERTa model performs feature extraction and fusion to obtain the feature word vector as the text output vector, which can solve the problem of insufficient BERT extracted features. The BERT-BiGRU model, on the other hand, mainly avoids the increase in the number of texts leading to long training time and overfitting, and uses a simpler GRU bi-word network as the main network to fully extract the contextual information of Chinese texts and finally complete the classification of Chinese texts; at the same time, it makes an outlook and conclusion on the new pre-training model for Chinese TC.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82681100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Two Stage Learning Algorithm for Hyperspectral Image Classification 高光谱图像分类的两阶段学习算法
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00022
Shuying Li, Qiang Zhang, Lei Cheng, Baidong Peng
Since the excellent performance of Support Vector Machine (SVM) in handling with high-dimensional data, it is often used in the field of hyperspectral image (HSI) classification. However, traditional SVM methods only uses a single Mercer kernel function as base kernel, which does not represent the similarity of samples well. Meanwhile, it cannot utilize the spatial background information to enhance the HSI classification results. To address these issues, the paper proposes a two-stage learning (TSL) algorithm for HSI classification. In the first stage, a new Kernel Singular Value Decomposition-Multiple Kernel learning (KSVD-MKL) method is proposed for SVM Multiple Kernel Learning (MKL), which allows the best combination of kernels to be composed by using Gaussian kernels with different bandwidth scales. In the second stage, the KSVD-MKL classification is used as the initial spectral term classification results. Then, spatial information is modeled by using Conditional Random Field (CRF) observation fields and labels, and the KSVD-MKL classification results are optimized. Experiment results on public Indian pines and Botswana datasets demonstrate that the classification accuracy of the proposed method is effectively improved against existing algorithms.
由于支持向量机(SVM)在处理高维数据方面的优异性能,它经常被用于高光谱图像(HSI)分类领域。然而,传统的支持向量机方法仅使用单一的Mercer核函数作为基核,不能很好地代表样本的相似性。同时,无法利用空间背景信息增强HSI分类结果。为了解决这些问题,本文提出了一种用于HSI分类的两阶段学习(TSL)算法。首先,针对支持向量机多核学习(MKL),提出了一种新的核奇异值分解-多核学习(KSVD-MKL)方法,利用不同带宽尺度的高斯核组成最佳的核组合;第二阶段使用KSVD-MKL分类作为初始光谱项分类结果。然后利用条件随机场(Conditional Random Field, CRF)观测场和标签对空间信息进行建模,并对KSVD-MKL分类结果进行优化。在印度松树和博茨瓦纳公共数据集上的实验结果表明,与现有算法相比,本文方法的分类精度得到了有效提高。
{"title":"A Two Stage Learning Algorithm for Hyperspectral Image Classification","authors":"Shuying Li, Qiang Zhang, Lei Cheng, Baidong Peng","doi":"10.1109/ICNLP58431.2023.00022","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00022","url":null,"abstract":"Since the excellent performance of Support Vector Machine (SVM) in handling with high-dimensional data, it is often used in the field of hyperspectral image (HSI) classification. However, traditional SVM methods only uses a single Mercer kernel function as base kernel, which does not represent the similarity of samples well. Meanwhile, it cannot utilize the spatial background information to enhance the HSI classification results. To address these issues, the paper proposes a two-stage learning (TSL) algorithm for HSI classification. In the first stage, a new Kernel Singular Value Decomposition-Multiple Kernel learning (KSVD-MKL) method is proposed for SVM Multiple Kernel Learning (MKL), which allows the best combination of kernels to be composed by using Gaussian kernels with different bandwidth scales. In the second stage, the KSVD-MKL classification is used as the initial spectral term classification results. Then, spatial information is modeled by using Conditional Random Field (CRF) observation fields and labels, and the KSVD-MKL classification results are optimized. Experiment results on public Indian pines and Botswana datasets demonstrate that the classification accuracy of the proposed method is effectively improved against existing algorithms.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73765973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Lightweight Human Pose Estimation Algorithm Based on High Resolution Network 一种基于高分辨率网络的轻量级人体姿态估计算法
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/icnlp58431.2023.00020
Sai Ma, Haibo Ge, Wenhao He, Chaofeng Huang, Yu An, Ting Zhou
Human pose estimation is an important research direction in the field of computer vision. At present, the mainstream human pose estimation algorithms have high complexity, large amount of calculation, and cannot be run on resource-constrained devices such as mobile terminals, which severely limits the popularization and application of this technology. Aiming at the problem of increased network model parameters and computational complexity, based on the High-Resolution Network (HRNet), a lightweight human pose estimation network incorporating Ghost module and attention mechanism is proposed. Replaced with Ghost convolution, and added the attention mechanism Concurrent Spatial and Channel Squeeze and Channel Excitation Net module on this basis to ensure the prediction accuracy of the network. Under the same image resolution and environment configuration, the experimental results on the COCO dataset show that the improved network model reduces the number of parameters by 98.3% compared to the high-resolution network model, and reduces the computational complexity by 67.6%. The experimental results show that the improved network model can effectively reduce the amount of network parameters and reduce the computational complexity while maintaining a certain prediction accuracy.
人体姿态估计是计算机视觉领域的一个重要研究方向。目前主流的人体姿态估计算法复杂度高、计算量大,且无法在移动终端等资源受限的设备上运行,严重限制了该技术的推广应用。针对网络模型参数增加和计算复杂度高的问题,基于高分辨率网络(HRNet),提出了一种结合Ghost模块和注意机制的轻量级人体姿态估计网络。替换为Ghost卷积,并在此基础上增加注意机制并发空间通道挤压和通道激励网模块,保证网络的预测精度。在相同的图像分辨率和环境配置下,在COCO数据集上的实验结果表明,改进的网络模型与高分辨率网络模型相比,参数数量减少了98.3%,计算复杂度降低了67.6%。实验结果表明,改进后的网络模型在保持一定预测精度的同时,能有效地减少网络参数的数量,降低计算复杂度。
{"title":"A Lightweight Human Pose Estimation Algorithm Based on High Resolution Network","authors":"Sai Ma, Haibo Ge, Wenhao He, Chaofeng Huang, Yu An, Ting Zhou","doi":"10.1109/icnlp58431.2023.00020","DOIUrl":"https://doi.org/10.1109/icnlp58431.2023.00020","url":null,"abstract":"Human pose estimation is an important research direction in the field of computer vision. At present, the mainstream human pose estimation algorithms have high complexity, large amount of calculation, and cannot be run on resource-constrained devices such as mobile terminals, which severely limits the popularization and application of this technology. Aiming at the problem of increased network model parameters and computational complexity, based on the High-Resolution Network (HRNet), a lightweight human pose estimation network incorporating Ghost module and attention mechanism is proposed. Replaced with Ghost convolution, and added the attention mechanism Concurrent Spatial and Channel Squeeze and Channel Excitation Net module on this basis to ensure the prediction accuracy of the network. Under the same image resolution and environment configuration, the experimental results on the COCO dataset show that the improved network model reduces the number of parameters by 98.3% compared to the high-resolution network model, and reduces the computational complexity by 67.6%. The experimental results show that the improved network model can effectively reduce the amount of network parameters and reduce the computational complexity while maintaining a certain prediction accuracy.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77942641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chinese Semantic Role Labeling Based on BILSTM-CRF Extended Model 基于BILSTM-CRF扩展模型的汉语语义角色标注
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/icnlp58431.2023.00039
Youyao Liu, Jialei Gao, Haimei Huang, Yifan Liu
Semantic role labeling (SRL) is a technique to analyze the structure of predicates and thesis elements in a sentence as a unit. It plays an important role in Chinese information recognition processing. Among the models of SRL studied in recent years, most of them are based on bidirectional long and short term memory loop network and conditional random field. In this paper, we first narrate the SRL model based on BILSTM-CRF, based on which the second model narrates the SRL model integrating Bert and BILSTM-CRF models due to the ability of pre-training and fine-tuning of Bert model. However, since the word vectors in Chinese text are obtained based on word stitching in the context window, making the words between them influence each other, the word vectors depend on this joint relationship. Therefore, for this, Gate filtering mechanism is integrated to adjust it, and in the third model, Gate mechanism is added to filter and denoise the word vectors based on BILSTM-CRF to further improve the recognition ability of SRL.
语义角色标注(SRL)是一种将句子中的谓语和命题元素作为一个单位进行结构分析的技术。它在汉语信息识别处理中起着重要的作用。在近年研究的SRL模型中,大多是基于双向长短期记忆环路网络和条件随机场的模型。本文首先叙述了基于BILSTM-CRF的SRL模型,在此基础上,利用Bert模型的预训练和微调能力,叙述了结合Bert和BILSTM-CRF模型的SRL模型。然而,由于中文文本中的词向量是通过上下文窗口中的词拼接获得的,使得它们之间的词相互影响,所以词向量依赖于这种联合关系。为此,我们集成了Gate滤波机制对其进行调整,在第三个模型中,我们在BILSTM-CRF的基础上,加入Gate机制对词向量进行滤波降噪,进一步提高SRL的识别能力。
{"title":"Chinese Semantic Role Labeling Based on BILSTM-CRF Extended Model","authors":"Youyao Liu, Jialei Gao, Haimei Huang, Yifan Liu","doi":"10.1109/icnlp58431.2023.00039","DOIUrl":"https://doi.org/10.1109/icnlp58431.2023.00039","url":null,"abstract":"Semantic role labeling (SRL) is a technique to analyze the structure of predicates and thesis elements in a sentence as a unit. It plays an important role in Chinese information recognition processing. Among the models of SRL studied in recent years, most of them are based on bidirectional long and short term memory loop network and conditional random field. In this paper, we first narrate the SRL model based on BILSTM-CRF, based on which the second model narrates the SRL model integrating Bert and BILSTM-CRF models due to the ability of pre-training and fine-tuning of Bert model. However, since the word vectors in Chinese text are obtained based on word stitching in the context window, making the words between them influence each other, the word vectors depend on this joint relationship. Therefore, for this, Gate filtering mechanism is integrated to adjust it, and in the third model, Gate mechanism is added to filter and denoise the word vectors based on BILSTM-CRF to further improve the recognition ability of SRL.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82945049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
N-ary Relational Link Prediction Algorithm Fusing Graph Attributes 融合图属性的n元关联链接预测算法
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00081
Chenlin Xing, Tao Luo, Jie Lv, Zhilong Zhang
Knowledge graph is widely used in real life, but there is still a lot of missing information, which makes the completion of knowledge graph very important. Link prediction is one of the main methods to complete knowledge graph. In addition to binary relation facts which have received a lot of attention, there are also hyper-relation facts that are ubiquitous in the real world, namely n-ary relation facts. In this paper, we focus on link prediction algorithms for n-ary relation facts and find that the existing algorithms ignore the graph attribute information of nary relation facts themselves in the calculation process. Consequently, the distribution of entities and relations in n-ary relational datasets is analyzed first. The results show the fact that some of the n-ary relation facts are very important, while others are less important. This indicates that they have the characteristics of the scale-free network. Then, the global graph parameter (GGP) is introduced to describe the importance of entities and relations, and weighted to the link prediction process to improve the accuracy performance. Finally, extensive evaluation on commonly used n-ary datasets JF17K, WikiPeople, and their specific arity subsets validate the superiority of the proposed algorithm.
知识图谱在现实生活中得到了广泛的应用,但仍然存在大量的信息缺失,这使得知识图谱的完善变得非常重要。链接预测是完成知识图谱的主要方法之一。除了备受关注的二元关系事实外,还有在现实世界中普遍存在的超关系事实,即n元关系事实。本文重点研究了n元关系事实的链路预测算法,发现现有算法在计算过程中忽略了n元关系事实本身的图属性信息。因此,首先分析了n元关系数据集中实体和关系的分布。结果表明,某些n元关系事实非常重要,而另一些则不那么重要。这表明它们具有无标度网络的特征。然后,引入全局图参数(GGP)来描述实体和关系的重要性,并将其加权到链路预测过程中,以提高预测精度。最后,对常用的n元数据集JF17K、WikiPeople及其特定的基数子集进行了广泛的评估,验证了该算法的优越性。
{"title":"N-ary Relational Link Prediction Algorithm Fusing Graph Attributes","authors":"Chenlin Xing, Tao Luo, Jie Lv, Zhilong Zhang","doi":"10.1109/ICNLP58431.2023.00081","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00081","url":null,"abstract":"Knowledge graph is widely used in real life, but there is still a lot of missing information, which makes the completion of knowledge graph very important. Link prediction is one of the main methods to complete knowledge graph. In addition to binary relation facts which have received a lot of attention, there are also hyper-relation facts that are ubiquitous in the real world, namely n-ary relation facts. In this paper, we focus on link prediction algorithms for n-ary relation facts and find that the existing algorithms ignore the graph attribute information of nary relation facts themselves in the calculation process. Consequently, the distribution of entities and relations in n-ary relational datasets is analyzed first. The results show the fact that some of the n-ary relation facts are very important, while others are less important. This indicates that they have the characteristics of the scale-free network. Then, the global graph parameter (GGP) is introduced to describe the importance of entities and relations, and weighted to the link prediction process to improve the accuracy performance. Finally, extensive evaluation on commonly used n-ary datasets JF17K, WikiPeople, and their specific arity subsets validate the superiority of the proposed algorithm.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86693892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the implementation of the algorithm for representation of discontinuity in natural language 自然语言中不连续表示算法的实现
Q3 Arts and Humanities Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00059
Ratna Nirupama, Prakash Mondal
The present paper is a demonstration of the algorithm to integrate the representational principles of three grammar formalisms: constituency by Phrase Structure Grammar (PSG), head-dependency relations by Dependency Grammar (DG) and functor-argument relations by Categorial Grammar (CG) for achieving a unified representation. This algorithm is written for analyzing both continuous and discontinuous sentences in natural language and thereby provides a unique solution towards discontinuity in natural language. For mustrative purposes, a discontinuous relative clause from Salish has been taken to show the implementation of the algorithm. A discussion on the significance of this algorithm and the unified representation is present towards the end of the paper, followed by a conclusion.
本文演示了一种将三种语法形式的表示原则:用短语结构语法(PSG)来表示选区,用依存语法(DG)来表示头部依赖关系,用范畴语法(CG)来表示函子-论证关系来实现统一表示的算法。该算法对自然语言中的连续句和不连续句进行分析,从而为自然语言中的不连续句提供了一种独特的解决方案。为了说明问题,本文采用了萨利什语中的一个不连续关系从句来说明该算法的实现。本文最后讨论了该算法的意义和统一表示,并给出了结论。
{"title":"On the implementation of the algorithm for representation of discontinuity in natural language","authors":"Ratna Nirupama, Prakash Mondal","doi":"10.1109/ICNLP58431.2023.00059","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00059","url":null,"abstract":"The present paper is a demonstration of the algorithm to integrate the representational principles of three grammar formalisms: constituency by Phrase Structure Grammar (PSG), head-dependency relations by Dependency Grammar (DG) and functor-argument relations by Categorial Grammar (CG) for achieving a unified representation. This algorithm is written for analyzing both continuous and discontinuous sentences in natural language and thereby provides a unique solution towards discontinuity in natural language. For mustrative purposes, a discontinuous relative clause from Salish has been taken to show the implementation of the algorithm. A discussion on the significance of this algorithm and the unified representation is present towards the end of the paper, followed by a conclusion.","PeriodicalId":53637,"journal":{"name":"Icon","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89328728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Icon
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1