Sabenabanu Abdulkadhar, Gurusamy Murugesan, J. Natarajan
{"title":"用于生物医学文献中蛋白质-蛋白质相互作用文章分类的递归卷积神经网络","authors":"Sabenabanu Abdulkadhar, Gurusamy Murugesan, J. Natarajan","doi":"10.1109/ICRCICN.2017.8234505","DOIUrl":null,"url":null,"abstract":"Text classification (TC) is a task that assigns a text to one or more classes and predefined categories. Constructing text classifiers with high accuracy is a vital task in biomedical field, given the wealth of information hidden in unlabelled documents. Because of large feature spaces, traditionally discriminative approaches, such as logistic regression and support vector machines with n-gram and semantic features have been utilizedfor biomedical text classification. In this study, we propose Recurrent Convolution Neural Networks (RCNN) based automated technique for classifying protein-protein interaction (PPI) articles. In RCNN model we utilized a recurrent structure to detain the contextual information from word embedding features. Max pooling layer was configured to extract important semantic keywords from the text. We evaluated our approach on two benchmark PPI datasets BioCreative II and BioCreative III. An experimental results show that RCNN based protein-protein interaction classification approach performs better than other state of the art approaches.","PeriodicalId":166298,"journal":{"name":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","volume":"37 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Recurrent convolution neural networks for classification of protein-protein interaction articles from biomedical literature\",\"authors\":\"Sabenabanu Abdulkadhar, Gurusamy Murugesan, J. Natarajan\",\"doi\":\"10.1109/ICRCICN.2017.8234505\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text classification (TC) is a task that assigns a text to one or more classes and predefined categories. Constructing text classifiers with high accuracy is a vital task in biomedical field, given the wealth of information hidden in unlabelled documents. Because of large feature spaces, traditionally discriminative approaches, such as logistic regression and support vector machines with n-gram and semantic features have been utilizedfor biomedical text classification. In this study, we propose Recurrent Convolution Neural Networks (RCNN) based automated technique for classifying protein-protein interaction (PPI) articles. In RCNN model we utilized a recurrent structure to detain the contextual information from word embedding features. Max pooling layer was configured to extract important semantic keywords from the text. We evaluated our approach on two benchmark PPI datasets BioCreative II and BioCreative III. An experimental results show that RCNN based protein-protein interaction classification approach performs better than other state of the art approaches.\",\"PeriodicalId\":166298,\"journal\":{\"name\":\"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)\",\"volume\":\"37 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRCICN.2017.8234505\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Third International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCICN.2017.8234505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recurrent convolution neural networks for classification of protein-protein interaction articles from biomedical literature
Text classification (TC) is a task that assigns a text to one or more classes and predefined categories. Constructing text classifiers with high accuracy is a vital task in biomedical field, given the wealth of information hidden in unlabelled documents. Because of large feature spaces, traditionally discriminative approaches, such as logistic regression and support vector machines with n-gram and semantic features have been utilizedfor biomedical text classification. In this study, we propose Recurrent Convolution Neural Networks (RCNN) based automated technique for classifying protein-protein interaction (PPI) articles. In RCNN model we utilized a recurrent structure to detain the contextual information from word embedding features. Max pooling layer was configured to extract important semantic keywords from the text. We evaluated our approach on two benchmark PPI datasets BioCreative II and BioCreative III. An experimental results show that RCNN based protein-protein interaction classification approach performs better than other state of the art approaches.