Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9994864
Shuting Sun, Chang Yan, Juntong Lyu, Yueran Xin, Jieyuan Zheng, Zhaolong Yu, B. Hu
Neural circuit dysfunction underlies the biological mechanisms of major depressive disorder (MDD). However, little is known about how the brain’s dynamic connectomes differentiate between depressed patients and normal controls. As a result, we collected resting-state Electroencephalography from 16 MDD patients and 16 controls using 128-electrode geodesic sensor net. Static and dynamic network metrics were later applied to explore the abnormal topological structure of MDD patients and identify them from normal controls using traditional machine learning algorithms with feature selection methods. Results showed that the MDD tend to have a more randomized formation both in static and dynamic network. We also found that the combined static-dynamic feature set usually outperformed others with a highest accuracy of 79.25% under delta band. Lower frequency band (delta, theta) showed relatively better outcomes compared to higher frequency band (alpha, beta). It also indicate the role of functional segregation features as a potential biomarker for depression. In conclusion, neuropathological mechanism of depression may be more objectively quantified and evaluated from the perspective of combining static and dynamic network.
{"title":"EEG Based Depression Recognition by Employing Static and Dynamic Network Metrics","authors":"Shuting Sun, Chang Yan, Juntong Lyu, Yueran Xin, Jieyuan Zheng, Zhaolong Yu, B. Hu","doi":"10.1109/BIBM55620.2022.9994864","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9994864","url":null,"abstract":"Neural circuit dysfunction underlies the biological mechanisms of major depressive disorder (MDD). However, little is known about how the brain’s dynamic connectomes differentiate between depressed patients and normal controls. As a result, we collected resting-state Electroencephalography from 16 MDD patients and 16 controls using 128-electrode geodesic sensor net. Static and dynamic network metrics were later applied to explore the abnormal topological structure of MDD patients and identify them from normal controls using traditional machine learning algorithms with feature selection methods. Results showed that the MDD tend to have a more randomized formation both in static and dynamic network. We also found that the combined static-dynamic feature set usually outperformed others with a highest accuracy of 79.25% under delta band. Lower frequency band (delta, theta) showed relatively better outcomes compared to higher frequency band (alpha, beta). It also indicate the role of functional segregation features as a potential biomarker for depression. In conclusion, neuropathological mechanism of depression may be more objectively quantified and evaluated from the perspective of combining static and dynamic network.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121207013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9995287
Jiawei Li, Fan Yang, Fang Wang, Yu Rong, P. Zhao, Shizhan Chen, Jianhua Yao, Jijun Tang, Fei Guo
Inferring gene regulatory networks based on single-cell transcriptomes is critical for systematically understanding cell-specific regulatory networks and discovering drug targets in tumor cells. Here we show that existing methods mainly perform co-expression analysis and apply the image-based model to deal with the non-euclidean scRNA-seq data, which may not reasonably handle the dropout problem and not fully take advantage of the validated gene regulatory topology. We propose a graph-based end-to-end deep learning model for GRN inference (GRNInfer) with the help of known regulatory relations through transductive learning. The robustness and superiority of the model are demonstrated by comparative experiments.
{"title":"Integrating Prior Knowledge with Graph Encoder for Gene Regulatory Inference from Single-cell RNA-Seq Data","authors":"Jiawei Li, Fan Yang, Fang Wang, Yu Rong, P. Zhao, Shizhan Chen, Jianhua Yao, Jijun Tang, Fei Guo","doi":"10.1109/BIBM55620.2022.9995287","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995287","url":null,"abstract":"Inferring gene regulatory networks based on single-cell transcriptomes is critical for systematically understanding cell-specific regulatory networks and discovering drug targets in tumor cells. Here we show that existing methods mainly perform co-expression analysis and apply the image-based model to deal with the non-euclidean scRNA-seq data, which may not reasonably handle the dropout problem and not fully take advantage of the validated gene regulatory topology. We propose a graph-based end-to-end deep learning model for GRN inference (GRNInfer) with the help of known regulatory relations through transductive learning. The robustness and superiority of the model are demonstrated by comparative experiments.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"429 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116542202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9994976
Zhi-Yuan Li, Ying-Lian Gao, Zhen Niu, Shasha Yuan, C. Zheng, Jin-Xing Liu
Ensemble learning is to train and combine multiple learners to complete the corresponding learning tasks. It can improve the stability of the overall model, and a good ensemble method can further improve the accuracy of the model. At the same time, as one of the outstanding representatives of machine learning, Extreme Learning Machine has attracted the continuous attention of experts and scholars. to get a better representation of the feature space, we extend the Gaussian kernel in the kernel risk-sensitive loss and propose a Kernel Risk-Sensitive Loss of q-Gaussian kernel and Hyper-graph Regularized Extreme Learning Machine method. Since the contingency in the ELM training process cannot be completely avoided, the stability of most ELM methods is affected to some extent. What’s more, we introduce the voting mechanism and a new ELM classification model named Kernel Risk-Sensitive Loss of q-Gaussian kernel and Hyper-graph Regularized Integrated Extreme Learning Machine based on Voting Mechanism is proposed. It improves the stability of the model through the idea of ensemble learning. We apply the new model on six real data sets, and through observation and analysis of experimental results, we find that the new model has certain competitiveness, especially in classification accuracy and stability.
{"title":"An integrated Extreme learning machine based on kernel risk-sensitive loss of q-Gaussian and voting mechanism for sample classification","authors":"Zhi-Yuan Li, Ying-Lian Gao, Zhen Niu, Shasha Yuan, C. Zheng, Jin-Xing Liu","doi":"10.1109/BIBM55620.2022.9994976","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9994976","url":null,"abstract":"Ensemble learning is to train and combine multiple learners to complete the corresponding learning tasks. It can improve the stability of the overall model, and a good ensemble method can further improve the accuracy of the model. At the same time, as one of the outstanding representatives of machine learning, Extreme Learning Machine has attracted the continuous attention of experts and scholars. to get a better representation of the feature space, we extend the Gaussian kernel in the kernel risk-sensitive loss and propose a Kernel Risk-Sensitive Loss of q-Gaussian kernel and Hyper-graph Regularized Extreme Learning Machine method. Since the contingency in the ELM training process cannot be completely avoided, the stability of most ELM methods is affected to some extent. What’s more, we introduce the voting mechanism and a new ELM classification model named Kernel Risk-Sensitive Loss of q-Gaussian kernel and Hyper-graph Regularized Integrated Extreme Learning Machine based on Voting Mechanism is proposed. It improves the stability of the model through the idea of ensemble learning. We apply the new model on six real data sets, and through observation and analysis of experimental results, we find that the new model has certain competitiveness, especially in classification accuracy and stability.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116742978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objective: This study aimed to explore the mechanism of action of Wumei Pills (WMP) in treating gastric cancer (GC) based on network pharmacology and molecular docking. Methods: The Wumei Pills’ active ingredients were obtained from the traditional Chinese medicine system pharmacology database, and the target sites were obtained from the PharmMapper database. GC’ s target genes were identified through GeneCards, the Therapeutic Target Database, and other databases. The intersection of the two was used to determine the target of active ingredients of WMP that were related to GC. Cytoscape 3.7.0 was used to establish the network map of “ compound-traditional Chinese medicine-ingredient-target” to screen the core components. The Search Tool for the Retrieval of Interacting Genes/Proteins database and Cytoscape 3.7.0 were used to analyze and visualize potential genes of WMP in treating GC. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment were conducted through Metascape. The “ target-critical path” network diagram was created by screening relevant pathways with the enrichment score. KM plotter and Gene Expression Profiling Interactive Analysis database were used to draw GC related survival curve online for core genes. AutoDock Vina and PyMol software were used to conduct molecular docking and visualization. Results: There were 99 intersection targets of the active ingredients of WMP and the disease. Protein-protein interaction network topology analysis revealed ALB, EGFR, SRC, and other key targets. Molecular docking results showed that the key active components had good binding with the core target, and ALB and ESR1 genes were significant in survival analysis. Conclusion:WMP could treat GC via beta-sitosterol, stigmasterol, and other active ingredients that acted on ALB, EGFR, SRC, and other targets. The mechanism could be related to the epithelial cell signal transduction pathway in Helicobacter pylori infection, which played a multi-target and multi-pathway therapeutic role.
{"title":"The Mechanism of Action of Network Pharmacology Integrated with Molecular Docking to Explore Wumei Pills in Treating Gastric Cancer","authors":"Zhongwen Lu, Shuang Zhang, Fei Teng, Xuanhe Tian, Xijian Liu, Xiaochun Han","doi":"10.1109/BIBM55620.2022.9995670","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995670","url":null,"abstract":"Objective: This study aimed to explore the mechanism of action of Wumei Pills (WMP) in treating gastric cancer (GC) based on network pharmacology and molecular docking. Methods: The Wumei Pills’ active ingredients were obtained from the traditional Chinese medicine system pharmacology database, and the target sites were obtained from the PharmMapper database. GC’ s target genes were identified through GeneCards, the Therapeutic Target Database, and other databases. The intersection of the two was used to determine the target of active ingredients of WMP that were related to GC. Cytoscape 3.7.0 was used to establish the network map of “ compound-traditional Chinese medicine-ingredient-target” to screen the core components. The Search Tool for the Retrieval of Interacting Genes/Proteins database and Cytoscape 3.7.0 were used to analyze and visualize potential genes of WMP in treating GC. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment were conducted through Metascape. The “ target-critical path” network diagram was created by screening relevant pathways with the enrichment score. KM plotter and Gene Expression Profiling Interactive Analysis database were used to draw GC related survival curve online for core genes. AutoDock Vina and PyMol software were used to conduct molecular docking and visualization. Results: There were 99 intersection targets of the active ingredients of WMP and the disease. Protein-protein interaction network topology analysis revealed ALB, EGFR, SRC, and other key targets. Molecular docking results showed that the key active components had good binding with the core target, and ALB and ESR1 genes were significant in survival analysis. Conclusion:WMP could treat GC via beta-sitosterol, stigmasterol, and other active ingredients that acted on ALB, EGFR, SRC, and other targets. The mechanism could be related to the epithelial cell signal transduction pathway in Helicobacter pylori infection, which played a multi-target and multi-pathway therapeutic role.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123868116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9995116
Yacong Li, Lei Ma, Qince Li, Henggui Zhang, Kuanquan Wang
Biological pacemaker is a therapy for cardiac rhythm disease, which can be transformed from ventricular myocytes (VMs) by overexpressing HCN gene which codes the expression of hyperpolarization-activated current (${mathrm {I}}_{mathrm{f}}$) and knocking off Kir2.1 gene which codes inward-rectifier potassium current (${mathrm {I}}_{mathrm{K1}}$). Our previous study built a biological pacemaker single cell model and clarified the underlying mechanisms of how gene expressing levels influence the pacemaking activity of single pacemaker cell. But the pacemaking ability of pacemaker tissue has not been researched systematically. And what factors may have effects on pacemaker’s synchronization and spontaneous beating propagation are not clear. Biological research indicated that both sinoatrial node and pacemaker cells has less expression of connexin than unexcitable cardiac cells, which provides a possibility that improve pacemaking ability of pacemaker by decreasing its cell coupling. Another possible factor is the number of pacemaker cells. According to the common sense, increasing cell number can promote pacemaking behaviours, but overmuch pacemaker cells is unreasonable in clinic. As a result, the balance between pacemaker number and cell coupling is important when applying biological pacemaker. In this study, we constructed a two-dimensional cardiac tissue model with the description of electrophysiology to illustrate the relationship between gap junction and cell number. Based on this model, we modified the cell coupling between pacemaker cells by adjusting the diffusion coefficient of tissue with different pacemaker number. In different condition, the synchronization, pacemaking cycle length and electrical signal propagation were evaluated. It can be concluded that weakening cell coupling among pacemaker cells can lift the efficiency of bio-pacemaker therapy. This study may contribute to produce effective pacemaker in clinic.
{"title":"Effect of cell coupling between pacemaker cells on the biological pacemaker in cardiac tissue model","authors":"Yacong Li, Lei Ma, Qince Li, Henggui Zhang, Kuanquan Wang","doi":"10.1109/BIBM55620.2022.9995116","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995116","url":null,"abstract":"Biological pacemaker is a therapy for cardiac rhythm disease, which can be transformed from ventricular myocytes (VMs) by overexpressing HCN gene which codes the expression of hyperpolarization-activated current (${mathrm {I}}_{mathrm{f}}$) and knocking off Kir2.1 gene which codes inward-rectifier potassium current (${mathrm {I}}_{mathrm{K1}}$). Our previous study built a biological pacemaker single cell model and clarified the underlying mechanisms of how gene expressing levels influence the pacemaking activity of single pacemaker cell. But the pacemaking ability of pacemaker tissue has not been researched systematically. And what factors may have effects on pacemaker’s synchronization and spontaneous beating propagation are not clear. Biological research indicated that both sinoatrial node and pacemaker cells has less expression of connexin than unexcitable cardiac cells, which provides a possibility that improve pacemaking ability of pacemaker by decreasing its cell coupling. Another possible factor is the number of pacemaker cells. According to the common sense, increasing cell number can promote pacemaking behaviours, but overmuch pacemaker cells is unreasonable in clinic. As a result, the balance between pacemaker number and cell coupling is important when applying biological pacemaker. In this study, we constructed a two-dimensional cardiac tissue model with the description of electrophysiology to illustrate the relationship between gap junction and cell number. Based on this model, we modified the cell coupling between pacemaker cells by adjusting the diffusion coefficient of tissue with different pacemaker number. In different condition, the synchronization, pacemaking cycle length and electrical signal propagation were evaluated. It can be concluded that weakening cell coupling among pacemaker cells can lift the efficiency of bio-pacemaker therapy. This study may contribute to produce effective pacemaker in clinic.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121591447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9995387
Si-Jiu Wu, Tianyu Huang, Yihao Li
This paper proposes a shallow convolutional neural network (CNN) model to improve the efficiency and accuracy of real-time human activity recognition (HAR). In the traditional convolutional network, an Mix-Patch-Layer (MPL) block based on the attention mechanism is added to enhance the expressiveness of the network extracted features. This block makes the features in the network focus on the information between different parts of itself, which makes up for the loss of global information in temporal data features. Experiments show that the block can improve real-time human recognition accuracy and efficiency with a shallow network.
{"title":"A rehabilitation activity monitoring method based on Shallow-CNN","authors":"Si-Jiu Wu, Tianyu Huang, Yihao Li","doi":"10.1109/BIBM55620.2022.9995387","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995387","url":null,"abstract":"This paper proposes a shallow convolutional neural network (CNN) model to improve the efficiency and accuracy of real-time human activity recognition (HAR). In the traditional convolutional network, an Mix-Patch-Layer (MPL) block based on the attention mechanism is added to enhance the expressiveness of the network extracted features. This block makes the features in the network focus on the information between different parts of itself, which makes up for the loss of global information in temporal data features. Experiments show that the block can improve real-time human recognition accuracy and efficiency with a shallow network.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115827635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
single-cell RNA-sequencing (scRNA-seq) technology can reveal cellular heterogeneity with high throughput and resolution, facilitating the profiling of single-cell transcriptomes. However, due to some experimental factors, a large number of missing values are generated in scRNA-seq data, which are called dropout events, and this phenomenon affects the downstream analysis. Imputation is an effective denoising method, but existing imputation methods still face a huge challenge: lack of interpretability. In this study, we propose single-cell Self-Attention Generative Adversarial Networks(scSAGAN), a semi-supervised imputation method for scRNA-seq data. scSAGAN mainly uses Semi-Supervised Learning (SSL) and Probabilistic Latent Semantic Analysis (PLSA), which can not only learn the potential characteristics of different types of cells but explain their imputation behavior. In clustering experiments, scSAGAN exhibits better clustering performance than all baselines on 7 datasets. Next, we interpret the imputation behavior of scSAGAN on datasets such as Alzheimer’s disease and find causative genes associated with the corresponding datasets. scSAGAN is currently an open-source method, available at https://github.com/zehaoxiongl23/scSAGAN.
{"title":"scSAGAN: A scRNA-seq data imputation method based on Semi-Supervised Learning and Probabilistic Latent Semantic Analysis","authors":"Zehao Xiong, Xiangtao Chen, Jiawei Luo, Cong Shen, Zhongyuan Xu","doi":"10.1109/BIBM55620.2022.9995463","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995463","url":null,"abstract":"single-cell RNA-sequencing (scRNA-seq) technology can reveal cellular heterogeneity with high throughput and resolution, facilitating the profiling of single-cell transcriptomes. However, due to some experimental factors, a large number of missing values are generated in scRNA-seq data, which are called dropout events, and this phenomenon affects the downstream analysis. Imputation is an effective denoising method, but existing imputation methods still face a huge challenge: lack of interpretability. In this study, we propose single-cell Self-Attention Generative Adversarial Networks(scSAGAN), a semi-supervised imputation method for scRNA-seq data. scSAGAN mainly uses Semi-Supervised Learning (SSL) and Probabilistic Latent Semantic Analysis (PLSA), which can not only learn the potential characteristics of different types of cells but explain their imputation behavior. In clustering experiments, scSAGAN exhibits better clustering performance than all baselines on 7 datasets. Next, we interpret the imputation behavior of scSAGAN on datasets such as Alzheimer’s disease and find causative genes associated with the corresponding datasets. scSAGAN is currently an open-source method, available at https://github.com/zehaoxiongl23/scSAGAN.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122465997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9994973
Bo Liu, Hong Song, Qiang Li, Yucong Lin, Jian Yang
Lung cancer is with the highest morbidity and mortality, and early detection of cancerous changes is essential to reduce the risk of death. To achieve this, it is necessary to reduce the false positive rate of detection. In this paper, we propose a novel asymmetric residual network, called 3D ARCNN, to reduce false positive rate of lung nodules detection. 3D ARCNN consists of asymmetric convolutional and multilayer cascaded residual network structures. To solve the problem of deep neural network with large amounts of parameters and poor reproduction ability, the proposed model uses asymmetric convolution to reduce model parameters and enhance the generalization ability of the model. In addition, the model uses an internally cascaded multi-stage residual to prevent the gradient vanishing and exploding problems of deep networks. Experiments are performed on the public dataset LUNA16. Our method achieved high detection sensitivity of 91.6%, 92.7%, 93.2% and 95.8% at 1, 2, 4 and 8 false positives per scan, respectively, which got an average CPM index of 0.912. Experimental results show that the proposed 3D ARCNN is very useful for reducing the false positive rate of lung nodules in the clinic.
{"title":"3D ARCNN: An Asymmetric Residual CNN for Decreasing False Positive Rate of Lung Nodules Detection","authors":"Bo Liu, Hong Song, Qiang Li, Yucong Lin, Jian Yang","doi":"10.1109/BIBM55620.2022.9994973","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9994973","url":null,"abstract":"Lung cancer is with the highest morbidity and mortality, and early detection of cancerous changes is essential to reduce the risk of death. To achieve this, it is necessary to reduce the false positive rate of detection. In this paper, we propose a novel asymmetric residual network, called 3D ARCNN, to reduce false positive rate of lung nodules detection. 3D ARCNN consists of asymmetric convolutional and multilayer cascaded residual network structures. To solve the problem of deep neural network with large amounts of parameters and poor reproduction ability, the proposed model uses asymmetric convolution to reduce model parameters and enhance the generalization ability of the model. In addition, the model uses an internally cascaded multi-stage residual to prevent the gradient vanishing and exploding problems of deep networks. Experiments are performed on the public dataset LUNA16. Our method achieved high detection sensitivity of 91.6%, 92.7%, 93.2% and 95.8% at 1, 2, 4 and 8 false positives per scan, respectively, which got an average CPM index of 0.912. Experimental results show that the proposed 3D ARCNN is very useful for reducing the false positive rate of lung nodules in the clinic.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128089242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9995453
Xinlong Liu, Zepeng Sun, W. Liu, Feng Qiao, Li Cui, Jing Yang, Jingjie Sha, Jian Li, Li-Qun Xu
Solid-state nanopores have shown impressive performances in several sequencing research scenarios, such as biomolecule conformation detection, biomarker identification, and protein fingerprinting. In all these scenarios, accurate event detection is the fundamental step toward data analysis. Most existing event detection methods use either user-defined thresholds or adaptive thresholds determined automatically by the data. The former class depends heavily on human expertise, which is labor-intensive; the latter appears to be more advanced, however, the setting of threshold parameters is somewhat tricky. Hence, the results are usually inconsistent among different methods. In this paper, we develop a novel event detection method, where the selection threshold is computed following the principle governed by an analytical expression. Unlike other methods, each event’s starting and ending points are located based on the slope rather than picking the first point whose current value goes across the baseline. Moreover, we add a method to determine whether multiple levels are present within each event. We then evaluate the method on two groups of current traces generated by short ssDNA and 48.5kb λ-DNA samples, respectively. The results show that our method performs well on detecting challenging translocation events with relatively low amplitudes, and is also able to accurately locate the starting/end points of each level of the events.
{"title":"Multi-level translocation events analysis in solid-state nanopore current traces","authors":"Xinlong Liu, Zepeng Sun, W. Liu, Feng Qiao, Li Cui, Jing Yang, Jingjie Sha, Jian Li, Li-Qun Xu","doi":"10.1109/BIBM55620.2022.9995453","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995453","url":null,"abstract":"Solid-state nanopores have shown impressive performances in several sequencing research scenarios, such as biomolecule conformation detection, biomarker identification, and protein fingerprinting. In all these scenarios, accurate event detection is the fundamental step toward data analysis. Most existing event detection methods use either user-defined thresholds or adaptive thresholds determined automatically by the data. The former class depends heavily on human expertise, which is labor-intensive; the latter appears to be more advanced, however, the setting of threshold parameters is somewhat tricky. Hence, the results are usually inconsistent among different methods. In this paper, we develop a novel event detection method, where the selection threshold is computed following the principle governed by an analytical expression. Unlike other methods, each event’s starting and ending points are located based on the slope rather than picking the first point whose current value goes across the baseline. Moreover, we add a method to determine whether multiple levels are present within each event. We then evaluate the method on two groups of current traces generated by short ssDNA and 48.5kb λ-DNA samples, respectively. The results show that our method performs well on detecting challenging translocation events with relatively low amplitudes, and is also able to accurately locate the starting/end points of each level of the events.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"10 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132605757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-06DOI: 10.1109/BIBM55620.2022.9995199
Tiantian Li, Daming Zhu, Haitao Jiang, Haodi Feng, Xuefeng Cui
We focus on a new problem that is formulated to find a longest k-tuple of common sub-strings (abbr. k-CSSs) of two or more strings. We present a suffix tree based algorithm for this problem, which can find a longest k-CSS of m strings in $O(kmn^{k})$ time and $O(kmn)$ space where n is the length sum of the m strings. This algorithm can be used to approximate the longest k-CSS problem to a performance ratio $frac{1}{epsilon}$ in $O(kmn^{lceilepsilon krceil})$ time for $epsilonin(0,1]$. Since the algorithm has the space complexity in linear order of n, it will show advantage in comparing particularly long strings. This algorithm proves that the problem that asks to find a longest gapped pattern of non-constant number of strings is polynomial time solvable if the gap number is restricted constant, although the problem without any restriction on the gap number was proved NP-Hard. Using a C++ tool that is reliant on the algorithm, we performed experiments of finding longest 2-CSSs, 3-CSSs and 5-CSSs of 2 ~ 14 COVID-19 S-proteins. Under the help of longest 2-CSSs and 3-CSSs of COVID-19 S-proteins, we identified the mutation sites in the S-proteins of two COVID-19 variants Delta and Omicron. The algorithm based tool is available for downloading at https://github.com/lytt0/k-CSS.
{"title":"Longest k-tuple Common Sub-Strings","authors":"Tiantian Li, Daming Zhu, Haitao Jiang, Haodi Feng, Xuefeng Cui","doi":"10.1109/BIBM55620.2022.9995199","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995199","url":null,"abstract":"We focus on a new problem that is formulated to find a longest k-tuple of common sub-strings (abbr. k-CSSs) of two or more strings. We present a suffix tree based algorithm for this problem, which can find a longest k-CSS of m strings in $O(kmn^{k})$ time and $O(kmn)$ space where n is the length sum of the m strings. This algorithm can be used to approximate the longest k-CSS problem to a performance ratio $frac{1}{epsilon}$ in $O(kmn^{lceilepsilon krceil})$ time for $epsilonin(0,1]$. Since the algorithm has the space complexity in linear order of n, it will show advantage in comparing particularly long strings. This algorithm proves that the problem that asks to find a longest gapped pattern of non-constant number of strings is polynomial time solvable if the gap number is restricted constant, although the problem without any restriction on the gap number was proved NP-Hard. Using a C++ tool that is reliant on the algorithm, we performed experiments of finding longest 2-CSSs, 3-CSSs and 5-CSSs of 2 ~ 14 COVID-19 S-proteins. Under the help of longest 2-CSSs and 3-CSSs of COVID-19 S-proteins, we identified the mutation sites in the S-proteins of two COVID-19 variants Delta and Omicron. The algorithm based tool is available for downloading at https://github.com/lytt0/k-CSS.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132781884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}