IEEE Transactions on NanoBioscience最新文献_第9页

A Novel Framework for Tongue Feature Extraction Framework Based on Sublingual Vein Segmentation 基于舌下静脉分割的新型舌头特征提取框架

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-09-17 DOI: 10.1109/TNB.2024.3462461

Xiaohua Wan;Yulong Hu;Dehui Qiu;Juan Zhang;Xiaotong Wang;Fa Zhang;Bin Hu

The features of the sublingual veins, including swelling, varicose patterns, and cyanosis, are pivotal in differentiating symptoms and selecting treatments in Traditional Chinese Medicine (TCM) tongue diagnosis. These features serve as a crucial reflection of the human blood circulation status. Nevertheless, the automatic and precise extraction of sublingual vein features remains a formidable challenge, constrained by the scarcity of datasets for sublingual images and the interference of noise from non-tongue and non-sublingual vein elements. In this paper, we present an innovative tongue feature extraction method that relies on focusing specifically on segmenting the sublingual vein rather than the entire tongue base. To achieve this, we have developed a sublingual vein segmentation framework utilizing a Polyp-PVT network, effectively eliminating noise from the surrounding regions of the sublingual vein. Furthermore, we pioneer the utilization of a transformer-based approach, such as the Swin-Transformer network, to extract sublingual vein features, leveraging the remarkable capabilities of transformer networks. To complement our methodology, we have constructed a comprehensive dataset of sublingual vein images, facilitating the segmentation and classification of sublingual veins. Experimental results have demonstrated that our tongue feature extraction method, coupled with sublingual vein segmentation, significantly outperforms existing tongue feature extraction techniques.

舌下静脉的特征，包括肿胀、静脉曲张和紫绀，是中医舌诊鉴别症状和选择治疗方法的关键。这些特征是人体血液循环状况的重要反映。然而，由于舌下图像数据集的稀缺以及非舌下和非舌下静脉元素噪声的干扰，舌下静脉特征的自动、精确提取仍然是一个巨大的挑战。在本文中，我们提出了一种创新的舌头特征提取方法，该方法依赖于专注于分割舌下静脉而不是整个舌根。为了实现这一目标，我们开发了一个利用息肉- pvt网络的舌下静脉分割框架，有效地消除了舌下静脉周围区域的噪声。此外，我们率先利用变压器为基础的方法，如天鹅变压器网络，提取舌下静脉特征，充分利用变压器网络的卓越能力。为了补充我们的方法，我们构建了一个完整的舌下静脉图像数据集，方便了舌下静脉的分割和分类。实验结果表明，结合舌下静脉分割的舌特征提取方法明显优于现有的舌特征提取技术。

{"title":"A Novel Framework for Tongue Feature Extraction Framework Based on Sublingual Vein Segmentation","authors":"Xiaohua Wan;Yulong Hu;Dehui Qiu;Juan Zhang;Xiaotong Wang;Fa Zhang;Bin Hu","doi":"10.1109/TNB.2024.3462461","DOIUrl":"10.1109/TNB.2024.3462461","url":null,"abstract":"The features of the sublingual veins, including swelling, varicose patterns, and cyanosis, are pivotal in differentiating symptoms and selecting treatments in Traditional Chinese Medicine (TCM) tongue diagnosis. These features serve as a crucial reflection of the human blood circulation status. Nevertheless, the automatic and precise extraction of sublingual vein features remains a formidable challenge, constrained by the scarcity of datasets for sublingual images and the interference of noise from non-tongue and non-sublingual vein elements. In this paper, we present an innovative tongue feature extraction method that relies on focusing specifically on segmenting the sublingual vein rather than the entire tongue base. To achieve this, we have developed a sublingual vein segmentation framework utilizing a Polyp-PVT network, effectively eliminating noise from the surrounding regions of the sublingual vein. Furthermore, we pioneer the utilization of a transformer-based approach, such as the Swin-Transformer network, to extract sublingual vein features, leveraging the remarkable capabilities of transformer networks. To complement our methodology, we have constructed a comprehensive dataset of sublingual vein images, facilitating the segmentation and classification of sublingual veins. Experimental results have demonstrated that our tongue feature extraction method, coupled with sublingual vein segmentation, significantly outperforms existing tongue feature extraction techniques.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"24 3","pages":"269-279"},"PeriodicalIF":3.7,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Molecular Communication-Based Intelligent Dopamine Rate Modulator for Parkinson’s Disease Treatment 用于帕金森病治疗的基于分子通讯的智能多巴胺速率调节器

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-09-12 DOI: 10.1109/TNB.2024.3456031

Elham Baradari;Ozgur B. Akan

Parkinson’s disease (PD) is a progressive neurodegenerative disease, and it is caused by the loss of dopaminergic neurons in the basal ganglia (BG). Currently, there is no definite cure for PD, and available treatments mainly aim to alleviate its symptoms. Due to impaired neurotransmitter-based information transmission in PD, molecular communication-based approaches can be employed as potential solutions to address this issue. Molecular Communications (MC) is a bio-inspired communication method utilizing molecules to carry information. This mode of communication stands out for developing bio-compatible nanomachines for diagnosing and treating, particularly in addressing neurodegenerative diseases like PD, due to its compatibility with biological systems. This study presents a novel treatment method that introduces an Intelligent Dopamine Rate Modulator (IDRM), which is located in the synaptic gap between the substantia nigra pars compacta (SNc) and striatum to compensate for insufficiency dopamine release in BG caused by PD. For storing dopamine in the IDRM, dopamine compound (DAC) is swallowed and crossed through the digestive system, blood circulatory system, blood-brain barrier (BBB), and brain extracellular matrix uptakes with IDRMs. Here, the DAC concentration is calculated in these regions, revealing that the required exogenous dopamine consistently reaches IDRM. Therefore, the perpetual dopamine insufficiency in BG associated with PD can be compensated. This method reduces drug side effects because dopamine is not released in other brain regions. Unlike other treatments, this approach targets the root cause of PD rather than just reducing symptoms.

帕金森病（PD）是一种进行性神经退行性疾病，它是由基底神经节（BG）多巴胺能神经元的丧失引起的。目前，帕金森病还没有确切的治愈方法，现有的治疗方法主要是缓解其症状。由于PD中基于神经递质的信息传递受损，基于分子通信的方法可以作为解决这一问题的潜在解决方案。分子通信（MC）是一种利用分子携带信息的仿生通信方法。由于与生物系统的兼容性，这种通信模式在开发用于诊断和治疗的生物相容性纳米机器方面脱颖而出，特别是在解决神经退行性疾病（如PD）方面。本研究提出了一种新的治疗方法，即在黑质致密部（SNc）和纹状体之间的突触间隙引入智能多巴胺速率调节剂（IDRM），以补偿PD引起的BG多巴胺释放不足。为了在IDRM中储存多巴胺，多巴胺化合物（DAC）被吞下并与IDRM一起通过消化系统、血液循环系统、血脑屏障（BBB）和脑细胞外基质摄取。在这里，DAC浓度在这些区域计算，揭示所需的外源性多巴胺一致达到IDRM。因此，与PD相关的BG持续多巴胺不足可以得到补偿。这种方法减少了药物的副作用，因为多巴胺不会在大脑的其他区域释放。与其他治疗方法不同，这种方法针对的是帕金森病的根本原因，而不仅仅是减轻症状。

{"title":"Molecular Communication-Based Intelligent Dopamine Rate Modulator for Parkinson’s Disease Treatment","authors":"Elham Baradari;Ozgur B. Akan","doi":"10.1109/TNB.2024.3456031","DOIUrl":"10.1109/TNB.2024.3456031","url":null,"abstract":"Parkinson’s disease (PD) is a progressive neurodegenerative disease, and it is caused by the loss of dopaminergic neurons in the basal ganglia (BG). Currently, there is no definite cure for PD, and available treatments mainly aim to alleviate its symptoms. Due to impaired neurotransmitter-based information transmission in PD, molecular communication-based approaches can be employed as potential solutions to address this issue. Molecular Communications (MC) is a bio-inspired communication method utilizing molecules to carry information. This mode of communication stands out for developing bio-compatible nanomachines for diagnosing and treating, particularly in addressing neurodegenerative diseases like PD, due to its compatibility with biological systems. This study presents a novel treatment method that introduces an Intelligent Dopamine Rate Modulator (IDRM), which is located in the synaptic gap between the substantia nigra pars compacta (SNc) and striatum to compensate for insufficiency dopamine release in BG caused by PD. For storing dopamine in the IDRM, dopamine compound (DAC) is swallowed and crossed through the digestive system, blood circulatory system, blood-brain barrier (BBB), and brain extracellular matrix uptakes with IDRMs. Here, the DAC concentration is calculated in these regions, revealing that the required exogenous dopamine consistently reaches IDRM. Therefore, the perpetual dopamine insufficiency in BG associated with PD can be compensated. This method reduces drug side effects because dopamine is not released in other brain regions. Unlike other treatments, this approach targets the root cause of PD rather than just reducing symptoms.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"24 2","pages":"136-144"},"PeriodicalIF":3.7,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

State Observer Synchronization of Three-Dimensional Chaotic Oscillatory Systems Based on DNA Strand Displacement 基于 DNA 链位移的三维混沌振荡系统的状态观测器同步化

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-09-11 DOI: 10.1109/TNB.2024.3457755

Zicheng Wang;Haojie Wang;Yanfeng Wang;Junwei Sun

Currently, DNA strand displacement (DSD) as the theoretical basis of DNA chemical reaction networks (CRNs) has promoted the development of chaotic synchronization technique. This paper introduces the synchronization technology of two isomorphic three-dimensional chaotic systems based on DNA strand displacement under state observer. By studying the theoretical knowledge of DNA molecules, multiple DSD reactions are used to construct three-dimensional chaotic system. Based on two isomorphic chaotic systems, the linear transformation system and the state observer system are designed according to the theory of state observer construction. In addition, in order to realize the synchronization of chaotic systems, a coupling controller is designed between the drive system and the linear transformation system, and a soft variable-structure controller is designed between the state observer system and the response system. Through multiple DSD reactions, the chemical reaction networks of four chaotic systems and two controllers are constructed, and they are cascaded to realize the synchronization of two isomorphic three-dimensional chaotic systems. Numerical simulations verify the effectiveness and robustness of the scheme. Our work will extend and provide a reference for new methods to achieve synchronization of chaotic systems using DSD.

目前，DNA链位移（DSD）作为DNA化学反应网络（crn）的理论基础，促进了混沌同步技术的发展。介绍了在状态观测器下基于DNA链位移的两个同构三维混沌系统的同步技术。通过研究DNA分子的理论知识，利用多重DSD反应构建三维混沌系统。基于两个同构混沌系统，根据状态观测器构造理论，设计了线性变换系统和状态观测器系统。此外，为了实现混沌系统的同步，在驱动系统和线性变换系统之间设计了耦合控制器，在状态观测器系统和响应系统之间设计了软变结构控制器。通过多个DSD反应，构建了四个混沌系统和两个控制器的化学反应网络，并将它们级联，实现了两个同构三维混沌系统的同步。数值仿真验证了该方案的有效性和鲁棒性。我们的工作将为利用DSD实现混沌系统同步的新方法提供扩展和参考。

{"title":"State Observer Synchronization of Three-Dimensional Chaotic Oscillatory Systems Based on DNA Strand Displacement","authors":"Zicheng Wang;Haojie Wang;Yanfeng Wang;Junwei Sun","doi":"10.1109/TNB.2024.3457755","DOIUrl":"10.1109/TNB.2024.3457755","url":null,"abstract":"Currently, DNA strand displacement (DSD) as the theoretical basis of DNA chemical reaction networks (CRNs) has promoted the development of chaotic synchronization technique. This paper introduces the synchronization technology of two isomorphic three-dimensional chaotic systems based on DNA strand displacement under state observer. By studying the theoretical knowledge of DNA molecules, multiple DSD reactions are used to construct three-dimensional chaotic system. Based on two isomorphic chaotic systems, the linear transformation system and the state observer system are designed according to the theory of state observer construction. In addition, in order to realize the synchronization of chaotic systems, a coupling controller is designed between the drive system and the linear transformation system, and a soft variable-structure controller is designed between the state observer system and the response system. Through multiple DSD reactions, the chemical reaction networks of four chaotic systems and two controllers are constructed, and they are cascaded to realize the synchronization of two isomorphic three-dimensional chaotic systems. Numerical simulations verify the effectiveness and robustness of the scheme. Our work will extend and provide a reference for new methods to achieve synchronization of chaotic systems using DSD.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"24 2","pages":"145-156"},"PeriodicalIF":3.7,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Strategic Multi-Omics Data Integration via Multi-Level Feature Contrasting and Matching 通过多层次特征对比和匹配实现战略性多传感器数据整合

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-09-10 DOI: 10.1109/TNB.2024.3456797

Jinli Zhang;Hongwei Ren;Zongli Jiang;Zheng Chen;Ziwei Yang;Yasuko Matsubara;Yasushi Sakurai

The analysis and comprehension of multi-omics data has emerged as a prominent topic in the field of bioinformatics and data science. However, the sparsity characteristics and high dimensionality of omics data pose difficulties in terms of extracting meaningful information. Moreover, the heterogeneity inherent in multiple omics sources makes the effective integration of multi-omics data challenging To tackle these challenges, we propose MFCC-SAtt, a multi-level feature contrast clustering model based on self-attention to extract informative features from multi-omics data. MFCC-SAtt treats each omics type as a distinct modality and employs autoencoders with self-attention for each modality to integrate and compress their respective features into a shared feature space. By utilizing a multi-level feature extraction framework along with incorporating a semantic information extractor, we mitigate optimization conflicts arising from different learning objectives. Additionally, MFCC-SAtt guides deep clustering based on multi-level features which further enhances the quality of output labels. By conducting extensive experiments on multi-omics data, we have validated the exceptional performance of MFCC-SAtt. For instance, in a pan-cancer clustering task, MFCC-SAtt achieved an accuracy of over 80.38%.

多组学数据的分析和理解已成为生物信息学和数据科学领域的一个重要课题。然而，omics 数据的稀疏性和高维性给提取有意义的信息带来了困难。为了应对这些挑战，我们提出了基于自我关注的多级特征对比聚类模型 MFCC-SAtt，以从多组学数据中提取信息特征。MFCC-SAtt 将每种 omics 类型视为一种不同的模态，并针对每种模态采用具有自我注意功能的自动编码器，将它们各自的特征整合并压缩到一个共享特征空间中。通过利用多层次特征提取框架和语义信息提取器，我们缓解了不同学习目标带来的优化冲突。此外，MFCC-SAtt 还能引导基于多层次特征的深度聚类，从而进一步提高输出标签的质量。通过在多组学数据上进行大量实验，我们验证了 MFCC-SAtt 的卓越性能。例如，在泛癌症聚类任务中，MFCC-SAtt 的准确率超过了 80.38%。

{"title":"Strategic Multi-Omics Data Integration via Multi-Level Feature Contrasting and Matching","authors":"Jinli Zhang;Hongwei Ren;Zongli Jiang;Zheng Chen;Ziwei Yang;Yasuko Matsubara;Yasushi Sakurai","doi":"10.1109/TNB.2024.3456797","DOIUrl":"10.1109/TNB.2024.3456797","url":null,"abstract":"The analysis and comprehension of multi-omics data has emerged as a prominent topic in the field of bioinformatics and data science. However, the sparsity characteristics and high dimensionality of omics data pose difficulties in terms of extracting meaningful information. Moreover, the heterogeneity inherent in multiple omics sources makes the effective integration of multi-omics data challenging To tackle these challenges, we propose MFCC-SAtt, a multi-level feature contrast clustering model based on self-attention to extract informative features from multi-omics data. MFCC-SAtt treats each omics type as a distinct modality and employs autoencoders with self-attention for each modality to integrate and compress their respective features into a shared feature space. By utilizing a multi-level feature extraction framework along with incorporating a semantic information extractor, we mitigate optimization conflicts arising from different learning objectives. Additionally, MFCC-SAtt guides deep clustering based on multi-level features which further enhances the quality of output labels. By conducting extensive experiments on multi-omics data, we have validated the exceptional performance of MFCC-SAtt. For instance, in a pan-cancer clustering task, MFCC-SAtt achieved an accuracy of over 80.38%.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"23 4","pages":"579-590"},"PeriodicalIF":3.7,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Design and Performance Evaluation of Machine Learning-Based Terahertz Metasurface Chemical Sensor 基于机器学习的太赫兹元表面化学传感器的设计与性能评估。

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-09-03 DOI: 10.1109/TNB.2024.3453372

Abdullah Baz;Jacob Wekalao;Ngaira Mandela;Shobhit K. Patel

This paper presents a terahertz metasurface based sensor design incorporating graphene and other plasmonic materials for highly sensitive detection of different chemicals. The proposed sensor employs the combination of multiple resonator designs - including circular and square ring resonators - to attain enhanced sensitivity among other performance parameters. Machine learning techniques like Random Forest regression, are employed to enhance the sensor design and predict its performance. The optimized sensor demonstrates excellent sensitivity of 417 GHzRIU

$^{mathbf {-{1}}}$

and a low detection limit of 0.264 RIU for ethanol and benzene detection. Furthermore, the integration of machine learning cuts down the simulation time and computational requirements by approximately 90% without compromising accuracy. The sensor’s unique design and performance characteristics, including its high-quality factor of 14.476, position it as a promising candidate for environmental monitoring and chemical sensing applications. Moreover, it also demonstrates potential for 2-bit encoding applications through strategic modulation of graphene chemical potential values. On the other hand, it also shows prospects of 2-bit encoding applications via the modulation of graphene chemical. This work provides a major advancement to the terahertz sensing application by proposing new materials, structures, and methods in computation in order to develop a high-performance chemical sensor.

本文介绍了一种基于太赫兹元表面的传感器设计，其中结合了石墨烯和其他等离子材料，用于高灵敏度地检测不同的化学物质。所提出的传感器采用了多种谐振器设计（包括圆形和方形环形谐振器）的组合，以提高灵敏度和其他性能参数。采用随机森林回归等机器学习技术来增强传感器设计并预测其性能。优化后的传感器灵敏度高达 417 GHzRIU-1，乙醇和苯的检测限低至 0.264 RIU。此外，在不影响精度的情况下，机器学习的集成将模拟时间和计算要求减少了约 90%。该传感器独特的设计和性能特点，包括 14.476 的高质量系数，使其成为环境监测和化学传感应用的理想候选产品。此外，通过对石墨烯化学势值进行策略性调制，它还展示了 2 位编码应用的潜力。另一方面，它还展示了通过调制石墨烯化学势值进行 2 位编码的应用前景。这项工作通过提出新材料、新结构和新计算方法，为太赫兹传感应用提供了重大进展，从而开发出高性能化学传感器。

{"title":"Design and Performance Evaluation of Machine Learning-Based Terahertz Metasurface Chemical Sensor","authors":"Abdullah Baz;Jacob Wekalao;Ngaira Mandela;Shobhit K. Patel","doi":"10.1109/TNB.2024.3453372","DOIUrl":"10.1109/TNB.2024.3453372","url":null,"abstract":"This paper presents a terahertz metasurface based sensor design incorporating graphene and other plasmonic materials for highly sensitive detection of different chemicals. The proposed sensor employs the combination of multiple resonator designs - including circular and square ring resonators - to attain enhanced sensitivity among other performance parameters. Machine learning techniques like Random Forest regression, are employed to enhance the sensor design and predict its performance. The optimized sensor demonstrates excellent sensitivity of 417 GHzRIU<inline-formula> <tex-math>$^{mathbf {-{1}}}$ </tex-math></inline-formula> and a low detection limit of 0.264 RIU for ethanol and benzene detection. Furthermore, the integration of machine learning cuts down the simulation time and computational requirements by approximately 90% without compromising accuracy. The sensor’s unique design and performance characteristics, including its high-quality factor of 14.476, position it as a promising candidate for environmental monitoring and chemical sensing applications. Moreover, it also demonstrates potential for 2-bit encoding applications through strategic modulation of graphene chemical potential values. On the other hand, it also shows prospects of 2-bit encoding applications via the modulation of graphene chemical. This work provides a major advancement to the terahertz sensing application by proposing new materials, structures, and methods in computation in order to develop a high-performance chemical sensor.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"24 2","pages":"128-135"},"PeriodicalIF":3.7,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142125606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Representation Learning Approach for Predicting circRNA Back-Splicing Event via Sequence-Interaction-Aware Dual Encoder 通过序列交互感知双编码器预测 circRNA 回接事件的表征学习方法

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-09-03 DOI: 10.1109/TNB.2024.3454079

Chengxin He;Lei Duan;Huiru Zheng;Xinye Wang;Lili Guan;Jiaxuan Xu

Circular RNAs (circRNAs) play a crucial role in gene regulation and association with diseases because of their unique closed continuous loop structure, which is more stable and conserved than ordinary linear RNAs. As fundamental work to clarify their functions, a large number of computational approaches for identifying circRNA formation have been proposed. However, these methods fail to fully utilize the important characteristics of back-splicing events, i.e., the positional information of the splice sites and the interaction features of its flanking sequences, for predicting circRNAs. To this end, we hereby propose a novel approach called SIDE for predicting circRNA back-splicing events using only raw RNA sequences. Technically, SIDE employs a dual encoder to capture global and interactive features of the RNA sequence, and then a decoder designed by the contrastive learning to fuse out discriminative features improving the prediction of circRNAs formation. Empirical results on three real-world datasets show the effectiveness of SIDE. Further analysis also reveals that the effectiveness of SIDE.

环状 RNA（circRNA）因其独特的闭合连续环状结构而在基因调控和疾病相关方面发挥着至关重要的作用，这种结构比普通线性 RNA 更稳定、更保守。作为阐明其功能的基础性工作，人们提出了大量识别 circRNA 形成的计算方法。然而，这些方法未能充分利用反向剪接事件的重要特征，即剪接位点的位置信息及其侧翼序列的相互作用特征来预测 circRNA。为此，我们提出了一种名为 SIDE 的新方法，仅利用原始 RNA 序列预测 circRNA 的反向剪接事件。在技术上，SIDE 采用双重编码器捕捉 RNA 序列的全局和交互特征，然后通过对比学习设计解码器，融合出辨别特征，从而提高 circRNA 形成的预测能力。在三个真实世界数据集上的实证结果表明了 SIDE 的有效性。进一步的分析还显示了 SIDE 的有效性。

{"title":"A Representation Learning Approach for Predicting circRNA Back-Splicing Event via Sequence-Interaction-Aware Dual Encoder","authors":"Chengxin He;Lei Duan;Huiru Zheng;Xinye Wang;Lili Guan;Jiaxuan Xu","doi":"10.1109/TNB.2024.3454079","DOIUrl":"10.1109/TNB.2024.3454079","url":null,"abstract":"Circular RNAs (circRNAs) play a crucial role in gene regulation and association with diseases because of their unique closed continuous loop structure, which is more stable and conserved than ordinary linear RNAs. As fundamental work to clarify their functions, a large number of computational approaches for identifying circRNA formation have been proposed. However, these methods fail to fully utilize the important characteristics of back-splicing events, i.e., the positional information of the splice sites and the interaction features of its flanking sequences, for predicting circRNAs. To this end, we hereby propose a novel approach called SIDE for predicting circRNA back-splicing events using only raw RNA sequences. Technically, SIDE employs a dual encoder to capture global and interactive features of the RNA sequence, and then a decoder designed by the contrastive learning to fuse out discriminative features improving the prediction of circRNAs formation. Empirical results on three real-world datasets show the effectiveness of SIDE. Further analysis also reveals that the effectiveness of SIDE.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"23 4","pages":"603-611"},"PeriodicalIF":3.7,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142125605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multiple Heterogeneous Networks Representation With Latent Space for Synthetic Lethality Prediction 利用潜空间的多重异构网络表示法进行合成致死率预测

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-08-16 DOI: 10.1109/TNB.2024.3444922

Xiangjin Hu;Haoran Yi;Hao Cheng;Yijing Zhao;Dongqi Zhang;Jinxin Li;Jingjing Ruan;Jin Zhang;Xinguo Lu

Computational synthetic lethality (SL) method has become a promising strategy to identify SL gene pairs for targeted cancer therapy and cancer medicine development. Feature representation for integrating various biological networks is crutial to improve the identification performance. However, previous feature representation, such as matrix factorization and graph neural network, projects gene features onto latent variables by keeping a specific geometric metric. There is a lack of models of gene representational latent space with considerating multiple dimentionalities correlation and preserving latent geometric structures in both sample and feature spaces. Therefore, we propose a novel method to model gene Latent Space using matrix Tri-Factorization (LSTF) to obtain gene representation with embedding variables resulting from the potential interpretation of synthetic lethality. Meanwhile, manifold subspace regularization is applied to the tri-factorization to capture the geometrical manifold structure in the latent space with gene PPI functional and GO semantic embeddings. Then, SL gene pairs are identified by the reconstruction of the associations with gene representations in the latent space. The experimental results illustrate that LSTF is superior to other state-of-the-art methods. Case study demonstrate the effectiveness of the predicted SL associations.

计算合成致死率（SL）方法已成为为癌症靶向治疗和癌症药物开发识别SL基因对的一种有前途的策略。整合各种生物网络的特征表示对于提高识别性能至关重要。然而，以往的特征表示方法，如矩阵因式分解和图神经网络，都是通过保持特定的几何度量将基因特征投射到潜在变量上。目前还缺乏同时考虑多维度相关性和保留样本空间与特征空间中潜在几何结构的基因表征潜在空间模型。因此，我们提出了一种利用矩阵三因子化（LSTF）对基因潜空间进行建模的新方法，以获得具有合成致死率潜在解释所产生的嵌入变量的基因表征。同时，将流形子空间正则化应用于三因子化，以捕捉潜空间中带有基因 PPI 功能嵌入和 GO 语义嵌入的几何流形结构。然后，通过重建潜空间中与基因表征的关联来识别 SL 基因对。实验结果表明，LSTF 优于其他最先进的方法。案例研究证明了预测 SL 关联的有效性。

{"title":"Multiple Heterogeneous Networks Representation With Latent Space for Synthetic Lethality Prediction","authors":"Xiangjin Hu;Haoran Yi;Hao Cheng;Yijing Zhao;Dongqi Zhang;Jinxin Li;Jingjing Ruan;Jin Zhang;Xinguo Lu","doi":"10.1109/TNB.2024.3444922","DOIUrl":"10.1109/TNB.2024.3444922","url":null,"abstract":"Computational synthetic lethality (SL) method has become a promising strategy to identify SL gene pairs for targeted cancer therapy and cancer medicine development. Feature representation for integrating various biological networks is crutial to improve the identification performance. However, previous feature representation, such as matrix factorization and graph neural network, projects gene features onto latent variables by keeping a specific geometric metric. There is a lack of models of gene representational latent space with considerating multiple dimentionalities correlation and preserving latent geometric structures in both sample and feature spaces. Therefore, we propose a novel method to model gene Latent Space using matrix Tri-Factorization (LSTF) to obtain gene representation with embedding variables resulting from the potential interpretation of synthetic lethality. Meanwhile, manifold subspace regularization is applied to the tri-factorization to capture the geometrical manifold structure in the latent space with gene PPI functional and GO semantic embeddings. Then, SL gene pairs are identified by the reconstruction of the associations with gene representations in the latent space. The experimental results illustrate that LSTF is superior to other state-of-the-art methods. Case study demonstrate the effectiveness of the predicted SL associations.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"23 4","pages":"564-571"},"PeriodicalIF":3.7,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Improved Framework for Drug-Side Effect Associations Prediction via Counterfactual Inference-Based Data Augmentation 通过基于反事实推理的数据扩充，改进药物副作用关联预测框架。

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-08-14 DOI: 10.1109/TNB.2024.3443244

Wenjie Yao;Ankang Wei;Zhen Xiao;Weizhong Zhao;Xianjun Shen;Xingpeng Jiang;Tingting He

Detecting side effects of drugs is a fundamental task in drug development. With the expansion of publicly available biomedical data, researchers have proposed many computational methods for predicting drug-side effect associations (DSAs), among which network-based methods attract wide attention in the biomedical field. However, the problem of data scarcity poses a great challenge for existing DSAs prediction models. Although several data augmentation methods have been proposed to address this issue, most of existing methods employ a random way to manipulate the original networks, which ignores the causality of existence of DSAs, leading to the poor performance on the task of DSAs prediction. In this paper, we propose a counterfactual inference-based data augmentation method for improving the performance of the task. First, we construct a heterogeneous information network (HIN) by integrating multiple biomedical data. Based on the community detection on the HIN, a counterfactual inference-based method is designed to derive augmented links, and an augmented HIN is obtained accordingly. Then, a meta-path-based graph neural network is applied to learn high-quality representations of drugs and side effects, on which the predicted DSAs are obtained. Finally, comprehensive experiments are conducted, and the results demonstrate the effectiveness of the proposed counterfactual inference-based data augmentation for the task of DSAs prediction.

检测药物的副作用是药物开发的一项基本任务。随着公开生物医学数据的增加，研究人员提出了许多预测药物副作用关联（DSAs）的计算方法，其中基于网络的方法在生物医学领域受到广泛关注。然而，数据稀缺问题给现有的 DSAs 预测模型带来了巨大挑战。虽然已经有多种数据增强方法被提出来解决这一问题，但现有方法大多采用随机的方式来处理原始网络，忽略了 DSA 存在的因果关系，导致 DSA 预测效果不佳。本文提出了一种基于反事实推理的数据增强方法，以提高任务的性能。首先，我们通过整合多个生物医学数据构建了一个异构信息网络（HIN）。在对 HIN 进行群落检测的基础上，我们设计了一种基于反事实推理的方法来推导增强链接，并相应地得到了一个增强的 HIN。然后，应用基于元路径的图神经网络学习药物和副作用的高质量表征，并在此基础上获得预测的 DSA。最后，我们进行了综合实验，结果证明了所提出的基于反事实推理的数据增强方法在 DSAs 预测任务中的有效性。

{"title":"An Improved Framework for Drug-Side Effect Associations Prediction via Counterfactual Inference-Based Data Augmentation","authors":"Wenjie Yao;Ankang Wei;Zhen Xiao;Weizhong Zhao;Xianjun Shen;Xingpeng Jiang;Tingting He","doi":"10.1109/TNB.2024.3443244","DOIUrl":"10.1109/TNB.2024.3443244","url":null,"abstract":"Detecting side effects of drugs is a fundamental task in drug development. With the expansion of publicly available biomedical data, researchers have proposed many computational methods for predicting drug-side effect associations (DSAs), among which network-based methods attract wide attention in the biomedical field. However, the problem of data scarcity poses a great challenge for existing DSAs prediction models. Although several data augmentation methods have been proposed to address this issue, most of existing methods employ a random way to manipulate the original networks, which ignores the causality of existence of DSAs, leading to the poor performance on the task of DSAs prediction. In this paper, we propose a counterfactual inference-based data augmentation method for improving the performance of the task. First, we construct a heterogeneous information network (HIN) by integrating multiple biomedical data. Based on the community detection on the HIN, a counterfactual inference-based method is designed to derive augmented links, and an augmented HIN is obtained accordingly. Then, a meta-path-based graph neural network is applied to learn high-quality representations of drugs and side effects, on which the predicted DSAs are obtained. Finally, comprehensive experiments are conducted, and the results demonstrate the effectiveness of the proposed counterfactual inference-based data augmentation for the task of DSAs prediction.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"23 4","pages":"540-547"},"PeriodicalIF":3.7,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141982181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Ontology-Based Data Collection for a Hybrid Outbreak Detection Method Using Social Media 利用社交媒体的混合疫情检测方法基于本体的数据收集。

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-08-13 DOI: 10.1109/TNB.2024.3442912

Ghazaleh Babanejaddehaki;Aijun An;Heidar Davoudi

Given the persistent global challenge presented by rapidly spreading diseases, as evidenced notably by the widespread impact of the COVID-19 pandemic on both human health and economies worldwide, the necessity of developing effective infectious disease prediction models has become of utmost importance. In this context, the utilization of online social media platforms as valuable tools in healthcare settings has gained prominence, offering direct avenues for disseminating critical health information to the public in a timely and accessible manner. Propelled by the ubiquitous accessibility of the internet through computers and mobile devices, these platforms promise to revolutionize traditional detection methods, providing more immediate and reliable epidemiological insights. Leveraging this paradigm shift, our proposed framework harnesses Twitter data associated with infectious disease symptoms, employing ontology to identify and curate relevant tweets. Central to our methodology is a hybrid model that integrates XGBoost and Bidirectional Long Short-Term Memory (BiLSTM) architectures. The integration of XGBoost addresses the challenge of handling small dataset sizes, inherent during outbreaks due to limited time series data. XGBoost serves as a cornerstone for minimizing the loss function and identifying optimal features from our multivariate time series data. Subsequently, the combined dataset, comprising original features and predicted values by XGBoost, is channeled into the BiLSTM for further processing. Through extensive experimentation with a dataset spanning multiple infectious disease outbreaks, our hybrid model demonstrates superior predictive performance compared to state-of-the-art and baseline models. By enhancing forecasting accuracy and outbreak tracking capabilities, our model offers promising prospects for assisting health authorities in mitigating fatalities and proactively preparing for potential outbreaks.

鉴于快速传播的疾病所带来的持续性全球性挑战，特别是 COVID-19 大流行病对全球人类健康和经济造成的广泛影响，开发有效的传染病预测模型已变得极为重要。在此背景下，网络社交媒体平台作为医疗保健领域的重要工具，为及时、便捷地向公众传播重要的健康信息提供了直接途径。在通过电脑和移动设备无处不在地访问互联网的推动下，这些平台有望彻底改变传统的检测方法，提供更即时、更可靠的流行病学见解。利用这一模式转变，我们提出的框架利用与传染病症状相关的 Twitter 数据，采用本体论来识别和整理相关推文。我们方法的核心是一个混合模型，它集成了 XGBoost 和双向长短期记忆（BiLSTM）架构。XGBoost 的集成解决了处理小数据集的难题，这是在爆发期间因时间序列数据有限而固有的问题。XGBoost 是最小化损失函数和从多元时间序列数据中识别最佳特征的基石。随后，由原始特征和 XGBoost 预测值组成的组合数据集被导入 BiLSTM 进行进一步处理。通过对跨越多种传染病爆发的数据集进行广泛实验，我们的混合模型与最先进的模型和基线模型相比，显示出了卓越的预测性能。通过提高预测准确性和疫情跟踪能力，我们的模型有望协助卫生部门减少死亡人数，并为潜在的疫情爆发做好积极准备。

{"title":"Ontology-Based Data Collection for a Hybrid Outbreak Detection Method Using Social Media","authors":"Ghazaleh Babanejaddehaki;Aijun An;Heidar Davoudi","doi":"10.1109/TNB.2024.3442912","DOIUrl":"10.1109/TNB.2024.3442912","url":null,"abstract":"Given the persistent global challenge presented by rapidly spreading diseases, as evidenced notably by the widespread impact of the COVID-19 pandemic on both human health and economies worldwide, the necessity of developing effective infectious disease prediction models has become of utmost importance. In this context, the utilization of online social media platforms as valuable tools in healthcare settings has gained prominence, offering direct avenues for disseminating critical health information to the public in a timely and accessible manner. Propelled by the ubiquitous accessibility of the internet through computers and mobile devices, these platforms promise to revolutionize traditional detection methods, providing more immediate and reliable epidemiological insights. Leveraging this paradigm shift, our proposed framework harnesses Twitter data associated with infectious disease symptoms, employing ontology to identify and curate relevant tweets. Central to our methodology is a hybrid model that integrates XGBoost and Bidirectional Long Short-Term Memory (BiLSTM) architectures. The integration of XGBoost addresses the challenge of handling small dataset sizes, inherent during outbreaks due to limited time series data. XGBoost serves as a cornerstone for minimizing the loss function and identifying optimal features from our multivariate time series data. Subsequently, the combined dataset, comprising original features and predicted values by XGBoost, is channeled into the BiLSTM for further processing. Through extensive experimentation with a dataset spanning multiple infectious disease outbreaks, our hybrid model demonstrates superior predictive performance compared to state-of-the-art and baseline models. By enhancing forecasting accuracy and outbreak tracking capabilities, our model offers promising prospects for assisting health authorities in mitigating fatalities and proactively preparing for potential outbreaks.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"23 4","pages":"591-602"},"PeriodicalIF":3.7,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141975609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Controllability Reinforcement Learning Method for Pancreatic Cancer Biomarker Identification 胰腺癌生物标记物识别的可控性强化学习方法

IF 3.7 4区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

IEEE Transactions on NanoBioscience

Pub Date : 2024-08-12 DOI: 10.1109/TNB.2024.3441689

Yan Wang;Jie Hong;Yuting Lu;Nan Sheng;Yuan Fu;Lili Yang;Lingyu Meng;Lan Huang;Hao Wang

Pancreatic cancer is one of the most malignant cancers with rapid progression and poor prognosis. The use of transcriptional data can be effective in finding new biomarkers for pancreatic cancer. Many network-based methods used to identify cancer biomarkers are proposed, among which the combination of network controllability appears. However, most of the existing methods do not study RNA, rely on priori and mutations information, or can only achieve classification tasks. In this study, we propose a method combined Relational Graph Convolutional Network and Deep Q-Network called RDDriver to identify pancreatic cancer biomarkers based on multi-layer heterogeneous transcriptional regulation network. Firstly, we construct a regulation network containing long non-coding RNA, microRNA, and messenger RNA. Secondly, Relational Graph Convolutional Network is used to learn the node representation. Finally, we use the idea of Deep Q-Network to build a model, which score and prioritize each RNA with the Popov-Belevitch-Hautus criterion. We train RDDriver on three small simulated networks, and calculate the average score after applying the model parameters to the regulation networks separately. To demonstrate the effectiveness of the method, we perform experiments for comparison between RDDriver and other eight methods based on the approximate benchmark of three types cancer drivers RNAs.

胰腺癌是恶性程度最高的癌症之一，病情发展快，预后差。利用转录数据可以有效地找到胰腺癌的新生物标志物。人们提出了许多基于网络的癌症生物标记物识别方法，其中包括网络可控性组合。然而，现有方法大多不研究 RNA，依赖先验信息和突变信息，或只能完成分类任务。在本研究中，我们提出了一种结合关系图卷积网络和深度 Q 网络的方法，称为 RDDriver，用于识别基于多层异构转录调控网络的胰腺癌生物标记物。首先，我们构建了一个包含长非编码 RNA、microRNA 和信使 RNA 的调控网络。其次，使用关系图卷积网络学习节点表示。最后，我们利用深度 Q 网络的思想建立了一个模型，用 Popov-Belevitch-Hautus 准则对每个 RNA 进行评分和优先排序。我们在三个小型模拟网络上训练 RDDriver，并在将模型参数分别应用于调控网络后计算平均得分。为了证明该方法的有效性，我们以三种癌症驱动 RNA 为近似基准，进行了 RDDriver 与其他八种方法的比较实验。

{"title":"A Controllability Reinforcement Learning Method for Pancreatic Cancer Biomarker Identification","authors":"Yan Wang;Jie Hong;Yuting Lu;Nan Sheng;Yuan Fu;Lili Yang;Lingyu Meng;Lan Huang;Hao Wang","doi":"10.1109/TNB.2024.3441689","DOIUrl":"10.1109/TNB.2024.3441689","url":null,"abstract":"Pancreatic cancer is one of the most malignant cancers with rapid progression and poor prognosis. The use of transcriptional data can be effective in finding new biomarkers for pancreatic cancer. Many network-based methods used to identify cancer biomarkers are proposed, among which the combination of network controllability appears. However, most of the existing methods do not study RNA, rely on priori and mutations information, or can only achieve classification tasks. In this study, we propose a method combined Relational Graph Convolutional Network and Deep Q-Network called RDDriver to identify pancreatic cancer biomarkers based on multi-layer heterogeneous transcriptional regulation network. Firstly, we construct a regulation network containing long non-coding RNA, microRNA, and messenger RNA. Secondly, Relational Graph Convolutional Network is used to learn the node representation. Finally, we use the idea of Deep Q-Network to build a model, which score and prioritize each RNA with the Popov-Belevitch-Hautus criterion. We train RDDriver on three small simulated networks, and calculate the average score after applying the model parameters to the regulation networks separately. To demonstrate the effectiveness of the method, we perform experiments for comparison between RDDriver and other eight methods based on the approximate benchmark of three types cancer drivers RNAs.","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"23 4","pages":"556-563"},"PeriodicalIF":3.7,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10633729","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141971020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0