Pub Date : 2025-03-14DOI: 10.1016/j.patter.2025.101208
Melania Muñoz-García, Amber Hartman Scholz
The UN Convention on Biological Diversity adopted new rules for sharing benefits from publicly available genetic sequence data, also known as digital sequence information (DSI). In this Opinion, the authors describe the key elements researchers need to be aware of, address real-life questions, and explain the practical implications of these rules for research and development.
{"title":"Navigating COP16's digital sequence information outcomes: What researchers need to do in practice.","authors":"Melania Muñoz-García, Amber Hartman Scholz","doi":"10.1016/j.patter.2025.101208","DOIUrl":"10.1016/j.patter.2025.101208","url":null,"abstract":"<p><p>The UN Convention on Biological Diversity adopted new rules for sharing benefits from publicly available genetic sequence data, also known as digital sequence information (DSI). In this Opinion, the authors describe the key elements researchers need to be aware of, address real-life questions, and explain the practical implications of these rules for research and development.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101208"},"PeriodicalIF":6.7,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963070/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-14DOI: 10.1016/j.patter.2025.101203
Kisan Thapa, Meric Kinali, Shichao Pei, Augustin Luna, Özgün Babur
High-throughput molecular profiling technologies have revolutionized molecular biology research in the past decades. One important use of molecular data is to make predictions of phenotypes and other features of the organisms using machine learning algorithms. Deep learning models have become increasingly popular for this task due to their ability to learn complex non-linear patterns. Applying deep learning to molecular profiles, however, is challenging due to the very high dimensionality of the data and relatively small sample sizes, causing models to overfit. A solution is to incorporate biological prior knowledge to guide the learning algorithm for processing the functionally related input together. This helps regularize the models and improve their generalizability and interpretability. Here, we describe three major strategies proposed to use prior knowledge in deep learning models to make predictions based on molecular profiles. We review the related deep learning architectures, including the major ideas in relatively new graph neural networks.
{"title":"Strategies to include prior knowledge in omics analysis with deep neural networks.","authors":"Kisan Thapa, Meric Kinali, Shichao Pei, Augustin Luna, Özgün Babur","doi":"10.1016/j.patter.2025.101203","DOIUrl":"10.1016/j.patter.2025.101203","url":null,"abstract":"<p><p>High-throughput molecular profiling technologies have revolutionized molecular biology research in the past decades. One important use of molecular data is to make predictions of phenotypes and other features of the organisms using machine learning algorithms. Deep learning models have become increasingly popular for this task due to their ability to learn complex non-linear patterns. Applying deep learning to molecular profiles, however, is challenging due to the very high dimensionality of the data and relatively small sample sizes, causing models to overfit. A solution is to incorporate biological prior knowledge to guide the learning algorithm for processing the functionally related input together. This helps regularize the models and improve their generalizability and interpretability. Here, we describe three major strategies proposed to use prior knowledge in deep learning models to make predictions based on molecular profiles. We review the related deep learning architectures, including the major ideas in relatively new graph neural networks.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101203"},"PeriodicalIF":6.7,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963003/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-14DOI: 10.1016/j.patter.2025.101206
Paul Trauttmansdorff, Kim M Hajek
{"title":"Data shadows: When data become tangible, material, and fragile.","authors":"Paul Trauttmansdorff, Kim M Hajek","doi":"10.1016/j.patter.2025.101206","DOIUrl":"https://doi.org/10.1016/j.patter.2025.101206","url":null,"abstract":"","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101206"},"PeriodicalIF":6.7,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963112/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-14DOI: 10.1016/j.patter.2025.101207
Jon Rueda, Txetxu Ausín, Mark Coeckelbergh, Juan Ignacio Del Valle, Francisco Lara, Belén Liedo, Joan Llorca Albareda, Heidi Mertes, Robert Ranisch, Vera Lúcia Raposo, Bernd C Stahl, Murilo Vilaça, Íñigo de Miguel
The concept of dignity is proliferating in ethical, legal, and policy discussions of AI, yet dignity is an elusive concept with multiple philosophical interpretations. The authors argue that the unspecific and uncritical employment of the notion of dignity can be counterproductive for AI ethics.
{"title":"Why dignity is a troubling concept for AI ethics.","authors":"Jon Rueda, Txetxu Ausín, Mark Coeckelbergh, Juan Ignacio Del Valle, Francisco Lara, Belén Liedo, Joan Llorca Albareda, Heidi Mertes, Robert Ranisch, Vera Lúcia Raposo, Bernd C Stahl, Murilo Vilaça, Íñigo de Miguel","doi":"10.1016/j.patter.2025.101207","DOIUrl":"10.1016/j.patter.2025.101207","url":null,"abstract":"<p><p>The concept of dignity is proliferating in ethical, legal, and policy discussions of AI, yet dignity is an elusive concept with multiple philosophical interpretations. The authors argue that the unspecific and uncritical employment of the notion of dignity can be counterproductive for AI ethics.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101207"},"PeriodicalIF":6.7,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963102/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-06eCollection Date: 2025-03-14DOI: 10.1016/j.patter.2025.101204
Alejandra Alvarado, Wanying Wang, Andrew L Hufton
{"title":"Affirming our commitment to diversity, equity, and inclusion.","authors":"Alejandra Alvarado, Wanying Wang, Andrew L Hufton","doi":"10.1016/j.patter.2025.101204","DOIUrl":"https://doi.org/10.1016/j.patter.2025.101204","url":null,"abstract":"","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101204"},"PeriodicalIF":6.7,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963069/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-03eCollection Date: 2025-03-14DOI: 10.1016/j.patter.2025.101187
Yuzhu Cai, Sheng Yin, Yuxi Wei, Chenxin Xu, Weibo Mao, Felix Juefei-Xu, Siheng Chen, Yanfeng Wang
The burgeoning landscape of text-to-image models, exemplified by innovations such as Midjourney and DALL·E 3, has revolutionized content creation across diverse sectors. However, these advances bring forth critical ethical concerns, particularly with the misuse of open-source models to generate content that violates societal norms. Addressing this, we introduce Ethical-Lens, a framework designed to facilitate the value-aligned usage of text-to-image tools without necessitating internal model revision. Ethical-Lens ensures value alignment in text-to-image models across toxicity and bias dimensions by refining user commands and rectifying model outputs. Systematic evaluation metrics, combining GPT4-V, HEIM, and FairFace scores, assess alignment capability. Our experiments reveal that Ethical-Lens enhances alignment capabilities to levels comparable with or superior to commercial models such as DALL E 3, while preserving the quality of generated images. This study indicates the potential of Ethical-Lens to promote the sustainable development of open-source text-to-image tools and their beneficial integration into society.
{"title":"Ethical-Lens: Curbing malicious usages of open-source text-to-image models.","authors":"Yuzhu Cai, Sheng Yin, Yuxi Wei, Chenxin Xu, Weibo Mao, Felix Juefei-Xu, Siheng Chen, Yanfeng Wang","doi":"10.1016/j.patter.2025.101187","DOIUrl":"https://doi.org/10.1016/j.patter.2025.101187","url":null,"abstract":"<p><p>The burgeoning landscape of text-to-image models, exemplified by innovations such as Midjourney and DALL·E 3, has revolutionized content creation across diverse sectors. However, these advances bring forth critical ethical concerns, particularly with the misuse of open-source models to generate content that violates societal norms. Addressing this, we introduce Ethical-Lens, a framework designed to facilitate the value-aligned usage of text-to-image tools without necessitating internal model revision. Ethical-Lens ensures value alignment in text-to-image models across toxicity and bias dimensions by refining user commands and rectifying model outputs. Systematic evaluation metrics, combining GPT4-V, HEIM, and FairFace scores, assess alignment capability. Our experiments reveal that Ethical-Lens enhances alignment capabilities to levels comparable with or superior to commercial models such as DALL <math><mrow><mo>·</mo></mrow> </math> E 3, while preserving the quality of generated images. This study indicates the potential of Ethical-Lens to promote the sustainable development of open-source text-to-image tools and their beneficial integration into society.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101187"},"PeriodicalIF":6.7,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963081/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-25eCollection Date: 2025-03-14DOI: 10.1016/j.patter.2025.101184
Chuanpeng Dong, Feifei Zhang, Emily He, Ping Ren, Nipun Verma, Xinxin Zhu, Di Feng, James Cai, Hongyu Zhao, Sidi Chen
Immunotherapies, including checkpoint blockade and chimeric antigen receptor T cell (CAR-T) therapy, have revolutionized cancer treatment; however, many patients remain unresponsive to these treatments or relapse following treatment. CRISPR screenings have been used to identify novel single gene targets that can enhance immunotherapy effectiveness, but the identification of combinational targets remains a challenge. Here, we introduce a computational approach that uses sgRNA set enrichment analysis to identify cancer-intrinsic paralog pairs for enhancing immunotherapy using genome-wide screens. We have further developed an ensemble learning model that uses an XGBoost classifier and incorporates features to predict paralog gene pairs that influence immunotherapy efficacy. We experimentally validated the functional significance of these predicted paralog pairs using CRISPR double knockout (DKO). These data and analyses collectively provide a sensitive approach to identifying previously undetected paralog gene pairs that can significantly affect cancer immunotherapy response, even when individual genes within the pair have limited effect.
{"title":"Sensitive detection of synthetic response to cancer immunotherapy driven by gene paralog pairs.","authors":"Chuanpeng Dong, Feifei Zhang, Emily He, Ping Ren, Nipun Verma, Xinxin Zhu, Di Feng, James Cai, Hongyu Zhao, Sidi Chen","doi":"10.1016/j.patter.2025.101184","DOIUrl":"10.1016/j.patter.2025.101184","url":null,"abstract":"<p><p>Immunotherapies, including checkpoint blockade and chimeric antigen receptor T cell (CAR-T) therapy, have revolutionized cancer treatment; however, many patients remain unresponsive to these treatments or relapse following treatment. CRISPR screenings have been used to identify novel single gene targets that can enhance immunotherapy effectiveness, but the identification of combinational targets remains a challenge. Here, we introduce a computational approach that uses sgRNA set enrichment analysis to identify cancer-intrinsic paralog pairs for enhancing immunotherapy using genome-wide screens. We have further developed an ensemble learning model that uses an XGBoost classifier and incorporates features to predict paralog gene pairs that influence immunotherapy efficacy. We experimentally validated the functional significance of these predicted paralog pairs using CRISPR double knockout (DKO). These data and analyses collectively provide a sensitive approach to identifying previously undetected paralog gene pairs that can significantly affect cancer immunotherapy response, even when individual genes within the pair have limited effect.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101184"},"PeriodicalIF":6.7,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963098/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gene fusions are common cancer drivers and therapeutic targets, but clinical-grade open-source bioinformatic tools are lacking. Here, we introduce a fusion detection method named SplitFusion, which is fast by leveraging Burrows-Wheeler Aligner-maximal exact match (BWA-MEM) split alignments, can detect cryptic splice-site fusions (e.g., EML4::ALK v3b and ARv7), call fusions involving highly repetitive gene partners (e.g., CIC::DUX4), and infer frame-ness and exon-boundary alignments for functional prediction and minimizing false positives. Using 1,848 datasets of various sizes, SplitFusion demonstrated superior sensitivity and specificity compared to three other tools. In 1,076 formalin-fixed paraffin-embedded lung cancer samples, SplitFusion identified novel fusions and revealed that EML4::ALK variant 3 was associated with multiple fusion variants coexisting in the same tumor. Additionally, SplitFusion can call targeted splicing variants. Using data from 515 The Cancer Genome Atlas (TCGA) samples, SplitFusion showed the highest sensitivity and uncovered two cases of SLC34A2::ROS1 that were missed in previous studies. These capabilities make SplitFusion highly suitable for clinical applications and the study of fusion-defined tumor heterogeneity.
{"title":"SplitFusion enables ultrasensitive gene fusion detection and reveals fusion variant-associated tumor heterogeneity.","authors":"Weiwei Bian, Baifeng Zhang, Zhengbo Song, Binyamin A Knisbacher, Yee Man Chan, Chloe Bao, Chunwei Xu, Wenxian Wang, Athena Hoi Yee Chu, Chenyu Lu, Hongxian Wang, Siyu Bao, Zhenyu Gong, Hoi Yee Keung, Zi-Ying Maggie Chow, Yiping Zhang, Wah Cheuk, Gad Getz, Valentina Nardi, Mengsu Yang, William Chi Shing Cho, Jian Wang, Juxiang Chen, Zongli Zheng","doi":"10.1016/j.patter.2025.101174","DOIUrl":"10.1016/j.patter.2025.101174","url":null,"abstract":"<p><p>Gene fusions are common cancer drivers and therapeutic targets, but clinical-grade open-source bioinformatic tools are lacking. Here, we introduce a fusion detection method named SplitFusion, which is fast by leveraging Burrows-Wheeler Aligner-maximal exact match (BWA-MEM) split alignments, can detect cryptic splice-site fusions (e.g., <i>EML4::ALK</i> v3b and <i>ARv7</i>), call fusions involving highly repetitive gene partners (e.g., <i>CIC::DUX4</i>), and infer frame-ness and exon-boundary alignments for functional prediction and minimizing false positives. Using 1,848 datasets of various sizes, SplitFusion demonstrated superior sensitivity and specificity compared to three other tools. In 1,076 formalin-fixed paraffin-embedded lung cancer samples, SplitFusion identified novel fusions and revealed that <i>EML4::ALK</i> variant 3 was associated with multiple fusion variants coexisting in the same tumor. Additionally, SplitFusion can call targeted splicing variants. Using data from 515 The Cancer Genome Atlas (TCGA) samples, SplitFusion showed the highest sensitivity and uncovered two cases of <i>SLC34A2::ROS1</i> that were missed in previous studies. These capabilities make SplitFusion highly suitable for clinical applications and the study of fusion-defined tumor heterogeneity.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 2","pages":"101174"},"PeriodicalIF":6.7,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11873004/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143558204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-14DOI: 10.1016/j.patter.2025.101183
Alejandra Alvarado
{"title":"Lessons from the EU AI Act.","authors":"Alejandra Alvarado","doi":"10.1016/j.patter.2025.101183","DOIUrl":"https://doi.org/10.1016/j.patter.2025.101183","url":null,"abstract":"","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 2","pages":"101183"},"PeriodicalIF":6.7,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11873001/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143558240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-13eCollection Date: 2025-03-14DOI: 10.1016/j.patter.2025.101182
Joseph Paillard, Jörg F Hipp, Denis A Engemann
Spectral analysis using wavelets is widely used for identifying biomarkers in EEG signals. Recently, Riemannian geometry has provided an effective mathematical framework for predicting biomedical outcomes from multichannel electroencephalography (EEG) recordings while showing concord with neuroscientific domain knowledge. However, these methods rely on handcrafted rules and sequential optimization. In contrast, deep learning (DL) offers end-to-end trainable models achieving state-of-the-art performance on various prediction tasks but lacks interpretability and interoperability with established neuroscience concepts. We introduce Gabor Riemann EEGNet (GREEN), a lightweight neural network that integrates wavelet transforms and Riemannian geometry for processing raw EEG data. Benchmarking on six prediction tasks across four datasets with over 5,000 participants, GREEN outperformed non-deep state-of-the-art models and performed favorably against large DL models while using orders-of-magnitude fewer parameters. Computational experiments showed that GREEN facilitates learning sparse representations without compromising performance. By integrating domain knowledge, GREEN combines a desirable complexity-performance trade-off with interpretable representations.
{"title":"GREEN: A lightweight architecture using learnable wavelets and Riemannian geometry for biomarker exploration with EEG signals.","authors":"Joseph Paillard, Jörg F Hipp, Denis A Engemann","doi":"10.1016/j.patter.2025.101182","DOIUrl":"10.1016/j.patter.2025.101182","url":null,"abstract":"<p><p>Spectral analysis using wavelets is widely used for identifying biomarkers in EEG signals. Recently, Riemannian geometry has provided an effective mathematical framework for predicting biomedical outcomes from multichannel electroencephalography (EEG) recordings while showing concord with neuroscientific domain knowledge. However, these methods rely on handcrafted rules and sequential optimization. In contrast, deep learning (DL) offers end-to-end trainable models achieving state-of-the-art performance on various prediction tasks but lacks interpretability and interoperability with established neuroscience concepts. We introduce Gabor Riemann EEGNet (GREEN), a lightweight neural network that integrates wavelet transforms and Riemannian geometry for processing raw EEG data. Benchmarking on six prediction tasks across four datasets with over 5,000 participants, GREEN outperformed non-deep state-of-the-art models and performed favorably against large DL models while using orders-of-magnitude fewer parameters. Computational experiments showed that GREEN facilitates learning sparse representations without compromising performance. By integrating domain knowledge, GREEN combines a desirable complexity-performance trade-off with interpretable representations.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 3","pages":"101182"},"PeriodicalIF":6.7,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963017/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143781257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}