Denish Trivedi, Kalyani Patrikar and Anirban Mondal
Graph neural networks (GNN) have been demonstrated to correlate molecular structure with properties, enabling rapid evaluation of molecules for a given application. Molecular properties, including ground and excited states, are crucial to analyzing molecular behavior. However, while attention-based mechanisms and pooling methods have been optimized to accurately predict specific properties, no versatile models can predict diverse molecular properties. Here, we present graph neural networks that predict a wide range of properties with high accuracy. Model performance is high regardless of dataset size and origin. Further, we demonstrate an implementation of hierarchical pooling enabling high-accuracy prediction of excited state properties by effectively weighing aspects of features that correlate better with target properties. We show that graph attention networks consistently outperform convolution networks and linear regression, particularly for small dataset sizes. The graph attention model is more accurate than previous message-passing neural networks developed for the prediction of diverse molecular properties. Hence, the model is an efficient tool for screening and designing molecules for applications that require tuning multiple molecular properties.
{"title":"Graph-based networks for accurate prediction of ground and excited state molecular properties from minimal features†","authors":"Denish Trivedi, Kalyani Patrikar and Anirban Mondal","doi":"10.1039/D4ME00113C","DOIUrl":"https://doi.org/10.1039/D4ME00113C","url":null,"abstract":"<p >Graph neural networks (GNN) have been demonstrated to correlate molecular structure with properties, enabling rapid evaluation of molecules for a given application. Molecular properties, including ground and excited states, are crucial to analyzing molecular behavior. However, while attention-based mechanisms and pooling methods have been optimized to accurately predict specific properties, no versatile models can predict diverse molecular properties. Here, we present graph neural networks that predict a wide range of properties with high accuracy. Model performance is high regardless of dataset size and origin. Further, we demonstrate an implementation of hierarchical pooling enabling high-accuracy prediction of excited state properties by effectively weighing aspects of features that correlate better with target properties. We show that graph attention networks consistently outperform convolution networks and linear regression, particularly for small dataset sizes. The graph attention model is more accurate than previous message-passing neural networks developed for the prediction of diverse molecular properties. Hence, the model is an efficient tool for screening and designing molecules for applications that require tuning multiple molecular properties.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 12","pages":" 1275-1284"},"PeriodicalIF":3.2,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142714083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
So Jung Park, Tristan Myers, Vinson Liao and Arthi Jayaraman
Block copolymer (BCP) self-assembly leads to nanostructured materials with diverse ordered morphologies, some of which are attractive for transport applications. Multiblock AB copolymers are of interest as they offer a larger design parameter space than diblock copolymers allowing researchers to tailor their self-assembly to achieve target morphologies. In this study, we investigate the phase behavior of symmetric AxByAzByAx and BxAyBzAyBx pentablock copolymers (pentaBCPs) where A and B monomers have the same statistical segment length. We use a combination of self-consistent field theory (SCFT) calculations and molecular dynamics (MD) simulations to link the polymer design parameters, namely the fraction of middle block volume to the volume of all blocks of same type, τ, overall volume fraction of A block, fA, and segregation strength, χN, to the equilibrium morphologies and the distributions of chain conformations in these morphologies. In the phase diagrams calculated using SCFT, we observe broader double gyroid windows and the existence of lamellar morphologies even at small values fA in contrast to what has been seen for diblock copolymers. We also see a reentrant phase sequence of double gyroid → cylinder → lamellae → cylinder → double gyroid with increasing τ at fixed fA. The chain conformations adopted in these morphologies are sampled in coarse-grained MD simulations and quantified with distributions of the chain end-to-end distance and fractions of chains whose middle (A or B) and end (A or B) blocks remain within domains of same chemistry (A or B). These analyses show that the pentaBCP chains adopt “looping”, “bridging”, and “hybrid” (both looping and bridging) conformations, with a majority of the chains adopting the hybrid conformation. The spatial distributions for each of the blocks in the pentaBCPs show that blocks of the same type in a chain locally segregate within the same domains, with shorter blocks segregating towards the domain boundaries and longer blocks filling the domain interior. This combined SCFT-MD approach enables us to rapidly screen the extensive pentaBCP design space to identify design rules for transport-favorable morphologies as well as verify the chain conformations and spatial arrangements associated with the theory predicted reentrant phase behavior.
嵌段共聚物(BCP)的自组装可产生具有多种有序形态的纳米结构材料,其中一些对传输应用具有吸引力。与二嵌段共聚物相比,多嵌段 AB 共聚物具有更大的设计参数空间,允许研究人员定制自组装以实现目标形态,因此备受关注。在本研究中,我们研究了对称的 AxByAzByAx 和 BxAyBzAyBx 五嵌段共聚物(pentaBCPs)的相行为,其中 A 和 B 单体具有相同的统计段长度。我们采用自洽场理论(SCFT)计算和分子动力学(MD)模拟相结合的方法,将聚合物设计参数(即中间嵌段体积占同类所有嵌段体积的比例τ、A 嵌段的总体积分数 fA 和偏析强度 χN)与平衡形态以及这些形态中的链构象分布联系起来。在使用 SCFT 计算的相图中,我们观察到了更宽的双陀螺窗口,甚至在 fA 值较小时也存在片状形态,这与二嵌段共聚物的情况截然不同。我们还发现,在固定的 fA 值下,随着 τ 的增大,会出现双陀螺→圆柱→薄片→圆柱→双陀螺的重入相序列。在粗粒度 MD 模拟中对这些形态所采用的链构象进行了采样,并通过链端到端距离的分布以及中间(A 或 B)和末端(A 或 B)块保持在相同化学性质(A 或 B)域内的链的分数进行了量化。这些分析表明,pentaBCP 链采用了 "循环"、"桥接 "和 "混合"(循环和桥接)构象,其中大多数链采用了混合构象。pentaBCP 链中每个嵌段的空间分布显示,链中相同类型的嵌段会局部分离到相同的结构域中,较短的嵌段分离到结构域边界,较长的嵌段则填充到结构域内部。这种 SCFT-MD 组合方法使我们能够快速筛选广泛的 pentaBCP 设计空间,从而确定有利于传输的形态设计规则,并验证与理论预测的重入相行为相关的链构象和空间排列。
{"title":"Self-consistent field theory and coarse-grained molecular dynamics simulations of pentablock copolymer melt phase behavior†","authors":"So Jung Park, Tristan Myers, Vinson Liao and Arthi Jayaraman","doi":"10.1039/D4ME00138A","DOIUrl":"https://doi.org/10.1039/D4ME00138A","url":null,"abstract":"<p >Block copolymer (BCP) self-assembly leads to nanostructured materials with diverse ordered morphologies, some of which are attractive for transport applications. Multiblock AB copolymers are of interest as they offer a larger design parameter space than diblock copolymers allowing researchers to tailor their self-assembly to achieve target morphologies. In this study, we investigate the phase behavior of symmetric A<small><sub><em>x</em></sub></small>B<small><sub><em>y</em></sub></small>A<small><sub><em>z</em></sub></small>B<small><sub><em>y</em></sub></small>A<small><sub><em>x</em></sub></small> and B<small><sub><em>x</em></sub></small>A<small><sub><em>y</em></sub></small>B<small><sub><em>z</em></sub></small>A<small><sub><em>y</em></sub></small>B<small><sub><em>x</em></sub></small> pentablock copolymers (pentaBCPs) where A and B monomers have the same statistical segment length. We use a combination of self-consistent field theory (SCFT) calculations and molecular dynamics (MD) simulations to link the polymer design parameters, namely the fraction of middle block volume to the volume of all blocks of same type, <em>τ</em>, overall volume fraction of A block, <em>f</em><small><sub>A</sub></small>, and segregation strength, <em>χN</em>, to the equilibrium morphologies and the distributions of chain conformations in these morphologies. In the phase diagrams calculated using SCFT, we observe broader double gyroid windows and the existence of lamellar morphologies even at small values <em>f</em><small><sub>A</sub></small> in contrast to what has been seen for diblock copolymers. We also see a reentrant phase sequence of double gyroid → cylinder → lamellae → cylinder → double gyroid with increasing <em>τ</em> at fixed <em>f</em><small><sub>A</sub></small>. The chain conformations adopted in these morphologies are sampled in coarse-grained MD simulations and quantified with distributions of the chain end-to-end distance and fractions of chains whose middle (A or B) and end (A or B) blocks remain within domains of same chemistry (A or B). These analyses show that the pentaBCP chains adopt “looping”, “bridging”, and “hybrid” (both looping and bridging) conformations, with a majority of the chains adopting the hybrid conformation. The spatial distributions for each of the blocks in the pentaBCPs show that blocks of the same type in a chain locally segregate within the same domains, with shorter blocks segregating towards the domain boundaries and longer blocks filling the domain interior. This combined SCFT-MD approach enables us to rapidly screen the extensive pentaBCP design space to identify design rules for transport-favorable morphologies as well as verify the chain conformations and spatial arrangements associated with the theory predicted reentrant phase behavior.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 12","pages":" 1235-1253"},"PeriodicalIF":3.2,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/me/d4me00138a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142714082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ebony Shire, André A. B. Coimbra, Carlos Barba Ostria, Leonardo Rios-Solis and Diego López Barreiro
Structural proteins like silk, squid ring teeth, elastin, collagen, or resilin, among others, are inspiring the development of new sustainable biopolymeric materials for applications including healthcare, food, soft robotics, or textiles. Furthermore, advances in the fields of soft materials and synthetic biology have a joint great potential to guide the design of novel structural proteins, despite both fields progressing mostly in a separate fashion so far. Using recombinant DNA technologies and microbial fermentations, we can design new structural proteins with monomer-level sequence control and a dispersity of ca. 1.0, based on permutations of tandem repeats derived from natural structural proteins. However, the molecular design of recombinant and repetitive structural proteins is a nontrivial task that is generally approached using low-throughput trial-and-error experimentation. Here, we review recent progress in this area, in terms of structure–function relationships and DNA synthesis technologies. We also discuss experimental and computational advances towards the establishment of rapid prototyping pipelines for this family of biopolymers. Finally, we highlight future challenges to make protein-based materials a commercially viable alternative to current fossil-based polymers.
蚕丝、乌贼环齿、弹性蛋白、胶原蛋白或树脂蛋白等结构蛋白正在激发人们开发新型可持续生物聚合物材料,其应用领域包括医疗保健、食品、软机器人或纺织品。此外,软性材料和合成生物学领域的进步在指导新型结构蛋白质的设计方面具有共同的巨大潜力,尽管迄今为止这两个领域的进展大多各自为政。利用 DNA 重组技术和微生物发酵技术,我们可以根据从天然结构蛋白中提取的串联重复序列的排列组合,设计出具有单体级序列控制和约 1.0 分散性的新型结构蛋白。然而,重组和重复结构蛋白的分子设计并非易事,通常需要通过低通量的试错实验来完成。在此,我们从结构-功能关系和 DNA 合成技术的角度回顾了这一领域的最新进展。我们还讨论了在为这一系列生物聚合物建立快速原型管道方面取得的实验和计算进展。最后,我们强调了使基于蛋白质的材料成为目前化石基聚合物的商业可行替代品所面临的未来挑战。
{"title":"Molecular design of protein-based materials – state of the art, opportunities and challenges at the interface between materials engineering and synthetic biology","authors":"Ebony Shire, André A. B. Coimbra, Carlos Barba Ostria, Leonardo Rios-Solis and Diego López Barreiro","doi":"10.1039/D4ME00122B","DOIUrl":"10.1039/D4ME00122B","url":null,"abstract":"<p >Structural proteins like silk, squid ring teeth, elastin, collagen, or resilin, among others, are inspiring the development of new sustainable biopolymeric materials for applications including healthcare, food, soft robotics, or textiles. Furthermore, advances in the fields of soft materials and synthetic biology have a joint great potential to guide the design of novel structural proteins, despite both fields progressing mostly in a separate fashion so far. Using recombinant DNA technologies and microbial fermentations, we can design new structural proteins with monomer-level sequence control and a dispersity of <em>ca.</em> 1.0, based on permutations of tandem repeats derived from natural structural proteins. However, the molecular design of recombinant and repetitive structural proteins is a nontrivial task that is generally approached using low-throughput trial-and-error experimentation. Here, we review recent progress in this area, in terms of structure–function relationships and DNA synthesis technologies. We also discuss experimental and computational advances towards the establishment of rapid prototyping pipelines for this family of biopolymers. Finally, we highlight future challenges to make protein-based materials a commercially viable alternative to current fossil-based polymers.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 12","pages":" 1187-1209"},"PeriodicalIF":3.2,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/me/d4me00122b?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mark S. Bannon, Jeffrey F. Ellena, Aditi S. Gourishankar, Spencer R. Marsh, Dilza Trevisan-Silva, Nicholas E. Sherman, L. Jane Jourdan, Robert G. Gourdie and Rachel A. Letteri
Peptides are naturally potent and selective therapeutics with massive potential; however, low cell membrane permeability limits their clinical implementation, particularly for hydrophilic, anionic peptides with intracellular targets. To overcome this limitation, esterification of anionic carboxylic acids on therapeutic peptides can simultaneously increase hydrophobicity and net charge to facilitate cell internalization, whereafter installed esters can be cleaved hydrolytically to restore activity. To date, however, most esterified therapeutics contain either a single esterification site or multiple esters randomly incorporated on multiple sites. This investigation provides molecular engineering insight into how the number and position of esters installed onto the therapeutic peptide α carboxyl terminus 11 (αCT11, RPRPDDLEI) with 4 esterification sites affect hydrophobicity and the hydrolysis process that reverts the peptide to its original form. After installing methyl esters onto αCT11 using Fischer esterification, we isolated 5 distinct products and used 2D nuclear magnetic resonance spectroscopy, reverse-phase high performance liquid chromatography, and mass spectrometry to determine which residues were esterified in each and the resulting increase in hydrophobicity. We found esterifying the C-terminal isoleucine to impart the largest increase in hydrophobicity. Monitoring ester hydrolysis showed the C-terminal isoleucine ester to be the most hydrolytically stable, followed by the glutamic acid, whereas esters on aspartic acids hydrolyze rapidly. LC-MS revealed the formation of transient intramolecular aspartimides prior to hydrolysis to carboxylic acids. In vitro proof-of-concept experiments showed esterifying αCT11 to increase cell migration into a scratch, highlighting the potential of multi-site esterification as a tunable, reversible strategy to enable the delivery of therapeutic peptides.
{"title":"Multi-site esterification: a tunable, reversible strategy to tailor therapeutic peptides for delivery†","authors":"Mark S. Bannon, Jeffrey F. Ellena, Aditi S. Gourishankar, Spencer R. Marsh, Dilza Trevisan-Silva, Nicholas E. Sherman, L. Jane Jourdan, Robert G. Gourdie and Rachel A. Letteri","doi":"10.1039/D4ME00072B","DOIUrl":"10.1039/D4ME00072B","url":null,"abstract":"<p >Peptides are naturally potent and selective therapeutics with massive potential; however, low cell membrane permeability limits their clinical implementation, particularly for hydrophilic, anionic peptides with intracellular targets. To overcome this limitation, esterification of anionic carboxylic acids on therapeutic peptides can simultaneously increase hydrophobicity and net charge to facilitate cell internalization, whereafter installed esters can be cleaved hydrolytically to restore activity. To date, however, most esterified therapeutics contain either a single esterification site or multiple esters randomly incorporated on multiple sites. This investigation provides molecular engineering insight into how the number and position of esters installed onto the therapeutic peptide α carboxyl terminus 11 (αCT11, RPRPDDLEI) with 4 esterification sites affect hydrophobicity and the hydrolysis process that reverts the peptide to its original form. After installing methyl esters onto αCT11 using Fischer esterification, we isolated 5 distinct products and used 2D nuclear magnetic resonance spectroscopy, reverse-phase high performance liquid chromatography, and mass spectrometry to determine which residues were esterified in each and the resulting increase in hydrophobicity. We found esterifying the C-terminal isoleucine to impart the largest increase in hydrophobicity. Monitoring ester hydrolysis showed the C-terminal isoleucine ester to be the most hydrolytically stable, followed by the glutamic acid, whereas esters on aspartic acids hydrolyze rapidly. LC-MS revealed the formation of transient intramolecular aspartimides prior to hydrolysis to carboxylic acids. <em>In vitro</em> proof-of-concept experiments showed esterifying αCT11 to increase cell migration into a scratch, highlighting the potential of multi-site esterification as a tunable, reversible strategy to enable the delivery of therapeutic peptides.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 12","pages":" 1215-1227"},"PeriodicalIF":3.2,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/me/d4me00072b?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Youcong Li, Jiahao Dong, Yue Zhao, Lei Gao, Yu-Hao Gu and Shuai Yuan
Metal–organic frameworks (MOFs) are promising platforms for designing photoresponsive materials due to their structural versatility and tunable properties. However, challenges remain in fine-tuning the photoresponsive behavior while maintaining the high stability of MOFs. In this study, we synthesized a MOF containing redox-active pyromellitic diimide (PMDI) groups and unsaturated Zr6 clusters named Zr-PMDI-DMF and fine-tuned its photochromic properties by exchanging the coordination solvent molecules on the Zr sites. Unlike traditional Zr6 clusters with bidentate carboxylate coordination, Zr-PMDI-DMF features monodentate carboxylate coordination with the exposed Zr sites occupied by solvent molecules. We post-synthetically exchanged the coordinated N,N-dimethylformamide (DMF) solvent molecules with 2-(dimethylamino)ethanol (DMAE), N-methyltetrahydropyrrole (NMP), and dimethyl sulfoxide (DMSO) and determined the structures of the coordinated solvent molecules using single-crystal X-ray diffraction. Through photochromic and bleaching cycle experiments, electron paramagnetic resonance spectroscopy, and density functional theory calculations, we found that the coordinated solvents act as electron donors. In contrast, PMDI ligands act as electron acceptors, causing intra-framework electron transfer and photochromism. The rate of the photochromic response correlated with the electron-donating ability of the solvents, following the trend of DMAE > NMP > DMSO > DMF.
{"title":"Controlling the photochromism of zirconium pyromellitic diimide-based metal–organic frameworks through coordinating solvents†","authors":"Youcong Li, Jiahao Dong, Yue Zhao, Lei Gao, Yu-Hao Gu and Shuai Yuan","doi":"10.1039/D4ME00104D","DOIUrl":"10.1039/D4ME00104D","url":null,"abstract":"<p >Metal–organic frameworks (MOFs) are promising platforms for designing photoresponsive materials due to their structural versatility and tunable properties. However, challenges remain in fine-tuning the photoresponsive behavior while maintaining the high stability of MOFs. In this study, we synthesized a MOF containing redox-active pyromellitic diimide (PMDI) groups and unsaturated Zr<small><sub>6</sub></small> clusters named Zr-PMDI-DMF and fine-tuned its photochromic properties by exchanging the coordination solvent molecules on the Zr sites. Unlike traditional Zr<small><sub>6</sub></small> clusters with bidentate carboxylate coordination, Zr-PMDI-DMF features monodentate carboxylate coordination with the exposed Zr sites occupied by solvent molecules. We post-synthetically exchanged the coordinated <em>N</em>,<em>N</em>-dimethylformamide (DMF) solvent molecules with 2-(dimethylamino)ethanol (DMAE), <em>N</em>-methyltetrahydropyrrole (NMP), and dimethyl sulfoxide (DMSO) and determined the structures of the coordinated solvent molecules using single-crystal X-ray diffraction. Through photochromic and bleaching cycle experiments, electron paramagnetic resonance spectroscopy, and density functional theory calculations, we found that the coordinated solvents act as electron donors. In contrast, PMDI ligands act as electron acceptors, causing intra-framework electron transfer and photochromism. The rate of the photochromic response correlated with the electron-donating ability of the solvents, following the trend of DMAE > NMP > DMSO > DMF.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 12","pages":" 1228-1234"},"PeriodicalIF":3.2,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lingfeng Gui, Alan Armstrong, Amparo Galindo, Fareed Bhasha Sayyed, Stanley P. Kolis and Claire S. Adjiman
Developing an accurate predictive model of solvent effects on reaction kinetics is a challenging task, yet it can play an important role in process development. While first-principles or machine learning models are often compute- or data-intensive, simple surrogate models, such as multivariate linear or quadratic regression models, are useful when computational resources and data are scarce. The judicious choice of a small set of training data, i.e., a set of solvents in which quantum mechanical (QM) calculations of liquid-phase rate constants are to be performed, is critical to obtaining a reliable model. This is, however, made especially challenging by the highly irregular shape of the discrete space of possible experiments (solvent choices). In this work, we demonstrate that when choosing a set of computer experiments to generate training data, the D-optimality criterion value of the chosen set correlates well with the likelihood of achieving good model performance. With the Menshutkin reaction of pyridine and phenacyl bromide as a case study, this finding is further verified via the evaluation of the surrogate models regressed using D-optimal solvent sets generated from four distinct selection spaces. We also find that incorporating quadratic terms in the surrogate model and choosing the D-optimal solvent set from a selection space similar to the test set can significantly improve the accuracy of reaction rate constant predictions while using a small training dataset. Our approach holds promise for the use of statistical optimality criteria for other types of computer experiments, supporting the construction of surrogate models with reduced resource and data requirements.
就溶剂对反应动力学的影响建立精确的预测模型是一项极具挑战性的任务,但却能在工艺开发中发挥重要作用。第一原理或机器学习模型通常是计算或数据密集型的,而简单的代用模型,如多元线性或二次回归模型,在计算资源和数据稀缺的情况下非常有用。要获得可靠的模型,明智地选择一小组训练数据(即一组溶剂,在其中对液相速率常数进行量子力学(QM)计算)至关重要。然而,由于可能的实验(溶剂选择)的离散空间形状极不规则,这尤其具有挑战性。在这项工作中,我们证明了在选择一组计算机实验来生成训练数据时,所选实验组的 D-optimality 标准值与获得良好模型性能的可能性密切相关。以吡啶和苯酰溴的 Menschutkin 反应为例,通过评估使用从四个不同选择空间生成的 D-最优溶剂集回归的代用模型,进一步验证了这一发现。我们还发现,在代用模型中加入二次项,并从与测试集类似的选择空间中选择 D 最佳溶剂集,可以显著提高反应速率常数预测的准确性,同时只需使用少量的训练数据集。我们的方法有望在其他类型的计算机实验中使用统计最优性标准,支持在减少资源和数据需求的情况下构建代用模型。
{"title":"On the design of optimal computer experiments to model solvent effects on reaction kinetics†","authors":"Lingfeng Gui, Alan Armstrong, Amparo Galindo, Fareed Bhasha Sayyed, Stanley P. Kolis and Claire S. Adjiman","doi":"10.1039/D4ME00074A","DOIUrl":"10.1039/D4ME00074A","url":null,"abstract":"<p >Developing an accurate predictive model of solvent effects on reaction kinetics is a challenging task, yet it can play an important role in process development. While first-principles or machine learning models are often compute- or data-intensive, simple surrogate models, such as multivariate linear or quadratic regression models, are useful when computational resources and data are scarce. The judicious choice of a small set of training data, <em>i.e.</em>, a set of solvents in which quantum mechanical (QM) calculations of liquid-phase rate constants are to be performed, is critical to obtaining a reliable model. This is, however, made especially challenging by the highly irregular shape of the discrete space of possible experiments (solvent choices). In this work, we demonstrate that when choosing a set of computer experiments to generate training data, the D-optimality criterion value of the chosen set correlates well with the likelihood of achieving good model performance. With the Menshutkin reaction of pyridine and phenacyl bromide as a case study, this finding is further verified <em>via</em> the evaluation of the surrogate models regressed using D-optimal solvent sets generated from four distinct selection spaces. We also find that incorporating quadratic terms in the surrogate model and choosing the D-optimal solvent set from a selection space similar to the test set can significantly improve the accuracy of reaction rate constant predictions while using a small training dataset. Our approach holds promise for the use of statistical optimality criteria for other types of computer experiments, supporting the construction of surrogate models with reduced resource and data requirements.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 12","pages":" 1254-1274"},"PeriodicalIF":3.2,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/me/d4me00074a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yongbaek Kim, Hiroto Isobe, Keishi Nishio and Kazuki Murai
Biomineralization has garnered attention not only for its fundamental role in understanding the mechanisms of biomineral formation but also as a method for fabricating next-generation functional materials. In this study, we investigated the nucleation, crystal growth, and particle growth processes of calcium phosphates (CaPs) formed using selective mineralization at the hydrogel interface induced by the fusion of peptide hydrogels. After 1 day of mineralization, band-like white precipitates were observed at the fusion interface of the hydrogels. Notably, the nucleation and crystal growth of the mineralized CaP exhibited different behaviors owing to the differences in the properties of the reaction interface for mineralization. The selective nucleation and crystal growth of the CaPs at the hydrogel interface were attributed to (1) the local concentration of mineral sources near the peptide network, driven by electrostatic interactions between the polar functional groups and mineral source ions, and (2) selective crystal growth of the CaPs induced by the nanostructure of the surface functional groups.
生物矿化不仅在理解生物矿物形成机制方面发挥着基础性作用,而且还是制造下一代功能材料的一种方法,因而备受关注。在本研究中,我们研究了多肽水凝胶融合诱导的水凝胶界面选择性矿化形成的磷酸钙(CaPs)的成核、晶体生长和颗粒生长过程。矿化一天后,在水凝胶的融合界面上观察到了带状白色沉淀。值得注意的是,由于矿化反应界面的性质不同,矿化 CaP 的成核和晶体生长表现出不同的行为。水凝胶界面上 CaPs 的选择性成核和晶体生长归因于:(1) 极性官能团和矿物源离子之间的静电作用驱动了肽网络附近矿物源的局部富集;(2) 表面官能团的纳米结构诱导了 CaPs 的选择性晶体生长。
{"title":"Selective mineralization at hydrogel interface induced by fusion between peptide hydrogels†","authors":"Yongbaek Kim, Hiroto Isobe, Keishi Nishio and Kazuki Murai","doi":"10.1039/D4ME00112E","DOIUrl":"10.1039/D4ME00112E","url":null,"abstract":"<p >Biomineralization has garnered attention not only for its fundamental role in understanding the mechanisms of biomineral formation but also as a method for fabricating next-generation functional materials. In this study, we investigated the nucleation, crystal growth, and particle growth processes of calcium phosphates (CaPs) formed using selective mineralization at the hydrogel interface induced by the fusion of peptide hydrogels. After 1 day of mineralization, band-like white precipitates were observed at the fusion interface of the hydrogels. Notably, the nucleation and crystal growth of the mineralized CaP exhibited different behaviors owing to the differences in the properties of the reaction interface for mineralization. The selective nucleation and crystal growth of the CaPs at the hydrogel interface were attributed to (1) the local concentration of mineral sources near the peptide network, driven by electrostatic interactions between the polar functional groups and mineral source ions, and (2) selective crystal growth of the CaPs induced by the nanostructure of the surface functional groups.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 11","pages":" 1107-1115"},"PeriodicalIF":3.2,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Junlong Yao, Zongqiang Fu, Huan Yang, Lin Gao, Xueliang Jiang, Wei Nie, Zhengguang Sun, Haolan Lu, Meiyun Lin and Jinglou Xu
Multifunctional composites with rapid self-healing performance have been widely applied in various fields. However, different types of fillers result in decreased self-healing efficiency and present agglomeration and poor compatibility especially at high filler contents. Here, based on the different surface modifications of barium titanate (BT) and silicon carbide (SiC) and the amide-bond synergistic effects between these fillers, self-healing supramolecular composites with high filler contents (up to 30%) are reported, and exhibit high strength, dielectric and thermal-conduction properties. Modification significantly improves the dispersion of these fillers, and greatly enhances the coexistence and synergy between these fillers. This three-phase amide-bonded supramolecular composite exhibits a high tensile strength of 3.22 MPa compared to other self-healing materials such as self-healing hydrogels, a high dielectric constant of 23, a high thermal conductivity of 0.36 W m−1 K−1 and a superior self-healing efficiency of above 94%. These performances are ascribed to the formation of amide bonds between the amino groups in 3-aminopropyltriethoxysilane (KH550)-modified silicon carbide (SiC-NH2) and the carboxyl groups in tartaric acid (TA)-modified barium titanate (BT-TA), which can provide efficient supramolecular interactions between different fillers, as well as more reversible hydrogen bonding for the matrix. This three-phase amide-bonded supramolecular composite provides an effective strategy to improve the self-healing properties of multifunctional composites, and will bring pioneering functions to electronic packaging materials, dielectric energy storage materials, environmental energy and other fields, which can open up broad application prospects.
{"title":"Construction of amide-bonded supramolecular multifunctional fillers towards boosted self-healing, thermal conductivity and dielectric properties","authors":"Junlong Yao, Zongqiang Fu, Huan Yang, Lin Gao, Xueliang Jiang, Wei Nie, Zhengguang Sun, Haolan Lu, Meiyun Lin and Jinglou Xu","doi":"10.1039/D4ME00114A","DOIUrl":"10.1039/D4ME00114A","url":null,"abstract":"<p >Multifunctional composites with rapid self-healing performance have been widely applied in various fields. However, different types of fillers result in decreased self-healing efficiency and present agglomeration and poor compatibility especially at high filler contents. Here, based on the different surface modifications of barium titanate (BT) and silicon carbide (SiC) and the amide-bond synergistic effects between these fillers, self-healing supramolecular composites with high filler contents (up to 30%) are reported, and exhibit high strength, dielectric and thermal-conduction properties. Modification significantly improves the dispersion of these fillers, and greatly enhances the coexistence and synergy between these fillers. This three-phase amide-bonded supramolecular composite exhibits a high tensile strength of 3.22 MPa compared to other self-healing materials such as self-healing hydrogels, a high dielectric constant of 23, a high thermal conductivity of 0.36 W m<small><sup>−1</sup></small> K<small><sup>−1</sup></small> and a superior self-healing efficiency of above 94%. These performances are ascribed to the formation of amide bonds between the amino groups in 3-aminopropyltriethoxysilane (KH550)-modified silicon carbide (SiC-NH<small><sub>2</sub></small>) and the carboxyl groups in tartaric acid (TA)-modified barium titanate (BT-TA), which can provide efficient supramolecular interactions between different fillers, as well as more reversible hydrogen bonding for the matrix. This three-phase amide-bonded supramolecular composite provides an effective strategy to improve the self-healing properties of multifunctional composites, and will bring pioneering functions to electronic packaging materials, dielectric energy storage materials, environmental energy and other fields, which can open up broad application prospects.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 11","pages":" 1167-1178"},"PeriodicalIF":3.2,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qinghe Gao, Tammo Dukker, Artur M. Schweidtmann and Jana M. Weber
The estimation of polymer properties is of crucial importance in many domains such as energy, healthcare, and packaging. Recently, graph neural networks (GNNs) have shown promising results for the prediction of polymer properties based on supervised learning. However, the training of GNNs in a supervised learning task demands a huge amount of polymer property data that is time-consuming and computationally/experimentally expensive to obtain. Self-supervised learning offers great potential to reduce this data demand through pre-training the GNNs on polymer structure data only. These pre-trained GNNs can then be fine-tuned on the supervised property prediction task using a much smaller labeled dataset. We propose to leverage self-supervised learning techniques in GNNs for the prediction of polymer properties. We employ a recent polymer graph representation that includes essential features of polymers, such as monomer combinations, stochastic chain architecture, and monomer stoichiometry, and process the polymer graphs through a tailored GNN architecture. We investigate three self-supervised learning setups: (i) node- and edge-level pre-training, (ii) graph-level pre-training, and (iii) ensembled node-, edge- & graph-level pre-training. We additionally explore three different transfer strategies of fully connected layers with the GNN architecture. Our results indicate that the ensemble node-, edge- & graph-level self-supervised learning with all layers transferred depicts the best performance across dataset size. In scarce data scenarios, it decreases the root mean square errors by 28.39% and 19.09% for the prediction of electron affinity and ionization potential compared to supervised learning without the pre-training task.
{"title":"Self-supervised graph neural networks for polymer property prediction†","authors":"Qinghe Gao, Tammo Dukker, Artur M. Schweidtmann and Jana M. Weber","doi":"10.1039/D4ME00088A","DOIUrl":"10.1039/D4ME00088A","url":null,"abstract":"<p >The estimation of polymer properties is of crucial importance in many domains such as energy, healthcare, and packaging. Recently, graph neural networks (GNNs) have shown promising results for the prediction of polymer properties based on supervised learning. However, the training of GNNs in a supervised learning task demands a huge amount of polymer property data that is time-consuming and computationally/experimentally expensive to obtain. Self-supervised learning offers great potential to reduce this data demand through pre-training the GNNs on polymer structure data only. These pre-trained GNNs can then be fine-tuned on the supervised property prediction task using a much smaller labeled dataset. We propose to leverage self-supervised learning techniques in GNNs for the prediction of polymer properties. We employ a recent polymer graph representation that includes essential features of polymers, such as monomer combinations, stochastic chain architecture, and monomer stoichiometry, and process the polymer graphs through a tailored GNN architecture. We investigate three self-supervised learning setups: (i) node- and edge-level pre-training, (ii) graph-level pre-training, and (iii) ensembled node-, edge- & graph-level pre-training. We additionally explore three different transfer strategies of fully connected layers with the GNN architecture. Our results indicate that the ensemble node-, edge- & graph-level self-supervised learning with all layers transferred depicts the best performance across dataset size. In scarce data scenarios, it decreases the root mean square errors by 28.39% and 19.09% for the prediction of electron affinity and ionization potential compared to supervised learning without the pre-training task.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 11","pages":" 1130-1143"},"PeriodicalIF":3.2,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/me/d4me00088a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Electrowetting display (EWD) technology is among the most promising reflective display technologies due to its full-color capabilities and fast video-speed performance. The colored EWD inks are typically prepared by dissolving soluble organic dyes in non-polar solvents, which significantly influence the color performance, electro-optical behaviour, and longevity of EWD devices. In this study, density functional theory (DFT) at the PBE1PBE/6-31G* level and time-dependent density functional theory (TD-DFT) at the M06-2X/6-31G* level were utilized to calculate a series of benzobisthiadiazole-based donor–acceptor–donor (D–A–D) type near-infrared organic dyes for EWDs, providing structural and spectral data to aid in spectral assignment. The quantum chemical calculations' results align with our experimental synthesis data, showing molecular colors spanning blue, green, and cyan. Detailed investigations into the properties of these dyes, including absorption, electro-optical response, and photo-stability, were conducted. The experimental outcomes indicate that these organic dyes are excellent candidates for EWD applications.
{"title":"Computational-assisted molecular design, synthesis and application of benzobisthiadiazole-based near-infrared dye in electrowetting displays†","authors":"Junheng Chen, Haoteng Lin, Xintong Wang, Dinggui He, Baoyi Luo, Yuanyuan Guo, Wangqiao Chen and Guofu Zhou","doi":"10.1039/D4ME00115J","DOIUrl":"10.1039/D4ME00115J","url":null,"abstract":"<p >Electrowetting display (EWD) technology is among the most promising reflective display technologies due to its full-color capabilities and fast video-speed performance. The colored EWD inks are typically prepared by dissolving soluble organic dyes in non-polar solvents, which significantly influence the color performance, electro-optical behaviour, and longevity of EWD devices. In this study, density functional theory (DFT) at the PBE1PBE/6-31G* level and time-dependent density functional theory (TD-DFT) at the M06-2X/6-31G* level were utilized to calculate a series of benzobisthiadiazole-based donor–acceptor–donor (D–A–D) type near-infrared organic dyes for EWDs, providing structural and spectral data to aid in spectral assignment. The quantum chemical calculations' results align with our experimental synthesis data, showing molecular colors spanning blue, green, and cyan. Detailed investigations into the properties of these dyes, including absorption, electro-optical response, and photo-stability, were conducted. The experimental outcomes indicate that these organic dyes are excellent candidates for EWD applications.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 11","pages":" 1144-1154"},"PeriodicalIF":3.2,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}