Protein-protein interactions are central to numerous cellular processes, including transport, signaling, and immune response. Structural modeling of protein assemblies typically relies on AlphaFold or docking methods, which produce structural models evaluated against a single experimental reference. While AlphaFold2 and its extension, AlphaFold-Multimer, have advanced complex prediction, they, and conventional docking tools, offer only static representations. However, flexibility at protein-protein interfaces is increasingly recognized as critical for function. To address this limitation, DynaBench provides a benchmark of interface dynamics in biologically relevant protein assemblies. We performed MD simulations for over 200 protein-protein complexes listed in the Docking Benchmark 5.5 (https://zlab.umassmed.edu/benchmark/), generating three 100 ns long replicas per complex. All trajectories are now publicly available online (http://www-lbt.ibpc.fr/DynaBench) via the MDposit platform (INRIA node), which is part of the EU-funded Molecular Dynamics Data Bank (MDDB). These simulations offer a unique resource for exploring interfacial flexibility, training machine learning models, redefining accuracy metrics for model evaluation, and informing the design of protein interfaces.
Recent advances in protein structure prediction have created high-confidence candidate structures for nearly every known protein-coding gene. At the same time, many software packages have been created to visualize protein structures, protein multiple sequence alignments (MSAs), and protein annotations. However, few software tools can highlight the direct relationship between nucleotide variation of protein-coding genes in genome space and the evolutionary and structural context of that variation in protein space. To help address these needs, we created a suite of robust and reusable JavaScript components to show protein structures, MSAs, phylogenies, and their relationship to protein-coding gene regions using the JBrowse 2 genome browser. This software allows users to interface with web services such as AlphaFoldDB and Foldseek to access pre-computed structures, or to upload protein structures from sources such as ColabFold or PDB. Our resources are available at https://github.com/GMOD/proteinbrowser.
The importance of the genetic code in virology is universally acknowledged. However, it is less known that viral genomes can harbour a second code, embedded within the genetic code, that orchestrates the efficient assembly and genome packaging in many viral systems. Since its discovery in a bacterial virus, the molecular details and function of this mechanism have been characterised in a broad range of viral families, including major human pathogens. This Perspective article reports on the hallmarks of this "assembly/packaging code", the journey of its discovery, and the enticing opportunities it brings both in antiviral therapy and in virus nanotechnology.
Protein aggregation plays a central role in the pathogenesis of many neurodegenerative diseases and poses major challenges in protein engineering. A key driver of this process is the presence of aggregation-prone regions (APRs) within protein sequences. We present AggrescanAI, a deep learning-based tool that predicts residue-level aggregation propensity directly from sequence. It leverages contextual embeddings from the ProtT5 protein language model, which captures rich information implicitly encoded in the sequence, without requiring structural data. The model was trained on a set of experimentally annotated APRs, expanded via homology transfering, evaluated by cross-validation, and validated with an external benchmark. AggrescanAI outperforms state of the art predictors and captures aggregation shifts induced by pathogenic mutations. To facilitate accessibility, we provide a user-friendly and fully open Google Colab notebook: https://gitlab.com/bioinformatics-fil/aggrescanai. AggrescanAI represents a new generation of sequence-based aggregation predictors, powered by deep learning and protein language models.

