Do Genes-in-Pieces Code for Proteins that Fold in Pieces?
Published:20 Sep.2024    Source:Rice University
A new study led by Rice University's Peter Wolynes offers new insights into the evolution of foldable proteins. Researchers at Rice and the University of Buenos Aires used energy landscape theory to distinguish between foldable and nonfoldable parts of protein sequences. Their study illuminates the ongoing debate about whether the pieces of DNA that code for only part of a protein during their origins can fold on their own. The researchers focused on the extensive relationship between exons in protein structures and the evolution of protein foldability. They highlighted the significance of exons, the parts of the gene that code for proteins, and introns, the silent regions discarded during gene translation into proteins.
 
"Using the extensive genomic exon-intron organization and protein sequence data now available, we explored exon boundary conservation and assessed its behavior using energy landscape theoretic measurements," said Wolynes, the D.R. Bullard-Welch Foundation Professor of Science, professor of chemistry, biosciences, physics and astronomy and co-director of the Center for Theoretical Biological Physics (CTBP). The team has now assessed exons as potential protein folding modules across 38 abundant and conserved protein families. Over generations, exons can shuffle randomly along the genome, leading to significant changes in genes and the creation of new proteins. The findings indicated deviations in the exon size distribution from exponential decay, suggesting there was evolutionary selection. Natural proteins are linear chains of amino acids that typically fold into compact three-dimensional structures to perform biological functions. The specific sequence of amino acids dictates the final 3D structure. Therefore, the idea that exons translate into independently folded protein regions, or foldons, is very attractive.
 
Using computational methods, the researchers measured the likelihood of the amino acid chain coded by an exon to fold into a stable 3D structure, similar to the full protein. Their results showed that while not all exons led to foldable modules, the most conserved exons, consistently found in diverse organisms, corresponded with better foldons. The study found a correlation between protein folding and evolution in certain globular protein families. Protein folding involves amino acid chains folding in space to perform biological functions within relevant timescales. This correlation is a fundamental concept in protein science, assessed using genomic data and energy functions. Interestingly, the general trend did not hold for all protein families, suggesting that other biological factors may influence protein folding and evolution. The researchers' work paves the way for future studies to understand these additional factors and their impact on evolutionary biology.