Academic literature on the topic 'Protein Side-chain Networks (PScN)'


Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Protein Side-chain Networks (PScN).'


Journal articles on the topic "Protein Side-chain Networks (PScN)"

1

Hwang, Jenn-Kang, and Wen-Fa Liao. "Side-chain prediction by neural networks and simulated annealing optimization." "Protein Engineering, Design and Selection" 8, no. 4 (1995): 363–70. http://dx.doi.org/10.1093/protein/8.4.363.

2

IRWIN, J., H. BOHR, K. MOCHIZUKI, and P. G. WOLYNES. "CLASSIFICATION AND PREDICTION OF PROTEIN SIDE-CHAINS BY NEURAL NETWORK TECHNIQUES." International Journal of Neural Systems 03, supp01 (January 1992): 177–82. http://dx.doi.org/10.1142/s0129065792000504.

Abstract:
Neural Network methodology is used to classify and predict side-chain configurations in proteins on the basis of their sequence and in some cases also Cα-atomic distance information. In some of these methods, where Potts Associative Memories are employed, a mixed set of Potts systems each describe the various orientational states of a particular side-chain. The methods can find the correct side-chain orientations in proteins reasonably well after being trained on a data set of other proteins of known 3-dimensional structure.
3

Xu, Gang, Qinghua Wang, and Jianpeng Ma. "OPUS-Rota3: Improving Protein Side-Chain Modeling by Deep Neural Networks and Ensemble Methods." Journal of Chemical Information and Modeling 60, no. 12 (November 19, 2020): 6691–97. http://dx.doi.org/10.1021/acs.jcim.0c00951.

4

Bond, Paul S., Keith S. Wilson, and Kevin D. Cowtan. "Predicting protein model correctness in Coot using machine learning." Acta Crystallographica Section D Structural Biology 76, no. 8 (July 27, 2020): 713–23. http://dx.doi.org/10.1107/s2059798320009080.

Abstract:
Manually identifying and correcting errors in protein models can be a slow process, but improvements in validation tools and automated model-building software can contribute to reducing this burden. This article presents a new correctness score that is produced by combining multiple sources of information using a neural network. The residues in 639 automatically built models were marked as correct or incorrect by comparing them with the coordinates deposited in the PDB. A number of features were also calculated for each residue using Coot, including map-to-model correlation, density values, B factors, clashes, Ramachandran scores, rotamer scores and resolution. Two neural networks were created using these features as inputs: one to predict the correctness of main-chain atoms and the other for side chains. The 639 structures were split into 511 that were used to train the neural networks and 128 that were used to test performance. The predicted correctness scores could correctly categorize 92.3% of the main-chain atoms and 87.6% of the side chains. A Coot ML Correctness script was written to display the scores in a graphical user interface as well as for the automatic pruning of chains, residues and side chains with low scores. The automatic pruning function was added to the CCP4i2 Buccaneer automated model-building pipeline, leading to significant improvements, especially for high-resolution structures.
5

Conover, Matthew, Max Staples, Dong Si, Miao Sun, and Renzhi Cao. "AngularQA: Protein Model Quality Assessment with LSTM Networks." Computational and Mathematical Biophysics 7, no. 1 (May 29, 2019): 1–9. http://dx.doi.org/10.1515/cmb-2019-0001.

Abstract:
Quality Assessment (QA) plays an important role in protein structure prediction. Traditional multimodel QA method usually suffer from searching databases or comparing with other models for making predictions, which usually fail when the poor quality models dominate the model pool. We propose a novel protein single-model QA method which is built on a new representation that converts raw atom information into a series of carbon-alpha (Cα) atoms with side-chain information, defined by their dihedral angles and bond lengths to the prior residue. An LSTM network is used to predict the quality by treating each amino acid as a time-step and consider the final value returned by the LSTM cells. To the best of our knowledge, this is the first time anyone has attempted to use an LSTM model on the QA problem; furthermore, we use a new representation which has not been studied for QA. In addition to angles, we make use of sequence properties like secondary structure parsed from protein structure at each time-step without using any database, which is different than all existed QA methods. Our model achieves an overall correlation of 0.651 on the CASP12 testing dataset. Our experiment points out new directions for QA problem and our method could be widely used for protein structure prediction problem. The software is freely available at GitHub: https://github.com/caorenzhi/AngularQA
6

Petrovskiy, Denis V., Kirill S. Nikolsky, Vladimir R. Rudnev, Liudmila I. Kulikova, Tatiana V. Butkova, Kristina A. Malsagova, Arthur T. Kopylov, and Anna L. Kaysheva. "Modeling Side Chains in the Three-Dimensional Structure of Proteins for Post-Translational Modifications." International Journal of Molecular Sciences 24, no. 17 (August 30, 2023): 13431. http://dx.doi.org/10.3390/ijms241713431.

Abstract:
Amino acid substitutions and post-translational modifications (PTMs) play a crucial role in many cellular processes by directly affecting the structural and dynamic features of protein interaction. Despite their importance, the understanding of protein PTMs at the structural level is still largely incomplete. The Protein Data Bank contains a relatively small number of 3D structures having post-translational modifications. Although recent years have witnessed significant progress in three-dimensional modeling (3D) of proteins using neural networks, the problem related to predicting accurate PTMs in proteins has been largely ignored. Predicting accurate 3D PTM models in proteins is closely related to another fundamental problem: predicting the correct side-chain conformations of amino acid residues in proteins. An analysis of publications as well as the paid and free software packages for modeling three-dimensional structures showed that most of them focus on working with unmodified proteins and canonical amino acid residues; the number of articles and software packages placing emphasis on modeling three-dimensional PTM structures is an order of magnitude smaller. This paper focuses on modeling the side-chain conformations of proteins containing PTMs (nonstandard amino acid residues). We collected our own libraries comprising the most frequently observed PTMs from the PDB and implemented a number of algorithms for predicting the side-chain conformation at modification points and in the immediate environment of the protein. A comprehensive analysis of both the algorithms per se and compared to the common Rosetta and FoldX structure modeling packages was also carried out. The proposed algorithmic solutions are comparable in their characteristics to the well-known Rosetta and FoldX packages for the modeling of three-dimensional structures and have great potential for further development and optimization. 
The source code of algorithmic solutions has been deposited to and is available at the GitHub source.
7

WANG, LIANGJIANG, and SUSAN J. BROWN. "PREDICTION OF DNA-BINDING RESIDUES FROM SEQUENCE FEATURES." Journal of Bioinformatics and Computational Biology 04, no. 06 (December 2006): 1141–58. http://dx.doi.org/10.1142/s0219720006002387.

Abstract:
Protein–DNA interaction plays a pivotal role in transcriptional regulation, DNA metabolism and chromatin formation. Although structural data are available for a few hundreds of protein–DNA complexes, the molecular recognition pattern is still poorly understood. With the rapid accumulation of sequence data from many genomes, it is important to develop predictive methods for identification of potential DNA-binding residues in proteins. In this study, neural networks have been trained using five sequence-derived features for prediction of DNA-binding residues. These features include the molecular mass, hydrophobicity index, side chain pKa value, solvent accessible surface area and conservation score of an amino acid. Interestingly, the side chain pKa value appears to be the best feature for prediction, suggesting that the ionization state of amino acid side chains is important for DNA-binding. The predictive performance is enhanced by using multiple features for classifier construction. The classifier that has been constructed using all the five features predicts at 72.71% sensitivity and 67.73% specificity. This is by far the most accurate classifier reported for prediction of DNA-binding residues from sequence data. The classifier has also been evaluated by using the Receiver Operating Characteristic curve and by examining the predictions made for different classes of DNA-binding proteins. Supplementary materials including the datasets are available at .
8

Steiner, Thomas, Antoine M. M. Schreurs, Jan A. Kanters, and Jan Kroon. "Water Molecules Hydrogen Bonding to Aromatic Acceptors of Amino Acids: the Structure of Tyr-Tyr-Phe Dihydrate and a Crystallographic Database Study on Peptides." Acta Crystallographica Section D Biological Crystallography 54, no. 1 (January 1, 1998): 25–31. http://dx.doi.org/10.1107/s0907444997007981.

Abstract:
The crystal structure of Tyr-Tyr-Phe dihydrate contains a hydrogen bond formed between a water molecule and the Phe side chain. The geometry is centered with a distance of 3.26 Å between the water O atom and the aromatic centroid. In a database study on hydrated peptides, four related examples are found which exhibit a wide variability of hydrogen-bond geometries. The intermolecular surroundings of the water molecules are inspected, showing that they are typically involved in complex networks of conventional and non-conventional hydrogen bonds. Possible relevance for protein hydration is given.
9

Santana, Roberto, Pedro Larrañaga, and José A. Lozano. "Combining variable neighborhood search and estimation of distribution algorithms in the protein side chain placement problem." Journal of Heuristics 14, no. 5 (October 23, 2007): 519–47. http://dx.doi.org/10.1007/s10732-007-9049-8.

10

Mahatabuddin, Sheikh, Daichi Fukami, Tatsuya Arai, Yoshiyuki Nishimiya, Rumi Shimizu, Chie Shibazaki, Hidemasa Kondo, Motoyasu Adachi, and Sakae Tsuda. "Polypentagonal ice-like water networks emerge solely in an activity-improved variant of ice-binding protein." Proceedings of the National Academy of Sciences 115, no. 21 (May 7, 2018): 5456–61. http://dx.doi.org/10.1073/pnas.1800635115.

Abstract:
Polypentagonal water networks were recently observed in a protein capable of binding to ice crystals, or ice-binding protein (IBP). To examine such water networks and clarify their role in ice-binding, we determined X-ray crystal structures of a 65-residue defective isoform of a Zoarcidae-derived IBP (wild type, WT) and its five single mutants (A20L, A20G, A20T, A20V, and A20I). Polypentagonal water networks composed of ∼50 semiclathrate waters were observed solely on the strongest A20I mutant, which appeared to include a tetrahedral water cluster exhibiting a perfect position match to the (101̄0) first prism plane of a single ice crystal. Inclusion of another symmetrical water cluster in the polypentagonal network showed a perfect complementarity to the waters constructing the (202̄1) pyramidal ice plane. The order of ice-binding strength was A20L < A20G < WT < A20T < A20V < A20I, where the top three mutants capable of binding to the first prism and the pyramidal ice planes commonly contained a bifurcated γ-CH3 group. These results suggest that a fine-tuning of the surface of Zoarcidae-derived IBP assisted by a side-chain group regulates the holding property of its polypentagonal water network, the function of which is to freeze the host protein to specific ice planes.

Dissertations / Theses on the topic "Protein Side-chain Networks (PScN)"

1

Vijayabaskar, M. S. "Protein-DNA Graphs And Interaction Energy Based Protein Structure Networks." Thesis, 2011. https://etd.iisc.ac.in/handle/2005/1904.

Abstract:
Proteins orchestrate a number of cellular processes either alone or in concert with other biomolecules like nucleic acids, carbohydrates, and lipids. They exhibit an intrinsic ability to fold de novo to their functional states. The three–dimensional structure of a protein, dependent on its amino acid sequence, is important for its function. Understanding this sequence–structure–function relationship has become one of the primary goals in biophysics. Various experimental techniques like X–ray crystallography, Nuclear Magnetic Resonance (NMR), and site–directed mutagenesis have been used extensively towards this goal. Computational studies include mainly sequence based, and structure based approaches. The sequence based approaches such as sequence alignments, phylogenetic analysis, domain identification, statistical coupling analysis etc., aim at deriving meaningful information from the primary sequence of the protein. The structure based approaches, on the other hand, use structures of folded proteins. Recent advances in structure determination and efforts by various structural consortia have resulted in an enormous amount of structures available for analysis. Innumerable observations such as the allowed and disallowed regions in the conformations of a peptide unit, hydrophobic core in globular proteins, existence of regular secondary structures like helices, sheets, and turns and a limited fold space have been landmarks in understanding the characteristics of protein structures. The uniqueness of protein structure is attained through non–covalent interactions among the constituent amino acids. Analyses of protein structures show that different types of non–covalent interactions like hydrophobic interactions, hydrogen bonding, salt bridges, aromatic stacking, cation–π interactions, and solvent interactions hold protein structures together.
Although such structure analyses have provided a wealth of information, they have largely been performed at a pair–wise level and an investigation involving such pair–wise interactions alone is not sufficient to capture all the determinants of protein structures, since they happen at a global level. This consideration has led to the development of graphs/networks for proteins. Graphs or Networks are a collection of nodes connected by edges. Protein Structure Networks (PSNs) can be constructed using various definitions of nodes and edges. Nodes may vary from atoms to secondary structures in proteins, and the edges can range from simple atom–atom distances to distance between secondary structures. To study the interplay of amino acids in structure formation, the most commonly used PSNs consider amino acids as nodes. The criterion for edge definition, however, varies. PSNs can be constructed at a coarse grain level by considering the distances between Cα/Cβ atoms, any side–chain atoms, or the centroids of the amino acids. At a finer level, PSNs can be constructed using atomic details by considering the interaction types or by computing the extent of interaction between amino acids. Representation of proteins as networks and their analyses has given us a unique perspective on various aspects such as protein structure organization, stability, folding, function, oligomerization and so on. A variety of network properties like the degree distribution, clustering coefficient, characteristic path lengths, clusters, and hubs have been investigated. Most of these studies are carried out on protein structures alone. However, the interaction of proteins with other biopolymers like nucleic acids is vital for many crucial biological processes like transcription and translation. In this thesis, we have attempted to address this problem by constructing and analyzing combined graphs of the structures of protein and DNA.
Also, in almost all of the PSN studies, the connections have been made solely on the basis of geometric criteria. In the later part of the thesis, we have taken PSN a step further by defining the non–covalent connections based on chemical considerations in the form of the energies of interactions. The thesis contains two sections. The first part mainly involves the construction and application of PSNs to study DNA binding proteins. The DNA binding proteins are involved in several high fidelity processes like DNA recombination, DNA replication, and transcription. Although the protein–DNA interfaces have been extensively analyzed using pair–wise interactions, we gain additional global perspective from network approach. Furthermore, most of the earlier investigations have been carried out from the protein point of view (protein centric) and the present network approach aims to combine both the protein centric and the DNA centric view points by construction and analyses of protein–DNA graphs. These studies are described in Chapters 3 and 4. The second part of the thesis discusses the development, characterization, and application of protein structure networks based on non–covalent interaction energies. The investigations are presented in chapters 5 and 6. Chapter 3 discusses the development of Protein–DNA Graphs (PDGs) where the protein–DNA interfaces are represented as networks. PDG is a bipartite network in which amino acids form a set of nodes and the nucleotides form the other set. The extent of interaction between the two diverse types of biopolymers is normalized to define the strength of interaction. Edges are then constructed based on the interaction strength between amino acids and nucleotides. Such a representation, reported here for the first time, provides a holistic view of the interacting surface.
The developed PDGs are further analyzed in terms of clusters of interacting residues and identification of highly connected residues, known as hubs, along the protein–DNA interface and discussed in terms of their interacting motifs. Important clusters have been identified in a set of protein–DNA complexes, where the amino acids interact with different chemical components of DNA such as phosphate, deoxyribose and base with varying degrees of connectivity. An analysis of such fragment based PDGs provided insights into the nature of protein–DNA interaction, which could not have been obtained by conventional pair–wise analysis. The predominance of deoxyribose–amino acid clusters in beta–sheet proteins, distinction of the interface clusters in helix–turn–helix and the zipper type proteins are some of the new findings from the analysis of PDGs. Additionally, a potential classification scheme has been proposed for protein–DNA complexes on the basis of their interface clusters. This classification scheme gives a general idea of how the proteins interact with different components of DNA in various complexes. The present graph–based method has provided a deeper insight into the analysis of the protein–DNA recognition mechanisms from both protein and DNA view points, thus throwing more light on the nature and specificity of these interactions (Sathyapriya, Vijayabaskar et al. 2008). Chapter 4 delineates the application of PSN to an important problem in molecular biology. An analysis of interface clusters from multimeric proteins provides a clue to the important residues contributing to the stability of the oligomers. One such prediction was made on the DNA binding protein under starvation from Mycobacterium smegmatis (Ms–Dps) using PSNs. Two types of trimers, Trimer A (tA) and Trimer B (tB) can be derived from the dodecamer because of the inherent three fold symmetry of the spherical crystal structure. 
The irreversible dodecamerization of these native Ms–Dps trimers, in vitro, is known to be directly associated with the bimodal function (DNA binding and iron storage) of this protein. Interface clusters which were identified from the PSNs of the derived trimers, allowed us to convincingly predict the residues E146 and F47 for mutation studies. The prediction was followed up by our experimental collaborators (Rakhi PC and Dipankar Chatterji), which led to the elucidation of the molecular mechanism behind the in vitro oligomerization of Ms–Dps. The F47E mutant was impaired in dodecamerization, and the double mutant (E146AF47E) was a native monomer in solution. These two observations suggested that the two trimers are important for dodecamerization and that the residues selected are important for the structural stability of the protein in vitro. From the structural and functional characterizations of the mutants, we have proposed an oligomerization pathway of Ms–Dps (Chowdhury, Vijayabaskar et al. 2008). The second part of the thesis involves the development, characterization (Chapter 5) and application (Chapter 6) of Protein Energy Networks (PENs). As mentioned above, the PSNs constructed on the geometric basis efficiently capture the topology and associated properties at the level of atom–atom contact. The chemistry, however, is not completely captured by these network representations, and a wealth of information can be extracted by incorporating the details of chemical interactions. This study is an advancement over the existing PSNs, in terms of edges being defined on the basis of interaction energies among the amino acids. This interaction energy is the resultant of various types of interactions within a protein. Use of such realistic interaction energies in a weighted network captures all the essential features responsible for maintaining the protein structure.
The methodology involved in representing proteins as interaction energy weighted networks, with realistic edge weights obtained from standard force fields is described in Chapter 5. The interaction energies were derived from equilibrium ensembles (obtained using molecular dynamics simulations) to account for the structural plasticity, which is essential for function elucidation. The suitability of this method to study single static structures was validated by obtaining interaction energies on minimized crystal structures of proteins. The PENs were then characterized using network parameters like edge weight distributions, clusters, hubs, and shortest paths. The PENs exhibited three distinct behaviors in terms of the size of the largest connected cluster as a function of interaction energy; namely, the pre–transition, transition, and post–transition regions, irrespective of the topology of the proteins. The pre–transition region (energies < –20 kJ/mol) comprises smaller clusters with mainly charged and polar residues as hubs. Crucial topological changes take place in the transition region (–10 to –20 kJ/mol), where the smaller clusters aggregate, through low energy van der Waals interactions, to form a single large cluster in the post–transition region (energies > –10 kJ/mol). These behaviors reinforce the concept that hydrophobic interactions hold together local clusters of highly interacting residues, keeping the protein topology intact (Vijayabaskar and Vishveshwara 2010). The applications of PENs in studying protein organization, allosteric communication, thermophilic stability and the structural relation of remote homologues of TIM barrel families have been outlined in Chapter 6.
In the first case, the weighted networks were used to identify stabilization regions in protein structures and hierarchical organization in the folded proteins, which may provide some insights into the general mechanism of protein folding and stabilization (Vijayabaskar and Vishveshwara 2010). In the second case the features of communication paths in proteins were elucidated from PENs, and specific paths have been extensively discussed in the case of PDZ domain, which is known to bring together protein partners, mediating various cellular processes. Changes in PEN upon ligand binding, resulting in alterations of the shortest paths (energetically most favorable paths) for a small fraction of residues, indicated that allosteric communication is anisotropic in PDZ. The observations also establish that the shortest paths between functionally important sites traverse through key residues in PDZ2 domain. Furthermore, shortest paths in PENs provide us the exact pathways of communication between residues. Although the communication in PDZ has been extensively investigated, detailed information of pathways at the energy level has emerged for the first time from the present study from PEN analysis (Vijayabaskar and Vishveshwara 2010). In the third case, a set of thermophilic and mesophilic proteins were compared to determine the factors responsible for their thermal stability from a network perspective using PENs. The sub–graph parameters such as cluster population, hubs and cliques were the prominent contributing factors for thermal stability. Also, the thermophilic proteins have a better–packed hydrophobic core. The property of thermophilic protein to increase stability by increasing the connectivity but retain conformational flexibility is discussed from a cliques and communities (higher order inter–connection of residues) perspective (Vijayabaskar and Vishveshwara 2010).
Finally, the remote homologues from the TIM barrel fold have been analyzed using PENs to identify the interactions responsible for the maintenance of the fold despite low sequence similarity. A study of conserved interactions in family specific PENs reveals that the formation of the central beta barrel is vital for the TIM barrel formation. The beta barrel is being formed by either conserved long range electrostatic interactions or by tandem arrangement of low energy hydrophobic interactions. The contributions of helix–sheet and helix–helix interactions are not conserved in the families. This study suggests that the sequentially near residues forming the helix–sheet interactions are common in many folds and hence formed despite non–conservation, whereas formation of beta barrel requires long range interactions, thus more conserved within the families. The thesis also consists of an appendix in which a web–tool, developed to express proteins as networks and analyze these networks using different network parameters is discussed. The web based program–GraProStr allows us to represent proteins as structure graphs/networks by considering the amino acid residues as nodes and representing non–covalent interactions among them as edges. The different networks (classified based on edge definition) which can be obtained using GraProStr are Protein Side–chain Networks (PScNs), Cα/Cβ distance based networks (PcNs) and Protein–Ligand Networks (PLNs). The parameters which can be generated include clusters, hubs, cliques (rigid regions in proteins) and communities (group of cliques). It is also possible to differentiate the above mentioned parameters for monomers and interfaces in multimeric proteins. The well tested tool is now made available to the scientific community for the first time. GraProStr is available online and can be accessed from http://vishgraph.mbu.iisc.ernet.in/GraProStr/index.html.
With a variety of structure networks, and a set of easily interpretable network parameters, GraProStr can be useful in analyzing protein structures from a global paradigm (Vijayabaskar, Vidya et al. 2010). In summary, we have extensively studied DNA binding proteins using side–chain based protein structure networks and by integrating the DNA molecule into the network. Also, we have upgraded the existing methodology of generating structure networks, by representing both the geometry and the chemistry of residues as interaction energies among them. Using this energy based network we have studied diverse problems like protein structure formation, stabilization, and allosteric communication in detail. The above mentioned methodologies are a considerable advancement over existing structure network representations and have been shown in this thesis to shed more light on the structural features of proteins.
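Illustrative note: the abstract above describes structure networks in which amino acid residues are nodes and an edge is drawn when two residues satisfy a distance criterion, and it analyzes them through hubs and connected clusters. The following is a minimal sketch of that general idea only, not the thesis's or GraProStr's implementation. The function names, the toy coordinates, the distance cutoffs, and the hub degree threshold are all assumptions chosen for illustration.

```python
from itertools import combinations
from math import dist


def build_network(coords, cutoff):
    """Link residues i and j when their representative points
    (e.g. one point per residue) lie within `cutoff` of each other."""
    adj = {i: set() for i in range(len(coords))}
    for i, j in combinations(range(len(coords)), 2):
        if dist(coords[i], coords[j]) <= cutoff:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def hubs(adj, min_degree):
    """Residues with at least `min_degree` neighbours (assumed hub rule)."""
    return sorted(n for n, nbrs in adj.items() if len(nbrs) >= min_degree)


def largest_cluster_size(adj):
    """Size of the largest connected component, via depth-first search."""
    seen, best = set(), 0
    for start in adj:
        if start in seen:
            continue
        stack, size = [start], 0
        seen.add(start)
        while stack:
            node = stack.pop()
            size += 1
            for nbr in adj[node]:
                if nbr not in seen:
                    seen.add(nbr)
                    stack.append(nbr)
        best = max(best, size)
    return best


# Hypothetical toy coordinates (arbitrary units), one point per residue.
coords = [(0, 0, 0), (3, 0, 0), (6, 0, 0), (3, 3, 0), (20, 0, 0), (23, 0, 0)]

# Scan the cutoff to see how the largest cluster grows as edges are added.
for cutoff in (2.0, 4.0, 5.0):
    network = build_network(coords, cutoff)
    print(cutoff, hubs(network, 3), largest_cluster_size(network))
```

Scanning the cutoff in this way loosely mirrors the kind of analysis the abstract reports for energy-weighted networks, where the size of the largest cluster is tracked as the edge criterion is relaxed; here a plain distance threshold stands in for the interaction energy.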
2

Vijayabaskar, M. S. "Protein-DNA Graphs And Interaction Energy Based Protein Structure Networks." Thesis, 2011. http://etd.iisc.ernet.in/handle/2005/1904.

Full text
Abstract:
Proteins orchestrate a number of cellular processes either alone or in concert with other biomolecules like nucleic acids, carbohydrates, and lipids. They exhibit an intrinsic ability to fold de novo to their functional states. The three–dimensional structure of a protein, dependent on its amino acid sequence, is important for its function. Understanding this sequence– structure–function relationship has become one of the primary goals in biophysics. Various experimental techniques like X–ray crystallography, Nuclear Magnetic Resonance (NMR), and site–directed mutagenesis have been used extensively towards this goal. Computational studies include mainly sequence based, and structure based approaches. The sequence based approaches such as sequence alignments, phylogenetic analysis, domain identification, statistical coupling analysis etc., aim at deriving meaningful information from the primary sequence of the protein. The structure based approaches, on the other hand, use structures of folded proteins. Recent advances in structure determination and efforts by various structural consortia have resulted in an enormous amount of structures available for analysis. Innumerable observations such as the allowed and disallowed regions in the conformations of a peptide unit, hydrophobic core in globular proteins, existence of regular secondary structures like helices, sheets, and turns and a limited fold space have been landmarks in understanding the characteristics of protein structures. The uniqueness of protein structure is attained through non–covalent interactions among the constituent amino acids. Analyses of protein structures show that different types of non–covalent interactions like hydrophobic interactions, hydrogen bonding, salt bridges, aromatic stacking, cation–π interactions, and solvent interactions hold protein structures together. 
Although such structure analyses have provided a wealth of information, they have largely been performed at a pair–wise level and an investigation involving such pair–wise interactions alone is not sufficient to capture all the determinants of protein structures, since they happen at a global level. This consideration has led to the development of graphs/networks for proteins. Graphs or Networks are a collection of nodes connected by edges. Protein Structure Networks (PSNs) can be constructed using various definitions of nodes and edges. Nodes may vary from atoms to secondary structures in Synopsis proteins, and the edges can range from simple atom–atom distances to distance between secondary structures. To study the interplay of amino acids in structure formation, the most commonly used PSNs consider amino acids as nodes. The criterion for edge definition, however, varies. PSNs can be constructed at a course grain level by considering the distances between Cα/Cβ atoms, any side–chain atoms, or the centroids of the amino acids. At a finer level, PSNs can be constructed using atomic details by considering the interaction types or by computing the extent of interaction between amino acids. Representation of proteins as networks and their analyses has given us a unique perspective on various aspects such as protein structure organization, stability, folding, function, oligomerization and so on. A variety of network properties like the degree distribution, clustering coefficient, characteristic path lengths, clusters, and hubs have been investigated. Most of these studies are carried out on protein structures alone. However, the interaction of proteins with other biopolymers like nucleic acids is vital for many crucial biological processes like transcription and translation. In this thesis, we have attempted to address this problem by constructing and analyzing combined graphs of the structures of protein and DNA. 
Also, in almost all PSN studies, the connections have been made solely on the basis of geometric criteria. In the latter part of the thesis, we have taken PSNs a step further by defining the non-covalent connections on the basis of chemical considerations, in the form of interaction energies. The thesis contains two sections. The first part mainly involves the construction and application of PSNs to study DNA-binding proteins. DNA-binding proteins are involved in several high-fidelity processes such as DNA recombination, DNA replication, and transcription. Although protein-DNA interfaces have been extensively analyzed using pair-wise interactions, a network approach offers an additional, global perspective. Furthermore, most of the earlier investigations have been carried out from the protein point of view (protein-centric), whereas the present network approach aims to combine the protein-centric and DNA-centric viewpoints by constructing and analyzing protein-DNA graphs. These studies are described in Chapters 3 and 4. The second part of the thesis discusses the development, characterization, and application of protein structure networks based on non-covalent interaction energies. These investigations are presented in Chapters 5 and 6. Chapter 3 discusses the development of Protein-DNA Graphs (PDGs), in which protein-DNA interfaces are represented as networks. A PDG is a bipartite network in which the amino acids form one set of nodes and the nucleotides form the other. The extent of interaction between the two diverse types of biopolymers is normalized to define the strength of interaction. Edges are then constructed on the basis of the interaction strength between amino acids and nucleotides. Such a representation, reported here for the first time, provides a holistic view of the interacting surface.
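The bipartite construction described above can be sketched as follows. The abstract does not give the normalization used for the interaction strength, so the formula below (contacts scaled by the geometric mean of two per-residue normalization values) is only a stand-in, and the names `interaction_strength` and `build_pdg`, the `sizes` table, the threshold, and the toy data are hypothetical.

```python
import math


def interaction_strength(n_contacts, size_a, size_b):
    """Stand-in normalization: percentage of contacts relative to the
    geometric mean of two normalization values (an assumption, not the
    thesis formula)."""
    return 100.0 * n_contacts / math.sqrt(size_a * size_b)


def build_pdg(contacts, sizes, min_strength):
    """Build a bipartite graph from amino acid-nucleotide contact counts.

    `contacts` maps (amino_acid, nucleotide) pairs to contact counts.
    `sizes` maps each node to its (hypothetical) normalization value.
    Returns the two node sets and a dict of weighted edges.
    """
    amino_acids, nucleotides, edges = set(), set(), {}
    for (aa, nt), n_contacts in contacts.items():
        strength = interaction_strength(n_contacts, sizes[aa], sizes[nt])
        if strength >= min_strength:
            amino_acids.add(aa)
            nucleotides.add(nt)
            edges[(aa, nt)] = strength
    return amino_acids, nucleotides, edges


# Toy interface data (invented for illustration).
contacts = {("ARG12", "DG5"): 6, ("ARG12", "DC6"): 2, ("SER30", "DT7"): 1}
sizes = {"ARG12": 36, "SER30": 16, "DG5": 25, "DC6": 25, "DT7": 25}
aa_nodes, nt_nodes, pdg_edges = build_pdg(contacts, sizes, min_strength=6.0)
```

Because edges only ever join an amino acid to a nucleotide, the two node sets stay disjoint, which is what makes the graph bipartite.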
The PDGs are further analyzed in terms of clusters of interacting residues and highly connected residues, known as hubs, along the protein-DNA interface, and are discussed in terms of their interacting motifs. Important clusters have been identified in a set of protein-DNA complexes in which the amino acids interact with different chemical components of DNA, such as phosphate, deoxyribose, and base, with varying degrees of connectivity. An analysis of such fragment-based PDGs provided insights into the nature of protein-DNA interaction that could not have been obtained by conventional pair-wise analysis. The predominance of deoxyribose-amino acid clusters in beta-sheet proteins and the distinction between the interface clusters of helix-turn-helix and zipper-type proteins are some of the new findings from the analysis of PDGs. Additionally, a potential classification scheme has been proposed for protein-DNA complexes on the basis of their interface clusters. This classification scheme gives a general idea of how proteins interact with different components of DNA in various complexes. The present graph-based method has provided deeper insight into protein-DNA recognition mechanisms from both the protein and the DNA viewpoints, thus throwing more light on the nature and specificity of these interactions (Sathyapriya, Vijayabaskar et al. 2008). Chapter 4 delineates the application of PSNs to an important problem in molecular biology. An analysis of interface clusters from multimeric proteins provides a clue to the important residues contributing to the stability of the oligomers. One such prediction was made using PSNs on the DNA-binding protein under starvation from Mycobacterium smegmatis (Ms-Dps). Two types of trimers, Trimer A (tA) and Trimer B (tB), can be derived from the dodecamer because of the inherent three-fold symmetry of the spherical crystal structure.
The irreversible dodecamerization of these native Ms-Dps trimers in vitro is known to be directly associated with the bimodal function (DNA binding and iron storage) of this protein. Interface clusters identified from the PSNs of the derived trimers allowed us to predict the residues E146 and F47 for mutation studies. The prediction was followed up by our experimental collaborators (Rakhi PC and Dipankar Chatterji), which led to the elucidation of the molecular mechanism behind the in vitro oligomerization of Ms-Dps. The F47E mutant was impaired in dodecamerization, and the double mutant (E146A/F47E) was a native monomer in solution. These two observations suggested that the two trimers are important for dodecamerization and that the selected residues are important for the structural stability of the protein in vitro. From the structural and functional characterization of the mutants, we have proposed an oligomerization pathway of Ms-Dps (Chowdhury, Vijayabaskar et al. 2008). The second part of the thesis involves the development and characterization (Chapter 5) and application (Chapter 6) of Protein Energy Networks (PENs). As mentioned above, PSNs constructed on a geometric basis efficiently capture the topology and associated properties at the level of atom-atom contacts. The chemistry, however, is not completely captured by these network representations, and a wealth of information can be extracted by incorporating the details of chemical interactions. This study is an advance over existing PSNs in that edges are defined on the basis of interaction energies among the amino acids. This interaction energy is the resultant of the various types of interactions within a protein. Using such realistic interaction energies in a weighted network captures the essential features responsible for maintaining the protein structure.
The methodology for representing proteins as interaction-energy-weighted networks, with realistic edge weights obtained from standard force fields, is described in Chapter 5. The interaction energies were derived from equilibrium ensembles (obtained using molecular dynamics simulations) to account for structural plasticity, which is essential for elucidating function. The suitability of this method for studying single static structures was validated by obtaining interaction energies from minimized crystal structures of proteins. The PENs were then characterized using network parameters such as edge weight distributions, clusters, hubs, and shortest paths. Irrespective of the topology of the proteins, the PENs exhibited three distinct behaviors in terms of the size of the largest connected cluster as a function of interaction energy: the pre-transition, transition, and post-transition regions. The pre-transition region (energies < -20 kJ/mol) comprises smaller clusters with mainly charged and polar residues as hubs. Crucial topological changes take place in the transition region (-10 to -20 kJ/mol), where the smaller clusters aggregate, through low-energy van der Waals interactions, to form a single large cluster in the post-transition region (energies > -10 kJ/mol). These behaviors reinforce the concept that hydrophobic interactions hold together local clusters of highly interacting residues, keeping the protein topology intact (Vijayabaskar and Vishveshwara 2010). The applications of PENs in studying protein organization, allosteric communication, thermophilic stability, and the structural relation of remote homologues of TIM barrel families are outlined in Chapter 6.
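The transition behavior described above can be explored by tracking the size of the largest connected cluster as the energy cutoff is relaxed. The sketch below is one minimal way to do this with a union-find structure; the function name `largest_cluster_size`, the convention that an edge is kept when its energy is at or below the cutoff, and the toy energies (in kJ/mol) are assumptions for illustration rather than the procedure used in the thesis.

```python
def largest_cluster_size(n_residues, energy_edges, cutoff):
    """Size of the largest connected cluster when only edges with
    interaction energy <= cutoff (i.e. at least that favourable) are kept.

    `energy_edges` is a list of (i, j, energy) tuples with residue
    indices in range(n_residues).
    """
    parent = list(range(n_residues))

    def find(x):
        # Follow parent links to the root, compressing the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j, energy in energy_edges:
        if energy <= cutoff:
            root_i, root_j = find(i), find(j)
            if root_i != root_j:
                parent[root_i] = root_j

    sizes = {}
    for residue in range(n_residues):
        root = find(residue)
        sizes[root] = sizes.get(root, 0) + 1
    return max(sizes.values())


# Toy network: three strongly bound pairs joined by weaker interactions.
toy_edges = [
    (0, 1, -35.0), (2, 3, -28.0), (4, 5, -22.0),  # strong local pairs
    (1, 2, -15.0), (3, 4, -12.0),                 # weaker bridging edges
    (0, 5, -4.0),
]
profile = {c: largest_cluster_size(6, toy_edges, c) for c in (-25, -20, -15, -10)}
```

In this toy case the largest cluster stays small at stringent cutoffs and spans all residues once the weaker bridging edges are admitted, which mirrors the qualitative picture of small clusters merging in the transition region.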
In the first case, the weighted networks were used to identify stabilization regions in protein structures and hierarchical organization in folded proteins, which may provide some insight into the general mechanism of protein folding and stabilization (Vijayabaskar and Vishveshwara 2010). In the second case, the features of communication paths in proteins were elucidated from PENs, and specific paths are discussed extensively for the PDZ domain, which is known to bring together protein partners and mediate various cellular processes. Changes in the PEN upon ligand binding, which altered the shortest paths (the energetically most favorable paths) for a small fraction of residues, indicated that allosteric communication is anisotropic in PDZ. The observations also establish that the shortest paths between functionally important sites traverse key residues in the PDZ2 domain. Furthermore, shortest paths in PENs provide the exact pathways of communication between residues. Although communication in PDZ has been extensively investigated, detailed information on pathways at the energy level has emerged for the first time from the present PEN analysis (Vijayabaskar and Vishveshwara 2010). In the third case, a set of thermophilic and mesophilic proteins was compared to determine the factors responsible for thermal stability from a network perspective using PENs. Sub-graph parameters such as cluster population, hubs, and cliques were the prominent contributing factors for thermal stability. The thermophilic proteins also have a better-packed hydrophobic core. The ability of thermophilic proteins to increase stability by increasing connectivity while retaining conformational flexibility is discussed from the perspective of cliques and communities (higher-order inter-connections of residues) (Vijayabaskar and Vishveshwara 2010).
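The shortest-path analysis mentioned above for the PDZ domain can be illustrated with Dijkstra's algorithm on an energy-weighted graph. The abstract does not state how interaction energies are converted into path costs, so the sketch assumes a cost of 1/|E| for each favorable (negative) energy, which makes strongly interacting pairs cheap to traverse. The function name `shortest_path`, the cost conversion, and the toy residues and energies are hypothetical.

```python
import heapq


def shortest_path(energy_edges, source, target):
    """Dijkstra's algorithm on an undirected energy-weighted graph.

    `energy_edges` is a list of (a, b, energy) tuples. Only favourable
    (negative) energies are used, with cost 1/|energy| per edge
    (an assumed conversion). Returns the node list or None.
    """
    graph = {}
    for a, b, energy in energy_edges:
        if energy >= 0:
            continue  # skip non-favourable interactions
        cost = 1.0 / abs(energy)
        graph.setdefault(a, []).append((b, cost))
        graph.setdefault(b, []).append((a, cost))

    best = {source: 0.0}
    previous = {}
    queue = [(0.0, source)]
    while queue:
        dist, node = heapq.heappop(queue)
        if node == target:
            break
        if dist > best.get(node, float("inf")):
            continue  # stale queue entry
        for neighbour, cost in graph.get(node, []):
            new_dist = dist + cost
            if new_dist < best.get(neighbour, float("inf")):
                best[neighbour] = new_dist
                previous[neighbour] = node
                heapq.heappush(queue, (new_dist, neighbour))

    if target not in best:
        return None
    path = [target]
    while path[-1] != source:
        path.append(previous[path[-1]])
    return path[::-1]


# Toy energies in kJ/mol (invented for illustration).
toy_edges = [("A", "B", -40.0), ("B", "C", -40.0), ("A", "C", -10.0), ("C", "D", -20.0)]
path = shortest_path(toy_edges, "A", "D")
```

Here the route through B is preferred over the direct but weak A-C contact, because two strong interactions cost less in total than one weak interaction under the assumed conversion.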
Finally, the remote homologues from the TIM barrel fold have been analyzed using PENs to identify the interactions responsible for maintaining the fold despite low sequence similarity. A study of conserved interactions in family-specific PENs reveals that the formation of the central beta barrel is vital for TIM barrel formation. The beta barrel is formed either by conserved long-range electrostatic interactions or by a tandem arrangement of low-energy hydrophobic interactions. The contributions of helix-sheet and helix-helix interactions are not conserved within the families. This study suggests that the sequentially near residues forming the helix-sheet interactions are common to many folds and hence form despite non-conservation, whereas formation of the beta barrel requires long-range interactions, which are therefore more conserved within the families. The thesis also includes an appendix that discusses a web tool developed to express proteins as networks and to analyze these networks using different network parameters. The web-based program, GraProStr, represents proteins as structure graphs/networks by considering the amino acid residues as nodes and the non-covalent interactions among them as edges. The networks that can be obtained using GraProStr (classified by edge definition) are Protein Side-chain Networks (PScNs), Cα/Cβ distance-based networks (PcNs), and Protein-Ligand Networks (PLNs). The parameters that can be generated include clusters, hubs, cliques (rigid regions in proteins), and communities (groups of cliques). It is also possible to differentiate these parameters for monomers and interfaces in multimeric proteins. The well-tested tool is now made available to the scientific community for the first time. GraProStr is available online and can be accessed from http://vishgraph.mbu.iisc.ernet.in/GraProStr/index.html.
With a variety of structure networks and a set of easily interpretable network parameters, GraProStr can be useful in analyzing protein structures from a global perspective (Vijayabaskar, Vidya et al. 2010). In summary, we have extensively studied DNA-binding proteins using side-chain-based protein structure networks and by integrating the DNA molecule into the network. We have also extended the existing methodology for generating structure networks by representing both the geometry and the chemistry of residues as interaction energies among them. Using this energy-based network, we have studied diverse problems such as protein structure formation, stabilization, and allosteric communication in detail. These methodologies are a considerable advance over existing structure network representations and have been shown in this thesis to shed more light on the structural features of proteins.
3

Dighe, Anasuya. "Studies on Dynamic Plasticity of Ligand Binding Sites in Proteins." Thesis, 2017. http://etd.iisc.ac.in/handle/2005/4236.

Abstract:
Molecular recognition between proteins and their associated ligands involves ligand-induced rewiring of the protein, which enables the formation of a stable protein-ligand complex. The studies presented in this thesis address the conformational plasticity inherent to proteins, by virtue of which they adapt to diverse ligands and orchestrate complex biological processes like signal transduction, transcription and protein-protein interaction. Adopting network-theory-based formalisms for understanding protein-ligand associations involves deconstructing the three-dimensional structure of a protein in terms of nodes and edges. With this view, Protein Structure Networks (PSNs) of ligand-bound complexes are studied by considering their side-chain non-covalent interactions. Agonist- and antagonist-bound G-Protein Coupled Receptors (GPCRs) are investigated to gain mechanistic insights into allostery and its role in signal transduction. The degree of similarity between the PSNs of these complexes is quantified by means of a Network Similarity Score (NSS). The physical nature of these networks is inspected by subjecting them to perturbations, and the major players in maintaining the stability of such networks are identified. Residue-wise groupings (at the backbone and side-chain level) are obtained by applying graph spectral methods. All-atom Molecular Dynamics (MD) simulations are carried out to gain a better understanding of protein-ligand binding by analysing conformational ensembles of these complexes. In this context, two members of a highly versatile ligand-inducible transcription factor superfamily, the Nuclear Receptors (NRs), are studied; they are known to exhibit extremes of ligand-binding behavior, ranging from promiscuity to specificity. Diverse ligands are known to bind to proteins, and the overall nature of their binding sites is investigated. In particular, similarities among the binding sites of diverse proteins are analysed by using PocketMatch.
Percolation of these similarities to regions surrounding the binding site is reported and examples depicting this extended similarity are discussed. Overall, studies presented in this thesis provide a structural perspective into the adaptability of proteins for recognizing diverse ligands and undergoing local or global re-organizations in their framework to regulate complex biological processes.