Mini Review Volume 5 Issue 3
Mind the Byte, Spain
Correspondence: Melchor Sanchez-Martinez, Mind the Byte, Parc Cientific de Barcelona, c/Baldiri Reixac, 4-8, 08028, Barcelona, Spain, Tel +34 934020938
Received: February 20, 2016 | Published: March 10, 2017
Citation: Nonell-Canals A, Sanchez-Martinez M. Intrinsically disordered proteins as drug targets. MOJ Proteomics Bioinform. 2017;5(2):69-73. DOI: 10.15406/mojpb.2017.05.00157
Intrinsically disordered proteins (IDPs) are characterized by a lack of folded structure. Since their identification more than a decade ago, they were designed as potential drug targets. However, nowadays, only few therapeutic molecules have been designed against them. Due to the nature of these proteins bioinformatics methods could have a key role disentangling IDPs related issues, which is key to design new therapeutic agents against them.
Keywords: intrinsically disordered proteins (Idps), small molecules, drug design, druggability, protein dynamics, protein flexibility, protein cavities, drug binding
The challenge of targeting intrinsically disordered proteins
Drug design is a long complex process. Translating basic research to the market (passing through drug discovery, preclinical and clinical studies) takes tens of years and costs billions of dollars. Computational techniques help to shorten the process economizing it. During the last three decades Computer-aided drug discovery/design methods (CAAD) have played a major role in the development of therapeutically important molecules.1,2 The importance of CADD techniques have been growing due to the drug discovery landscape has changed enormously getting complex.
Recent scientific and technological advances have allowed changing the traditional drug discovery paradigm, as the researchers has now access to myriads of high throughput biological data.3 CADD approaches complement experimental screens guiding early-stage drug discovery efforts towards the so-called rational drug design.4
In rational drug design, all known theoretical and experimental knowledge of the system under study is used. Of great importance is the incorporation of the dynamical nature of the proteins. The structure-function paradigm must be substituted by the structure-(dynamics)-function, as proteins are flexible entities, and thus move. Its function is closely related to flexibility and to carry out any function is necessary a conformational change. For instance, the determined conformations of a protein often differ in its ligand-bound and -unbound forms. Protein conformational changes can open new ligand binding sites, whose exploration is key to fully assess the efficacy of a drug as well as to identify non-specific targeting with possible undesired effects. It is not only needed to block a certain conformation of a protein, it is necessary to modulate its dynamics.5–8 Targeting protein dynamics is a must.
A complex example where dynamics is crucial is constituted by Intrinsically Disordered Proteins (IDPs).9,10 For these proteins, their dynamic nature plays an essential role due to fundamentally exist as conformational ensembles. A similar example could be allosteric proteins for which protein flexibility have a great role been exploited to drug design.11,12
Intrinsically disordered proteins
IDPs are an emerging family of proteins whose most characteristic feature is that they don't present a folded structure, and thus challenge the sequence-structure-function paradigm.13–15 This lack of stable structure, over the entire protein length or in some regions (also named Intrinsically Disordered Regions or IDRs), provides them a structural plasticity which is not achievable by ordered proteins that is essential to carry out their cellular function.16,17 The paradigm should change to sequence-disorder-(dynamics)-function.17
Structural disorder is abundant in all species, although its level is higher in eukaryotes than in prokaryotes. IDPs (meaning long disordered regions of at least 30 residues) represent around the 30% of eukaryotic proteomes, and play key roles in a wide range of cellular processes, being involved in numerous human pathologies.15,16 Despite their importance, the IDPs study started at the end of the 20th century, and has been only in the last decade and a half when their existence has been widely accepted, becoming nowadays a hot scientific topic at both experimental and computational level.
Some regions of IDPs can adopt secondary structures, at least for a transient time, which is closely related to its function and to their interactions with binding partners. These structured elements can exist in the free protein, emerging upon conformational transitions or the combinations of both.18–20 The order-disordered transitions suffered by IDPs can be produced by several factors such as exogenous perturbations (for instance pH,21–23 Tª22,24–26 or macromolecular crowding)23,27–30 or binding (multiple) diverse molecular entities, such as proteins18,31,32 or small molecules.10,33–35 As they are very promiscuous molecules, IDPs can bind multiple partners.14
The conformational flexibility showed by IDPs is key to understand their behavior. There are growing evidences suggesting that when it is implicated in disease, is pathogenic causing so-called “conformational diseases” by means of misfolding, missignaling or even oligomerization processes.36–39 For instance, the conformational flexibility of the N-terminal huntingtin is responsible of the formation of transient helical structures to Oligomerization.40,41 Targeting the conformational variability, i.e., targeting the dynamics of IDPs is a potential therapeutic strategy and in that sense, IDPs and their binding partners9,10,42 have been designed as targetable proteins.
The widespread implication of IDPs in disease, make them perfect candidates to drug design, but unluckily, it is not happening and there are only few drugs targeting IDPs. This fact is a great contradiction as there are bioinformatics studies showing that Post Translational Modifications (PTMs) prefer disordered regions or that 79% of cancer associated proteins and 57% of the identified cardiovascular disease-associated proteins contain disordered regions of more than 30 residues.43,44 Compared with drug design pipe-lines for ordered proteins, for IDPs these processes remain in their infancy.44,45 Actually, the drugs related with IDPs consist on few molecules and some small peptides.10,42,44 This happens, among other reasons, because how IDPs perform their diverse function is not well understood. Thus, understand the functional and conformational properties of IDPs, is of great interest for a wide range of biological processes. Studying the structural determinants of this class of proteins could lead to understand their role in both healthy and altered-disease-associated cellular environments.
Unfortunately, traditional computational and experimental approaches have been hampered by diverse challenges. The most important is that IDPs present disordered regions that make impossible the use of crystallographic and cryo-electron microscopy methods as well as computational static methods such as rigid docking (over a single proteic conformation) because these techniques are not able to capture the conformational diversity of IDPs. IDPs exist in an ensemble of states, meaning that an ensemble of inter-converting conformers is required to describe the conformational behavior of IDPs.
The dynamic behavior of IDPs makes necessary the usage of “dynamical techniques” to their study. In this regard, important advances have been made towards IDPs understanding using spectroscopic techniques, such as Nuclear Magnetic Resonance (NMR).16,18,46 Small Angle X-ray Scattering47,48 or single-molecule fluorescence28,49–51 as well as employing atomistic and coarse-grained simulations.20,52–55
By using the above-mentioned techniques some approximations have been reported to design drugs targeting IDPs. The common approaches are (I) directly bind to the disordered ensemble neutralizing the protein function/dysfunction and (II) bind to a binding partner and inhibit IDP binding or stabilize its bound state.10 They are aimed to targeting conformational transitions and regulatory elements besides of modulating Post Translational Modifications (PTMs).10 These strategies currently involved the development of small molecules and peptide inhibitors, but we cannot forget biologics. The design of monoclonal antibodies to capture IDPs plasticity could be also a great therapeutic opportunity to be further explored in the future. They are attractive therapeutic tools due to their high specificity and that they can be used to target protein-protein interactions.56 Besides, it has been shown that they are able to bind a peptide in different conformations.57 These findings suggest that could be possible to use them over multi-conformational entities such as IDPs.
Conformational landscape
Conformational ensembles, also known as structural ensembles, are the accessible set of structures at a certain temperature describing the proteins structure. They are powerful tools to represent the range of conformations that can be sampled by proteins, thus allowing for an explicit representation of the dynamics of the protein. They are indicators of the structural heterogeneity of proteins. They can be generated purely experimentally, or theoretically for instance using NMR or Molecular Dynamics (MD) related techniques or by fitting (theoretical) ensembles to experimental data.58–61
Conformational ensembles have been employed to study different aspects related to fundamental properties of proteins, such as molecular recognition or protein folding and to study IDPs their usage is a widely-accepted option. IDPs are a perfect example of an scenario in that simulations and experiments have to converge to generate representative conformational ensembles, and a lot of efforts are put on it.62 The flexibility of proteins has been widely studied both experimentally and computationally. There exist a wide range of experimental techniques suitable to explore different time scales and resolutions. In IDPs the number of useful techniques is reduced because of their special characteristics. In that sense, computational methods are useful even more than in ordered proteins.
Experimental procedures provide a lot of useful data and computational approaches need it to carry out simulations as more realistic as possible. However, for IDPs, where dynamics is so important, computation clearly has advantages over experiments as it can describe dynamics completely and it can characterize the full conformational energy landscape of proteins (although conformational substrates and the rates of inter conversion can be located experimentally, the transition pathway at atomic resolution is out of reach.63 The computational simulations can be used as a ‘virtual microscope’ to study processes or molecules in a cell that are not directly accessible in experiments.64 Sometimes the synergies between experiments and computation are not so clear as for IDPs, and are ignored. However, in this cases it is strongly evidenced and in that sense the usage of CAAD techniques for IDPs related drug discovery have an enormous importance.
A useful method to generate conformational ensembles is classical MD. MD is a powerful technique to study protein motions generating dynamically relevant conformational ensembles,65 however there are scenarios in that it at cannot be routinely employed, as when the conformational space is too vast (as in IDPs) or the dynamics too slow. In these cases, enhanced sampling methods as Metadynamics66 or Replica Exchange Molecular Dynamics (REMD)67 as well as Coarse-Grained techniques could be used to overpass this limitation. There are also other option such as Brownian dynamics (BD)68 and Monte Carlo (MC) based methods for instance Replica exchange Monte Carlo (REMC)69,70 or Markov Chain Monte Carlo (MCMC),55 that have been employed to study IDPs with good results.
Binding cavity detection and understanding
Therapeutic ligands usually accomplish their mission by binding to small cavities (binding sites or pockets) of target proteins. IDPs are not an exception. The conformational landscape exploration upon binding is key to design drugs against IDPs. Identify the binding cavities over the generated ensembles is a must. In other words, the cavities must be detected for each ensemble and for each of the found pockets analyzing their structural and functional characteristics. To that extent, usual CAAD techniques can be useful to estimate the druggability and the effects of small molecules on IDP interactions.10,34,71,72 These methods analyze the cavities over static structures coming from experiments or simulations such as DrugPred,73 Cavity74 or fpocket,75 or coupled to MD simulations and calculate the druggability “on-the -fly”, such as the recently developed JEDI.76
Zhang and coworkers44 have studied the ligand-binding cavities of diverse IDPs comparing some of their properties with those of ordered proteins. They concluded that IDPs are predicted to possess more binding cavities than ordered proteins of a similar length, showing different geometries. This agrees with previous studies which stated that IDPs possess greater surface and interface areas per residue than ordered proteins. Most importantly, they showed that the druggability of IDPs cavities may be comparable with those of ordered proteins. Besides they analyzed the conservation of the cavities along the different ensembles showing reasonable good results (higher than 50%).
The above-mentioned information is interesting but there are some issues that still should be addressed. For instance: IDPs binding sites can be targeted when they are completely disordered or only partially structured regions can host a ligand bound? There is a minimum percentage of ordered residues within the cavity necessary to drug binding? It is necessary that the binding pocket be embedded inside an ordered region or at least be flanked by ordered regions? , i.e., There are differences over IDPs and IDRs cavities? The structural characteristics of the binding pockets (size, volume, SASA, etc) rely on the disorder degree?
Fully understand IDPs binding processes will constitute the definitive step to rational design new drugs against them. In that sense the conformational landscape and the binding cavities exploration, upon ligand binding and in the unbound state, is the first step to reach that understanding.
IDPs seem to be perfect drug targets; however there is still a lack of knowledge as well as methodological gaps/flaws that difficult their usage. Nowadays there is quite clear how order to disorder transition and thus the adoption of secondary structure take place as well as its importance in disease. Moreover there are several techniques, or better said combinations of them, that constitute valid pipelines to their study and understanding.
However, direct targeting the conformational flexibility of IDPs is still in the very beginning. With a better understanding of the atomic details of the conformations of IDPs as well as the biophysical forces governing them, small molecules or even antibodies might be developed to stabilize distinct IDPs conformations or just to modulate IDPs interactions and thus their misfolding events.
There are still open structural and functional questions that should be addressed. More efforts should be done to address them, but we are in the correct way and attending to the great impact that IDPs have in human health we are assisting to crucial moments for our society. Be able to cope with disorder imply be able to fight against diseases ranging from cancer or neurodegenerative pathologies to orphan diseases, and computation has a lot to do on it. We reckon the actual and coming findings will bring IDPs the full recognition they deserve as drug targets, and hopefully several drugs will be designed in the next years targeting disorder and plasticity.
MSM thanks the Ministerio de Economia y Competitividad for a Torres Quevedo grant (PTQ-14-06579).
The author declares no conflict of interest.
©2017 Nonell-Canals, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.