Editorial Volume 1 Issue 1
Center for Computational Science, Miller School of Medicine, University of Miami, USA
Correspondence: Nicholas Tsinoremas, Center for Computational Science, University of Miami, Miami, FL 33146, Miller School of Medicine, University of Miami, Miami, FL 33136, USA
Received: April 28, 2014 | Published: May 2, 2014
Citation: Capobianco E, Tsinoremas N. Health or disease–why does “dark matter” matter more? J Investig Genomics. 2014;1(1):3-4. DOI: 10.15406/jig.2014.01.00002
The expression of protein-coding genes (mRNA) has been the benchmark of pathophysiological studies for decades. However, in recent years, this dogma has been challenged by novel data, namely non-coding RNAs (part of dark matter) and their interactions with binding proteins or other RNAs determine the cellular functions under both healthy and disease states and very intricate regulatory networks underlie complex diseases that require integrated analysis at systems scale. Understanding dark matter provides the opportunity to formulate new paradigms that govern biological systems and devise novel therapies and diagnostic tools. The findings of the ENCODE (Encyclopedia of DNA Elements) project1 are clear: evidence of wide spread and heterogeneous functionalization in both protein coding and non-protein coding transcribed regions of the human genome means that the attribute ‘junk DNA’ for the latter regions is no longer appropriate.2 As a result, the analysis of RNA-Seqtranscriptome profiles obtained under various healthy and pathological conditions should now be directed to ‘dark matter’ in order to decipher the complex dysregulation patterns which might characterize disease signatures.
In cancer, for instance, the well-known hallmarks that have governed the study of tumorgenesis will have to be redefined, taking into account aspects of dark matter such as the regulatory functions of long non-coding RNAs, as documented in.3‒5 Cancer diagnosis and therapy are in the highest need of novel input from next generation sequencing, which is acquiring a remarkably more clinical focus. More specifically, better diagnosis could come from a richer characterization of tumorigenesis phases by incorporating new oncogenic elements,6 leading to more accurate differentiation between and within cancers. As for therapy, it is expected that new antitumor drugs designed with increased RNA specificity (i.e. alternative transcript expression) will result in more targeted therapeutic outcomes. Therefore, studying the effects of interactions between non-coding RNAs and other entities such as proteins, metabolites, and other high-throughput genomic products, including RNAs, becomes a crucial research direction. Such data analysis forces the consideration of RNA activity at systems scale and the design and implementation of forms of control of different cancer states. For instance, non-coding RNAs are currently analyzed as decoys exerting control over the mRNA, thus modifying their functions.7 As such, they influence microRNA regulation in a network of RNA crosstalk focused on competing for mRNA binding, according the known ‘sponging’8 or also ‘competing endogenous RNA’ (ceRNA) law.9
How such mechanisms may work for a wider spectrum of RNA modulation is currently not known. The most recent examples of complex regulation networks are specific to microRNA functional synergistic,10 microRNA-program mediated,11 and transcriptionally cross-regulated ceRNA12 networks, with corresponding results holding at various scales. More refined RNA-binding protein interactions (RBPome) are expected from a deeper use of CLIP (cross-linking immunoprecipitation),13,14 and thus extra layers of complexity will be added to the current networks. The systems complexity of commonly-known regulation mechanisms underlying gene expression, involving alternative splicing, epigenetics, structural variation, among other factors, is clearly destined to change and novel inference tools will be needed. Both the systems role of non-coding RNAs and their specialization into many possible RNA-RNA associations and integrations (with clinical data) will be central to future studies. The greatest hope is that a better understanding of the web of complexRNA regulatory circuits15 and the achievement of an extended polymorphism map14 will offer better opportunities to identify new roles of RNA in disease, say markers or therapeutic targets.16,17
A major challenge related to the development of accurate analyses within a systems disease context is the inferring of regulation mechanisms, i.e. RNA-RNA interactions. This is an area of enormous clinical impact in that it depends on the study and understanding of the principles and dynamics of drug actions, in order to understand their characteristic efficacy, resistance, and toxicity. Network approaches provide great inference tools to address this challenge, by optimizing the potential of complex combinatorial treatments through identifying synergistic and antagonistic drug effects. Classical model-based approaches would instead require the specification of high-dimensional and multi-parameter settings, whose robustness and sensitivity properties require massive control to predict systems effects of perturbations (i.e. treatments) at various complexity levels (single versus multiple targeted drugs).
In conclusion, integrative inference methodologies and tools are able to address the dark matter in depth (range of expression levels) and breadth (types of variation). These tools require the development of new computational paradigms that are mostly hybrid approaches of known methods. They should be designed to address the nature of associations between bio-types, correlative or causal, the power of multiple testing and validation procedures, the application of ensemble optimization techniques to handle unknown complexity, and the integration of various types of data in the same model framework.
None.
Author declares that there is no conflict of interest.
©2014 Capobianco, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.