Genomics of lncRNA

Genomic characterisation of lncRNAs in health and disease

Another strategy for understanding the functions of 1000s of new lncRNA genes is to study their properties as a group. We use bioinformatic analysis of large genomic datasets to try to characterise lncRNAs, divide them into classes, and predict their functions. Right now, we are pursuing several main approaches.

First, we are attempting to predict lncRNAs that are involved in the progression of cancer. This work is carried out within the PanCancer Analysis of Whole Genomes (PCAWG) effort by the International Cancer Genome Consortium. The software tool we developed, ExInAtor, uses maps of mutations from thousands of tumours to identify cancer driver lncRNAs. The candidates predicted in this way can be tested experimentally.


Outline of the ExInAtor method for discovering cancer driver genes with elevated exonic mutation density.

Another approach we have is driven by the idea that an lncRNA’s function is strongly reflected in its localisation within the cell. For example, chromatin regulatory lncRNAs must presumably be located in the nucleus,while those regulating mRNA translation should be located in the cytosol. We are creating genome-wide maps of lncRNA sub-cellular localisation to predict their functions and attempt to identify the factors that control this process.


Mapping lncRNAs at subcellular resolution. (A) Organ locations of cell lines used; (B) Available cellular compartments; (C) Relative Concentration Index (RCI), a measure for estimating subcellular localisation of RNA transcripts.