RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium: S. Katayama 1, *, Y. Tomaru 1, *, T. Kasukawa 1, K. Waki 1, 2, M. Nakanishi 1, M. Nakamura 1, H. Nishida 1, C. C. Yap 1, M. Suzuki 1, J. Kawai 1, 2, H. Suzuki 1, P. Carninci 1, 2 {dagger}, Y. Hayashizaki 1, 2{ddagger}, C. Wells 3, M. Frith 1, 3, T. Ravasi 3, K. C. Pang 3, 4, J. Hallinan 3, J. Mattick 3, D. A. Hume 3, L. Lipovich 5, S. Batalov6, P. G. Engström 7, *, Y. Mizuno 7, *, M. A. Faghihi 7, 8, A. Sandelin 7, A. M. Chalk 7, S. Mottagui-Tabar 7, 8, Z. Liang 7, B. Lenhard 7, and C. Wahlestedt 7, 8{ddagger}
* These authors contributed equally to this work.
1 Laboratory for Genome Exploration Research Group, RIKEN
Genomic Sciences Centre (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho,
Tsurumi-ku, Yokohama, 230-0045, Japan.
2 Genome Science Laboratory, Discovery and Research Institute,
RIKEN Wako Main Campus, 2-1 Hirosawa, Wako, 351-0198, Japan.
3 Australian Research Council Special Research Centre
for Functional and Applied Genomics, Institute for Molecular Bioscience,
The University of Queensland, Brisbane Qld, 4072, Australia.
4 T Cell Laboratory, Ludwig Institute for Cancer Research,
Austin and Repatriation Medical Centre, Heidelberg VIC 3084, Australia.
5 Genome Institute of Singapore, 60 Biopolis Street #02-01,
Singapore 138672.
6 Genomics Institute of the Novartis Research Foundation
(GNF), 10675 John Jay Hopkins Drive, San Diego, CA 92121, USA.
7 Center for Genomics and Bioinformatics, Karolinska
Institutet, Berzelius v. 35, S-17177 Stockholm, Sweden.
8 Scripps Florida, Jupiter, FL 33458, USA.
{dagger} To whom technical correspondence should be
addressed.
{ddagger} To whom general correspondence should be
addressed.
The Fantom 3 cDNAs are available through Y.H. ( e-mail: yosihide@gsc.riken.jp
).
Antisense transcription (transcription from the opposite strand to a protein-coding or sense strand) has been ascribed roles in gene regulation involving degradation of the corresponding sense transcripts (RNA interference), as well as gene silencing at the chromatin level. Global transcriptome analysis provides evidence that a large proportion of the genome can produce transcripts from both strands, and that antisense transcripts commonly link neighboring "genes" in complex loci into chains of linked transcriptional units. Expression profiling reveals frequent concordant regulation of sense/antisense pairs. We present experimental evidence that perturbation of an antisense RNA can alter the expression of sense messenger RNAs, suggesting that antisense transcription contributes to control of transcriptional outputs in mammals.
Although previous analysis of the mammalian transcriptome suggested
that up to 20% of transcripts may contribute to sense-antisense (S/AS)
pairs (1-3), large-scale cDNA sequencing in the FANTOM3
project (4) suggests that antisense transcription is
more widespread. To elucidate the function of S/AS pairs, we used the FANTOM3
data set to analyze their location in the mouse genome, the extent and
position of their overlap, and promoter architecture and regulation (4).
Analysis of the imprinted gnas locus in mice demonstrated
numerous sense and antisense transcripts expressed selectively depending
on parental chromosomal origin (5). However, paired S/AS
expression is not restricted to imprinted loci. For example, fig.
S1 shows the complex transcript overlap patterns of the HoxA locus
and complex transcript overlap patterns. To analyze such complex loci on
a genomewide scale, we generated a cDNA set comprising 158,807 full-length
transcripts obtained by merging the 102,801 Fantom-3 cDNA set ( http://fantom3.gsc.riken.jp/db/
) with mouse cDNAs from GenBank ( http://www.ncbi.nlm.nih.gov/Genbank/
) and clustering the cDNAs into transcriptional units (TUs), in
which members share sequence transcribed from the same strand. There were
50,111 overlapping transcript pairs, grouped into 29,780 nonredundant different
overlapping regions in 8331 TU pairs (9713 distinct representative overlapping
regions).
In the accompanying paper (4), transcription and
termination sites were identified. On the basis of this information, more
than 72% of all genome-mapped TUs (43,553) overlap with some cDNA, 5' or
3' expressed sequence tag (EST) sequence, or tag or tag-pair region
mapped to the opposite strand (Table 1). From the above
data, 4520 TU pairs contain full-length transcripts, which form S/AS pairs
on exons (Table 2). S/AS interaction might also occur
between immature RNAs (heterogeneous nuclear RNA, hnRNA) in the nucleus.
Furthermore, introns themselves can originate smaller RNA with biological
activity (6). In addition to transcript pairs that share
exons in opposite orientations, 4129 TU pairs were transcribed from different
strands of the same locus without apparently sharing overlapping exons
(Table 2). Although conservative, the combined S/AS prediction
is 1.5- to 2-fold greater than that from previous studies of mouse (1)
and human (2, 3, 7)
transcripts.
Together with microarray analysis (table S4), CAGE tag frequency data represent a de facto expression profiling approach, and allowed further validation of the coexpression of S/AS pairs (table S5). Randomly primed CAGE libraries identified more S/AS pairs than did oligo-dT primed CAGE libraries, suggesting that some polyadenylate [poly(A)] minus RNA transcripts or very long non-coding RNA transcripts are involved in S/AS (fig. S3, table S5). In keeping with an earlier report (14), S/AS CAGE tags were detected concordantly at greater than the expected frequency. These coexpressed S/AS pairs (table S6) show complex and tissue-specific regulation. Specific examples are considered in fig S4. Different types of genes are preferentially involved in S/AS regulation, with particular overrepresentation for cytoplasmic proteins and underrepresentation for membrane and extracellular proteins (tables S1 and S2) (15).
Possible regulatory interactions between S/AS pairs can be assessed
by monitoring correlation of expression with time. To assess such patterns
of regulation, we selected S/AS pairs for transcripts where the expression
was substantially increased or decreased during the activation of bone
marrow-derived macrophages by bacterial lipopolysaccharide (LPS)
(16). Out of 15 S/AS pairs tested, 7 showed various
patterns of reciprocal regulation (Fig. 1). Three S/AS
pairs showed proportional coregulation, where both members of the S/AS
pair decreased with time. Two pairs showed reciprocal regulation, where
one transcript concentration was induced while the other declined in response
to LPS. Two more regulated S/AS pairs showed no obvious connection. A transcriptional
map of these transcripts is available in fig.
S5.
Fig. 1. Time-course analysis of S/AS pairs.
Fig. 1. Time-course analysis of S/AS pairs.
Expression of S/AS RNA pairs was verified by reverse transcription polymerase chain reaction over 24 hours after activation of macrophages with LPS. R, correlation coefficient. y axis, relative expression; blue/pink symbols ratio, actual expression levels at time 0 hours. (A to G) Different S/AS transcript pairs.
Although concordant regulation is more frequent in S/AS pairs,
there are many examples in which the two transcripts are expressed reciprocally.
Examples were chosen to test the effect of disturbing the expression of
one or the other partner in the S/AS pair. Out of five S/AS pairs selected
from expression profiling (17), two produced divergent
coregulation. Figure 2A shows an example of reciprocal
regulation of two coding transcripts, Ddx39 (AK012002 [GenBank] ) and CD97
[a G protein-coupled receptor (AK004577 [GenBank] )]. Targeted small interfering
RNA (siRNA) inhibition of Ddx39 led to an increase in CD97 mRNA, but the
reciprocal effect was not observed (Fig. 2A).
Fig. 2. Expression perturbation of S/AS pairs.
Fig. 2. Expression perturbation of S/AS pairs.
siRNAs were designed against the indicated transcripts to specifically inhibit only the target transcripts without producing an off-target effect.
(A) Relative expression of the coding transcripts Ddx39 and CD97 24 hours after transfection. The Ddx39 transcript was silenced by siRNA designed to inhibit the transcript at two positions, Ddx39-1 and Ddx39-2, outside the CD97 overlap.
(B to D) Hepa1-6 mouse cells.
(B) siRNA perturbation of CEBPD (CCAAT/enhancer-binding protein related) and I530027A02 (hypothetical aminoacyl-tRNA synthetase class II).
(C) Overexpression of I530027A02 transcript induces overexpression of CEBPD.
(D) Perturbation of KIF20a (rabkinesin-6) and CDC23 (cell division cycle 23 yeast homolog) testing both cytoplasmic and nuclear RNA.
(E) HeLa cell. Ts-S, thymidylate synthase; TS-AS, thymidylate synthase antisense. Results represent the mean ±SE of three independent experiments performed in triplicate relative to GAPDH (glyceraldehyde-3-phosphate dehydrogenase) controls. Controls, no siRNA added.
CAGE data identified potentially coregulated S/AS pairs in mouse hepatocyte Hepa1-6 cells. In contrast with the above correlation, the inhibition of sense hypothetical aminoacyl-tRNA synthetase class II-containing protein (I530027A02) resulted in decreased antisense C/EBP delta expression, but the reverse interaction was not observed (Fig. 2B). The association between these two transcripts was tested further by transiently over-expressing I530027A02 (Fig. 2C), which caused induction of CEBP/delta expression. This finding argues against the simplistic assumption of a negative regulatory role of antisense transcription.
Similarly, the cytoplasmic level of CDC23 was decreased by siRNA
against the AS transcript Kif20a for 48 hours (Fig. 2D).
Here, the RNA concentration in the nucleus was diminished, suggesting moderate
reduction at a nuclear level as well. Another example is shown in the human
HeLa cell line, in which siRNA-mediated ablation of an antisense thymidylate
synthetase transcript produced a marginal, but reproducible, elevated level
of the thymidylate synthetase mRNA (Fig. 2E).
The examples above involve S/AS pairs in which both partners encode
protein, and the transcripts are processed and exported to the cytoplasm.
We also manipulated the expression of six pairs in which one partner is
noncoding, and in four of them there was a slight positive correlation
(18). In three other S/AS pairs in the two different
cellular systems tested, there was no evidence that ablation of the AS
transcript altered the level of the sense transcript. This finding is consistent
with the unaffected phenotype in ROSA-26 locus knockout mice, in which
ablation of ncRNA did not alter expression of overlapping coding transcripts
(19).
S/AS hybrids can potentially provide the templates for transcript cleavage involving the enzyme Dicer, which forms the molecular basis for so-called RNA interference (RNAi) (20). Dicer cleaves the RNA duplex to produce siRNAs, which in turn catalyze cleavage of the corresponding mRNA. siRNAs can also participate in transcriptional gene silencing in the nucleus (21-23). In addition, Dicer seems to be essential for heterochromatin formation in vertebrate cells (24). In addition to siRNA-mediated activities, noncoding antisense RNAs apparently contribute to local chromatin modification or methylation when they overlap the sense promoter (25-27).
Both RNAi and RNA-dependent heterochromatin assembly, as the basis
for function in S/AS pairs, would predict that the transcripts display
divergent regulation, but most S/AS pairs in our study were positively
correlated in their expression. Alternatively, coexpression would occur
if the transcription of the S/AS pair was controlled by the same enhancer
elements (28). If antisense transcripts do reflect the
transcriptional activity of enhancers, the act of transcription from the
antisense promoter may generate the regulatory interaction. In the imprinted
IGF2R locus, the antisense transcript, AIR, does not imprint the overlapping
Mas1 gene, and elimination of the transcriptional overlap with IGF2R
in a transgene does not prevent silencing (29). Hence,
some effects of antisense transcription may not require the formation of
an RNA duplex.
The large-scale transcriptome profiling of the mouse by the Fantom3
Consortium reveals that antisense transcription is widespread in the
mammalian genome. Although there are some examples in which the pairs
are discordantly regulated, and some experimental evidence of a direct
regulatory interaction, generally the S/AS pairs are positively correlated.
Whether concordant or discordant regulation reflects common or divergent
regulation, or cis/trans-acting regulatory interactions, will require detailed
analysis of the kind presented here for each of the pairs of transcripts
under a wide range of conditions.
Materials and Methods
Figs. S1 to S5
Tables S1 to S6
References and Notes
DDBJ Accession Codes
1. Claverie J-M, "Fewer Genes, More Noncoding RNA", Science 309: 1529-1530 (2005).
2. Mattick JS, "The Functional Genomics of Noncoding RNA". Science 309: 1527-1528 (2005).
3. The FANTOM Consortium, P. Carninci, T. Kasukawa, S. Katayama,
J. Gough, M. C. Frith, N. Maeda, R. Oyama, T. Ravasi, B. Lenhard, C. Wells,
R. Kodzius, K. Shimokawa, V. B. Bajic, S. E. Brenner, S. Batalov, A. R.
R. Forrest, M. Zavolan, M. J. Davis, L. G. Wilming, V. Aidinis, J. E. Allen,
A. Ambesi-Impiombato, R. Apweiler, R. N. Aturaliya, T. L. Bailey, M. Bansal,
L. Baxter, K. W. Beisel, T. Bersano, H. Bono, A. M. Chalk, K. P. Chiu,
V. Choudhary, A. Christoffels, D. R. Clutterbuck, M. L. Crowe, E. Dalla,
B. P. Dalrymple, B. de Bono, G. D. Gatta, D. di Bernardo, T. Down, P. Engstrom,
M. Fagiolini, G. Faulkner, C. F. Fletcher, T. Fukushima, M. Furuno, S.
Futaki, M. Gariboldi, P. Georgii-Hemming, T. R. Gingeras, T. Gojobori,
R. E. Green, S. Gustincich, M. Harbers, Y. Hayashi, T. K. Hensch, N. Hirokawa,
D. Hill, L. Huminiecki, M. Iacono, K. Ikeo, A. Iwama, T. Ishikawa, M. Jakt,
A. Kanapin, M. Katoh, Y. Kawasawa, J. Kelso, H. Kitamura, H. Kitano, G.
Kollias, S. P. T. Krishnan, A. Kruger, S. K. Kummerfeld, I. V. Kurochkin,
L. F. Lareau, D. Lazarevic, L. Lipovich, J. Liu, S. Liuni, S. McWilliam,
M. M. Babu, M. Madera, L. Marchionni, H. Matsuda, S. Matsuzawa, H. Miki,
F. Mignone, S. Miyake, K. Morris, S. Mottagui-Tabar, N. Mulder, N. Nakano,
H. Nakauchi, P. Ng, R. Nilsson, S. Nishiguchi, S. Nishikawa, F. Nori, O.
Ohara, Y. Okazaki, V. Orlando, K. C. Pang, W. J. Pavan, G. Pavesi, G. Pesole,
N. Petrovsky, S. Piazza, J. Reed, J. F. Reid, B. Z. Ring, M. Ringwald,
B. Rost, Y. Ruan, S. L. Salzberg, A. Sandelin, C. Schneider, C. Schonbach,
K. Sekiguchi, C. A. M. Semple, S. Seno, L. Sessa, Y. Sheng, Y. Shibata,
H. Shimada, K. Shimada, D. Silva, B. Sinclair, S. Sperling, E. Stupka,
K. Sugiura, R. Sultana, Y. Takenaka, K. Taki, K. Tammoja, S. L. Tan, S.
Tang, M. S. Taylor, J. Tegner, S. A. Teichmann, H. R. Ueda, E. van Nimwegen,
R. Verardo, C. L. Wei, K. Yagi, H. Yamanishi, E. Zabarovsky, S. Zhu, A.
Zimmer, W. Hide, C. Bult, S. M. Grimmond, R. D. Teasdale, E. T. Liu, V.
Brusic, J. Quackenbush, C. Wahlestedt, J. S. Mattick, D. A. Hume, RIKEN
Genome Exploration Research Group and Genome, C. Kai, D. Sasaki, Y. Tomaru,
S. Fukuda, M. Kanamori-Katayama, M. Suzuki, J. Aoki, T. Arakawa, J. Iida,
K. Imamura, M. Itoh, T. Kato, H. Kawaji, N. Kawagashira, T. Kawashima,
M. Kojima, S. Kondo, H. Konno, K. Nakano, N. Ninomiya, T. Nishio, M. Okada,
C. Plessy, K. Shibata, T. Shiraki, S. Suzuki, M. Tagami, K. Waki, A. Watahiki,
Y. Okamura-Oho, H. Suzuki, J. Kawai, and Y. Hayashizaki, "The
Transcriptional Landscape of the Mammalian Genome", Science, vol. 309.
no. 5740, pp. 1559-1563 (September 2, 2005).
http://www.sciencemag.org/cgi/content/abstract/309/5740/1559.
1. Chen J, Sun M, Rowley JD, and Hurst LD,
"The small introns
of antisense genes are better explained by selection for rapid transcription
than by ‘genomic design’".
2. Cho DH, Thienes CP, Mahoney SE, Analau E, Filippova GN, and Tapscott
SJ,
"Antisense Transcription
and Heterochromatin at the
DM1 CTG Repeats Are Constrained by CTCF".
3. Hovsepian JA, and Frenster JH, "Sense and Antisense during RNA Initiation of the DNA Transcription Bubble".
Links to RNA and Biological Causality:
Links to
Euchromatin Activator RNA Reviews:
Links to
Euchromatin Activator RNA Research:
Links to Ultrastructural
Probes of DNase I-Sensitive Sites:
Links to
RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma
Immuno-Pathology:
Links to Activated
T-Lymphocyte Immunotherapy:
Links to Medical
Systems Biology:
Links to Selective
Gene Transcription:
Links to RNA-Induced
Epigenetics:
Links to RNA-Induced
Embryogenesis:
Links to RNA and
Biological Causality:
Links to Reprogramming
and Neoplasia:
A Brief History of Activator RNA:
"Ultrastructural
Probes of Active DNA Sites, and the RNA Activators of DNA". (PowerPoint
Presentation).