Data Annotation Release Notes
- These "Annotation data..." collects those annotation data for Sense/Antisense (SA) pairs accompanying with our 2007 version data: NATsDB:Natural Antisense Transcript Database. Latest release is based on UniGene and GoldenPath of Jun,2006. See this paper.
- Name of organisms (abbreviation/common name):
hs (human) mm (mouse) dm (fly) cel (worm) cin (sea squirt)
gga (chicken) rn (rat) str (frog) dr (zebrafish) bt (cow) cfa (dog)
- All columns in all these files are listed as groups, with their definition following:
plus_acc,minus_acc,cluster_id,plus_type,minus_type,overlapping_pattern,plus_cds_region,minus_cds_region;
-
'cluster_id' is unique to define one sense/antisene or NOB pair.
-
The prefix, 'plus_' and 'minus_', indicate transcript encoded by plus strand and minus
strand, respectively. 'acc' is the GenBank or RefSeq accession number of the transcript.
-
'type' indicates the corresponding UniGene division of this transcript, mRNA or EST.
In case of mouse, relatively more representative sequences might be high-throughput
cDNAs (HTC).
-
'overlapping_pattern' indicate the six patterns including 33, 55, contain, complete, intron, other.
-
As for 'plus_cds_region' and 'minus_cds_region', these are from the GenBank annotation of coding regions for two partners.
-
Download the Annotation Data.
Center for Bioinformatics, Peking University