Data Annotation Release Notes
Center for Bioinformatics, Peking University
- These "Annotation data..." collects those annotation data for Sense/Antisense (SA) pairs accompanying with our 2007 version data: NATsDB:Natural Antisense Transcript Database. Latest release is based on UniGene and GoldenPath of Jun,2006. See this paper.
- Name of organisms (abbreviation/common name):
hs (human) mm (mouse) dm (fly) cel (worm) cin (sea squirt)
gga (chicken) rn (rat) str (frog) dr (zebrafish) bt (cow) cfa (dog)
- All columns in all these files are listed as groups, with their definition following:
'cluster_id' is unique to define one sense/antisene or NOB pair.
The prefix, 'plus_' and 'minus_', indicate transcript encoded by plus strand and minus
strand, respectively. 'acc' is the GenBank or RefSeq accession number of the transcript.
'type' indicates the corresponding UniGene division of this transcript, mRNA or EST.
In case of mouse, relatively more representative sequences might be high-throughput
'overlapping_pattern' indicate the six patterns including 33, 55, contain, complete, intron, other.
As for 'plus_cds_region' and 'minus_cds_region', these are from the GenBank annotation of coding regions for two partners.
Download the Annotation Data.