Predicted ORF products mean size in completely sequenced organisms

Predicted ORF products mean size in compltely sequenced organisms

Organis	size(Mb)    Mean size	std	ORFs	min	Max	Tot. aa
SC	   1.3		458.8	362.3	 6213	25	 4910	 2850290
CE	  97.0		423.3	371.6	19099	 4	 7829	 8096713
DM	 170.0		497.7	451.2	13695	 5	 7182	 6816125
ATH	 100.0		439.4	318.4	22671	 8	 5079	 9960638
CA                      479.6	333.9	 6169    21	 4162	 2958521
HS*	3000.0		481.4	426.3	21724	16	 6669	10484673
SP        15.0		456.9	353.8	 3579	13	 4717	 1635306
PF+	 100.0		768.9	760	  421	54	 4981	  322400

MJ	1.66		287.0	204.9	 1735	22	 2894	  497904
MTH	1.75		281.4	194.7	 1871	25	 1797	  526507
AF	2.18		275.4	182.9	 2408	25	 2425	  663110
PH	1.74		275.9	199.7	 2061	50	 4436	  568546
PA	1.76		303.6	187.4	 1765	18	 2122	  535784
APE	1.66		237.1	170.2	 2694	50	 1933	  638684
APEM	1.66		279.0	187.9	 1865	50	 1933	  520254
TA	1.56		306.6	195.6	 1478	45	 2081	  453104
TV	1.58		297.1	199.5	 1526	50	 2076	  453348
SSP2	2.99		282.3	171.3	 2977	40	 1426	  840471
H	2.57		285.2	187.6	 2058	30	 1370	  586961
PFU	1.9		266.5	183.8	 2208	10	 1740	  588353
STO	2.69            268.4	178.4	 2826	50	 1442	  758502
PYAE	2.2             251.8	191.0	 2605	18	 2785	  655840

HI	1.83		300.8	200.0	 1680	12	 1694	  505242
MG	0.58		364.1	255.8	  468	37	 1805	  170400
MP	0.81		351.4	249.8	  677	37	 1882	  237903
Ssp	3.57		326.2	255.5	 3168	29	 4199	 1033450
EC	4.60		316.8	207.5	 4290	14	 2383	 1359208
HP	1.66		317.4	239.2	 1577	12	 2893	  500601
BS	4.2		296.8	258.6	 4100	20	 4930	 1217000
BB	0.91		333.3	225.3	  850	30	 2166	  283331
BH	4.2		292.2	190.6	 4066	11	 1816	 1188110
AE	1.66		317.0	187.7	 1522	47	 1574	  482511
MT	4.41		339.4	263.7	 3924	27	 4151	 1331736
MTC	4.4             317.0	256.0	 4203	30	 4151	 1332550
ML	3.26		334.6	255.8	 1581	37	 3076	  538684
TP	1.14		340.1	223.1	 1533	30	 1533	  350676
CT	1.04		355.2	243.5	  877	45	 1786	  311506
CP	1.23		343.8	239.2	 1052	40	 1826	  361694
RP	1.11		333.8	231.4	  837	41	 2340	  279396
CJ	1.64		299.9	199.1	 1731	30	 1517	  519212
TM	1.86		315.3	196.8	 1849	30	 1690	  582898
DR	3.28		309.2	197.3	 3117	37	 1940	  963879
NM	2.18		288.1	232.2	 2081	12	 2703	  599559
XF	2.68		267.2	251.5	 2830	30	 3455	  756218
VC	4.0		303.6	232.0	 3837	26	 4558	 1164911
B	0.64		328.3	208.7	  575	38	 1407	  188781
PAE	6.3		334.3	249.6	 5570	23	 5627	 1861971
LMO	2.94            306.0	210.1	 2846	28	 2044	  870878
LIN	3.01		299.8	213.9	 2968	37	 2167	  889684
STY	4.8		302.3	212.3	 4398	13	 3624	 1328726
YP      4.65            319.1	242.6	 3895	14	 3705	 1242950
SAMU50	2.8		295.4	250.8	 2714	16	 6713	  801649
SAN315	2.81		301.3	254.1	 2594	16	 6713	  781585
SPY     1.85            304.0	206.6	 1696	27	 2045	  515585
MM      7.59            299.3	212.4	 7272	 6	 3930	 2178104
SM	6.7		308.9	198.1	 6204	 31	 2832	 1917270
AGRT	5.3		319.1	206.3	 5299	 30	 2802	 1691132

This table shows the species size (in Mb), its ORF mean size, the corresponding standard deviation, its total number of ORFs, the smallest ORF size, the largest ORF size and total number of amino acids in the species.

Organisms have been classified according to the three major domains of life and using the following abbreviations:

S. cerevisiae (SC), C. elegans (CE), S. pombe (SP), Drosophila Melanogaster (DM), C. albicans (CACD), A. thaliana (ATH), Human predicted proteome (ncbi 15/02/2001)(HS).

M. jannaschii (MJ), M. thermoautotrophicum (MTH), A. fulgidus (AF), P. horikoshii (PH), P. abyssi (PA), Aeropyrum pernix K1 (APE), Thermoplasma acidophilum (TA), Thermoplasma volcanium (TV), Halobacterium sp. NRC-1 (H), Sulfolobus solfataricus P2 (SSP2), P. furiousis (PFU), Sulfolobus tokodaii (STO), Pyrobaculum aerophilum (PYAE).

H. influenzae (HI), M. genitalium (MG), M. pneumoniae (MP), Synechocystis sp. (Ssp.), E. coli (EC), H. pylori (HP), B. subtilis (BS), Bacillus halodurans (BH), B. burgdorferi (BB), M. tuberculosis (MT), A. aeolicus (AE), T. pallidum (TP), C. trachomatis (CT), Chlamydia pneumoniae (CP), R. prowazekii (RP), C. jejuni (CJ), T. maritima (TM), Deinococcus radiodurans (DR), Neisseria meningitidis (NM), Xylella fastidiosa (XF), Vibrio cholerae (VC), Pseudomonas aeruginosa (PAE), Buchnera sp. (B), Mycobacterium leprae (ML), M. tuberculosis CDC 1551 (MTC), Yersinia pestis (YP), Salmonella Typhi (STY), Staphylococcus aureus N315 (SAN315), Staphylococcus aureus Mu50 (SAMU50), Listeria monocytogenes EGD-e (LMO), Listeria innocua (LIN), Streptococcus pyogenes M1 (SPY), Agrobacterium tumefaciens (AGRT), .Mesorhizobium loti (MM), Sinorhizobium meliloti (SM).

*:21724 protein sequences downloaded from the ncbi ftp server.

+: chromosomes 2 and 3 only

Go to top menu


Fredj Tekaia