Duplication and Conservation in completely sequenced organisms

Proportion of ancestral duplication and conservation as calculated from the partitions of the surveyed genomes

					Query proteome

org	ORFs-	PARTs-	SC	CE	SP*	MJ	MTH	AF	PH	HI	MG	MP	Ssp	EC	HP	BS	BB	AE	MT	TP	CT	RP	CJ	
SC	6182	4437	100.	12.87	57.47	18.72	19.15	15.81	14.12	18.80	24.93	22.16	14.33	13.82	14.68	12.43	13.79	21.08	12.02	14.31	19.08	22.05	14.81	
CE	19099	7558	33.49	100.	42.66	18.05	18.98	17.70	17.33	20.33	27.42	24.52	15.58	14.75	16.47	13.21	18.67	21.60	13.13	16.19	21.30	25.26	15.11	
SP*	3579	2248	46.20	17.50	100.	21.86	22.65	19.95	19.59	23.16	31.02	28.30	18.93	16.98	21.12	16.09	25.14	27.39	14.59	19.48	24.37	28.02	19.50	

MJ	1735	1180	8.83	3.83	11.96	100.	58.76	47.22	39.18	18.80	27.14	23.11	18.43	13.70	18.18	15.07	19.25	31.28	12.45	14.90	16.71	20.21	18.89	
MTH	1871	1227	10.41	4.53	13.21	62.71	100.	47.43	37.29	21.71	26.03	23.11	20.17	16.01	19.90	18.18	20.40	32.92	16.57	18.77	19.77	22.66	21.08	
AF	2408	1423	15.19	5.08	13.74	58.72	56.47	100.	42.31	23.52	23.54	21.46	20.02	18.35	21.37	20.05	20.68	36.04	17.17	17.48	21.30	25.42	21.54	
PH	2061	1373	14.58	7.23	17.03	52.03	47.84	47.43	100.	26.79	27.97	28.77	20.42	19.65	25.61	25.18	27.44	37.59	18.03	21.71	23.39	27.56	22.90	

HI	1713	1377	9.55	4.07	12.23	19.91	19.55	17.21	14.12	100.	54.57	48.58	28.67	39.50	40.04	26.62	42.24	42.78	24.64	38.61	48.88	53.29	41.42	
MG	468	361	5.25	2.19	5.42	14.91	12.38	10.33	6.62	16.12	100.	86.55	12.03	8.61	17.21	13.33	29.31	16.85	9.27	21.00	23.67	25.72	14.81	
MP	677	424	5.09	2.03	5.64	11.52	12.55	9.83	6.91	16.63	97.78	100.	12.88	9.58	17.78	13.44	29.02	18.06	9.44	21.71	23.81	26.18	13.98	
Ssp	3168	2002	11.38	4.86	14.50	29.57	31.21	27.33	17.55	44.51	56.23	51.65	100.	32.55	39.47	30.00	41.09	51.33	32.50	38.96	48.60	52.67	38.54	
EC	4292	2473	12.46	5.64	16.45	29.40	33.00	29.44	21.99	78.28	55.67	52.35	38.71	100.	47.71	36.61	47.12	55.05	35.07	44.71	53.89	62.78	50.71	
HP	1577	1226	8.63	3.12	10.14	23.64	22.08	16.65	13.47	39.07	49.30	44.81	24.22	23.29	100.	23.20	38.93	44.16	18.93	34.97	41.64	50.38	56.61	
BS	4100	2573	11.40	4.76	14.72	27.37	29.42	26.63	20.32	49.89	65.92	60.37	34.86	35.70	42.74	100.	49.42	53.93	32.41	45.65	52.36	52.52	43.15	
BB	850	696	5.58	2.32	6.76	14.06	12.14	10.40	8.08	23.31	48.19	41.98	13.93	12.69	23.89	16.05	100.	26.27	10.90	46.36	32.86	35.22	21.76	
AE	1522	1157	11.78	5.39	14.36	36.69	34.88	35.34	23.67	41.17	52.07	45.51	31.26	27.53	46.16	29.80	46.83	100.	24.81	39.90	47.49	54.36	44.06	
MT	3924	2329	9.28	4.41	13.38	21.18	25.67	20.94	15.65	40.30	51.24	45.04	33.36	28.83	32.38	27.82	33.90	43.47	100.	33.92	40.25	46.70	32.65	
TP	1031	852	7.43	3.05	8.14	12.54	13.77	11.31	10.56	26.79	47.36	42.21	17.83	17.38	27.32	19.27	56.89	28.08	16.31	100.	37.04	36.60	24.56	
CT	877	718	14.04	5.98	13.43	21.94	22.16	19.88	18.86	37.18	52.35	46.22	25.22	23.73	34.66	28.87	43.24	36.30	20.22	39.08	100.	49.15	30.08	
RP	837	653	10.75	5.22	11.78	18.89	18.41	15.67	14.34	33.18	47.64	44.81	21.57	20.09	33.52	22.81	41.81	34.31	17.00	32.27	44.28	100.	31.21	
CJ	1731	1323	9.44	3.85	11.87	22.20	20.94	19.18	15.58	45.17	51.52	47.87	27.52	27.57	66.23	25.80	45.25	50.30	20.65	37.67	45.82	52.06	100.	

This table shows the proportions (in %) of the partitions of a given proteome showing highly significant matches when predicted ORF products of this organism were used as query sequences against predicted ORF products of the same or another organism.
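For readers who want to work with the table programmatically, here is a minimal sketch of one way to load it, assuming the rows are kept as whitespace-separated text exactly as shown above. Only the header and the MG and MP rows are embedded here, and the names `parse_table`, `sample`, `counts` and `table` are illustrative, not part of the original study.

```python
def parse_table(text):
    """Parse the whitespace-separated table into counts and percentages.

    Returns (counts, table) where counts[row] = (ORFs, partitions) and
    table[row][column] is the percentage printed in that cell.
    """
    lines = [line.split() for line in text.strip().splitlines() if line.strip()]
    header = lines[0]
    # The first three header fields are 'org', 'ORFs-', 'PARTs-';
    # a trailing '*' (footnote marker on SP) is dropped from column labels.
    columns = [name.rstrip("*") for name in header[3:]]
    counts, table = {}, {}
    for fields in lines[1:]:
        row = fields[0].rstrip("*")
        counts[row] = (int(fields[1]), int(fields[2]))
        table[row] = dict(zip(columns, (float(v) for v in fields[3:])))
    return counts, table


# Header plus two rows copied from the table above (hypothetical variable name).
sample = """
org ORFs- PARTs- SC CE SP* MJ MTH AF PH HI MG MP Ssp EC HP BS BB AE MT TP CT RP CJ
MG 468 361 5.25 2.19 5.42 14.91 12.38 10.33 6.62 16.12 100. 86.55 12.03 8.61 17.21 13.33 29.31 16.85 9.27 21.00 23.67 25.72 14.81
MP 677 424 5.09 2.03 5.64 11.52 12.55 9.83 6.91 16.63 97.78 100. 12.88 9.58 17.78 13.44 29.02 18.06 9.44 21.71 23.81 26.18 13.98
"""

counts, table = parse_table(sample)
print(counts["MG"], table["MP"]["MG"], table["MG"]["MP"])
```

Indexing as `table[row][column]` makes the asymmetry of the table easy to inspect: the MP row, MG column and the MG row, MP column hold different values.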

Definition: A set of predicted ORF products is called a Partition if, and only if, the following properties are verified:

a) each member of the set has at least one highly significant match with one other member of the set;

b) no member of the set has highly significant matches with members not included in the set;

c) the set is minimal (i.e., it cannot be divided into subsets verifying a and b).

Note that each ORF product that has no significant match in its own organism verifies these properties and is therefore considered a partition with a single member.
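Properties a) to c) amount to taking the connected components of a graph whose nodes are ORF products and whose edges are highly significant matches. The sketch below illustrates this reading with a small union-find; it is not the procedure used in the original study, and the function name `partitions`, the ORF labels and the match list are hypothetical.

```python
from collections import defaultdict


def partitions(orfs, matches):
    """Group ORF products into partitions.

    orfs    : iterable of ORF identifiers from one proteome
    matches : iterable of (orf_a, orf_b) pairs with a highly significant match
    Returns a list of sets; an ORF with no match forms a single-member set.
    """
    orfs = list(orfs)
    parent = {orf: orf for orf in orfs}

    def find(x):
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in matches:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two groups

    groups = defaultdict(set)
    for orf in orfs:
        groups[find(orf)].add(orf)
    # Order the partitions by their sorted members for reproducible output.
    return sorted(groups.values(), key=sorted)


# Hypothetical proteome: a, b, c are linked by matches; d and e have none.
example_orfs = ["a", "b", "c", "d", "e"]
example_matches = [("a", "b"), ("b", "c")]
example_parts = partitions(example_orfs, example_matches)
print(example_parts)
```

In this toy example, `a` and `c` end up in the same partition although they do not match each other directly, because both match `b`; `d` and `e` each form a partition with a single member.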

In this table, Tij is the proportion of distinct partitions from organism j having a common ancestry with one or several ORF product(s) of organism i (i.e., Tij equals the number of distinct partitions of j that include members having significant matches in i, divided by the total number of distinct partitions of j, expressed in percent). Tij is called the "weight" of common ancestry between j and i.
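The definition of Tij can be sketched as a short function, assuming the partitions of j are available as sets of ORF identifiers and the significant matches as (ORF of j, ORF of i) pairs. The name `common_ancestry_weight` and the toy identifiers are hypothetical.

```python
def common_ancestry_weight(partitions_j, matches_j_to_i):
    """Percent of partitions of organism j with a member matching organism i.

    partitions_j   : list of sets of ORF identifiers from organism j
    matches_j_to_i : iterable of (orf_of_j, orf_of_i) significant matches
    """
    # ORF products of j that have at least one significant match in i.
    matched = {orf_j for orf_j, _orf_i in matches_j_to_i}
    # A partition counts once, however many of its members are matched.
    n_hit = sum(1 for part in partitions_j if part & matched)
    return 100.0 * n_hit / len(partitions_j)


# Toy data: four partitions of j, two of which have members matching i.
toy_partitions = [{"j1", "j2"}, {"j3"}, {"j4"}, {"j5"}]
toy_matches = [("j1", "i7"), ("j2", "i7"), ("j4", "i2")]
toy_weight = common_ancestry_weight(toy_partitions, toy_matches)
print(toy_weight)
```

As a rough check against the table, and reading the MP row, MG column as i = MP, j = MG (an interpretation based on the "Query proteome" heading above the columns), the value 97.78 with 361 MG partitions corresponds to about 353 partitions, since 353 / 361 ≈ 0.9778.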

The proportion of ancestral duplication within each genome (the diagonal entries of the table) equals 100 (i.e., each gene is at least its own match).

Organisms have been classified according to the three major domains of life and the following abbreviations have been used:

S. cerevisiae (SC), C. elegans (CE), S. pombe (SP), M. jannaschii (MJ), M. thermoautotrophicum (MTH), A. fulgidus (AF), P. horikoshii (PH), H. influenzae (HI), M. genitalium (MG), M. pneumoniae (MP), Synechocystis sp. (Ssp), E. coli (EC), H. pylori (HP), B. subtilis (BS), B. burgdorferi (BB), A. aeolicus (AE), M. tuberculosis (MT), T. pallidum (TP), C. trachomatis (CT), R. prowazekii (RP), and C. jejuni (CJ).

-: Number of distinct ORFs and distinct partitions in the proteome (not used for correspondence analysis).

*: only 68% of the total proteome of S. pombe was considered in this study.

See the corresponding distribution obtained by correspondence analysis, or the corresponding genomic tree.



Fredj Tekaia