**10. References**

Arnaiz, O., Gout, J. F., Betermier, M., Bouhouche, K., Cohen, J., Duret, L., Kapusta, A., Meyer, E. & Sperling, L. (2010). Gene expression in a paleopolyploid: a

Detection and Analysis of Functional Specialization in Duplicated Genes 55

Gallach, M., Chandrasekaran, C. & Betran, E. (2010). Analyses of nuclearly encoded

Ganko, E. W., Meyers, B. C. & Vision, T. J. (2007). Divergence in expression

Gibson, T. A. & Goldberg, D. S. (2009). Questioning the ubiquity of neofunctionalization.

Goettel, W. & Messing, J. (2010). Divergence of gene regulation through chromosomal

Guo, H., Weiss, R. E., Gu, X. & Suchard, M. A. (2007). Time squared: repeated measures on

Han, M. V., Demuth, J. P., McGrath, C. L., Casola, C. & Hahn, M. W. (2009). Adaptive

Harhay, G. P., Smith, T. P., Alexander, L. J., Haudenschild, C. D., Keele, J. W., Matukumalli,

Jarinova, O., Hatch, G., Poitras, L., Prudhomme, C., Grzyb, M., Aubin, J., Berube-Simard, F.

Johnson, D. A. & Thomas, M. A. (2007). The monosaccharide transporter gene family in

Karanth, S., Denovan-Wright, E. M., Thisse, C., Thisse, B. & Wright, J. M. (2009). Tandem

Kassahn, K. S., Dang, V. T., Wilkins, S. J., Perkins, A. C. & Ragan, M. A. (2009). Evolution of

Langille, M. G. & Clark, D. V. (2007). Parent genes of retrotransposition-generated gene

Li, Q., Liu, X., He, Q., Hu, L., Ling, Y., Wu, Y., Yang, X. & Yu, L. (2011). Systematic analysis

Li, Z., Liu, Q., Song, M., Zheng, Y., Nan, P., Cao, Y., Chen, G., Li, Y. & Zhong,

analyses in vertebrates. *Genome Res.,* Vol. 19, No. 8, pp. 1404-1418

duplicated genes in rice. *BMC Bioinformatics,* Vol. 10 Suppl 6

genes in teleosts. *Development,* Vol. 135, No. 21, pp. 3543-3553

divergence. *Mol. Biol. Evol.,* Vol. 24, No. 11, pp. 2412-2423

evolution of young gene duplicates in mammals. *Genome Res.,* Vol. 19, No. 5, pp.

L. K., Schroeder, S. G., Van Tassell, C. P., Gresham, C. R., Bridges, S. M., Burgess, S. C. & Sonstegard, T. S. (2010). An atlas of bovine gene expression reveals novel distinctive tissue characteristics and evidence for improving genome annotation.

A., Jeannotte, L. & Ekker, M. (2008). Functional resolution of duplicated hoxb5

Arabidopsis and rice: a history of duplications, adaptive evolution, and functional

duplication of the fabp1b gene and subsequent divergence of the tissue-specific distribution of fabp1b.1 and fabp1b.2 transcripts in zebrafish (*Danio rerio*). *Genome,*

gene function and regulatory control after whole-genome duplication: comparative

duplicates in *Drosophila melanogaster* have distinct expression profiles. *Genomics,*

of gene expression level with tissue-specificity, function and protein subcellular localization in human transcriptome. *Mol. Biol. Rep.,* Vol. 38, No. 4, pp. 2597-2602. Li, Z., Zhang, H., Ge, S., Gu, X., Gao, G. & Luo, J. (2009). Expression pattern divergence of

Y. (2005). Detecting correlation between sequence and expression divergences

835-850

2298-2309

859-867

*PLoS Comput. Biol.,* Vol. 5, No. 1

*Genome Biol.,* Vol. 11, No. 10

Vol. 52, No. 12, pp. 985-992

Vol. 90, No. 3, pp. 334-343

rearrangements. *BMC Genomics,* Vol. 11, No. 678

phylogenies. *Mol. Biol. Evol.,* Vol. 24, No. 2, pp. 352-362

mitochondrial genes suggest gene duplication as a mechanism for resolving intralocus sexually antagonistic conflict in Drosophila. *Genome Biol. Evol.,* Vol. 2, pp.

between duplicated genes in *Arabidopsis*. *Mol. Biol. Evol.,* Vol. 24, No. 10, pp.

transcriptome resource for the ciliate *Paramecium tetraurelia*. *BMC Genomics,* Vol. 11, No. 547


Barkman, T. & Zhang, J. (2009). Evidence for escape from adaptive conflict? *Nature,* Vol. 462,

Bershtein, S. & Tawfik, D. S. (2008). Ohno's model revisited: measuring the frequency of

Burkhart, J. M., Vaudel, M., Zahedi, R. P., Martens, L. & Sickmann, A. (2011). iTRAQ protein

Canestro, C., Catchen, J. M., Rodriguez-Mari, A., Yokoi, H. & Postlethwait, J. H. (2009).

Chaudhary, B., Flagel, L., Stupar, R. M., Udall, J. A., Verma, N., Springer, N. M. & Wendel, J.

Comelli, R. N. & Gonzalez, D. H. (2009). Divergent regulatory mechanisms in the response

Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. (2004). WebLogo: a sequence logo

Deng, C., Cheng, C. H., Ye, H., He, X. & Chen, L. (2010). Evolution of an antifreeze protein

Des Marais, D. L. & Rausher, M. D. (2008). Escape from adaptive conflict after duplication in an anthocyanin pathway gene. *Nature,* Vol. 454, No. 7205, pp. 762-765 Doxey, A. C., Yaish, M. W., Moffatt, B. A., Griffith, M. & McConkey, B. J. (2007). Functional

Edger, P. P. & Pires, J. C. (2009). Gene and genome duplications: the impact of dosage-

Field, S. F. & Matz, M. V. (2010). Retracing evolution of red fluorescence in GFP-like proteins

Flagel, L., Udall, J., Nettleton, D. & Wendel, J. (2008). Duplicate gene expression in

Flagel, L. E. & Wendel, J. F. (2010). Evolutionary rate variation, genomic dominance and

from Faviina corals. *Mol. Biol. Evol.,* Vol. 27, No. 2, pp. 225-233

generator. *Genome Res.,* Vol. 14, No. 6, pp. 1188-1190

*U.S.A.,* Vol. 107, No. 50, pp. 21593-21598

evolution. *BMC Biol.,* Vol. 6, No. 16

*Phytol.,* Vol. 186, No. 1, pp. 184-193

No. 547

No. 7274

1134

Vol. 5, No. 5

503-517

1179-1181

1045-1055

699-717

25, No. 11, pp. 2311-2318

transcriptome resource for the ciliate *Paramecium tetraurelia*. *BMC Genomics,* Vol. 11,

potentially adaptive mutations under various mutational drifts. *Mol.Biol.Evol.,* Vol.

quantification: A quality-controlled workflow. *Proteomics,* Vol. 11, No. 6, pp. 1125-

Consequences of lineage-specific gene loss on functional evolution of surviving paralogs: ALDH1A and retinoic acid signaling in vertebrate genomes. *PLoS Genet.,*

F. (2009). Reciprocal silencing, transcriptional bias and functional divergence of homeologs in polyploid cotton (gossypium). *Genetics,* Vol. 182, No. 2, pp.

of respiratory chain component genes to carbohydrates suggests a model for gene evolution after duplication. *Plant. Signal. Behav.,* Vol. 4, No. 12, pp.

by neofunctionalization under escape from adaptive conflict. *Proc. Natl Acad. Sci.* 

divergence in the Arabidopsis beta-1,3-glucanase gene family inferred by phylogenetic reconstruction of expression states. *Mol. Biol. Evol.,* Vol. 24, No. 4, pp.

sensitivity on the fate of nuclear genes. *Chromosome Res.,* Vol. 17, No. 5, pp.

allopolyploid *Gossypium* reveals two temporally distinct phases of expression

duplicate gene expression evolution during allotetraploid cotton speciation. *New* 


Detection and Analysis of Functional Specialization in Duplicated Genes 57

Teshima, K. M. & Innan, H. (2008). Neofunctionalization of duplicated genes under the

Tsankov, A. M., Thompson, D. A., Socha, A., Regev, A. & Rando, O. J. (2010). The

Turunen, O., Seelke, R. & Macosko, J. (2009). In silico evidence for functional specialization after genome duplication in yeast. *FEMS Yeast Res.,* Vol. 9, No. 1, pp. 16-31 Udall, J. A., Swanson, J. M., Nettleton, D., Percifield, R. J. & Wendel, J. F. (2006). A novel

Van de Peer, Y., Maere, S. & Meyer, A. (2009). The evolutionary significance of ancient

Venkataram, S. & Fay, J. C. (2010). Is transcription factor binding site turnover a sufficient

Viaene, T., Vekemans, D., Becker, A., Melzer, S. & Geuten, K. (2010). Expression divergence

Wang, Z., Dong, X., Ding, G. & Li, Y. (2010). Comparing the retention mechanisms of

Xiong, J., Feng, L., Yuan, D., Fu, C. & Miao, W. (2010). Genome-wide identification and

Xue, C., Huang, R., Liu, S. Q. & Fu, Y. X. (2010). Recombination facilitates

Xue, C. & Fu, Y. (2009). Preservation of duplicate genes by originalization. *Genetica,* Vol. 136,

Yang, X., Tuskan, G. A. & Cheng, M. Z. (2006). Divergence of the Dof gene families in

Yasukawa, J., Tomioka, S., Aigaki, T. & Matsuo, T. (2010). Evolution of expression patterns

Yim, W. C., Lee, B. M. & Jang, C. S. (2009). Expression diversity and evolutionary dynamics

Zhan, Z., Ren, J., Zhang, Y., Zhao, R., Yang, S. & Wang, W. (2011). Evolution of alternative splicing in newly evolved genes of *Drosophila*. *Gene,* Vol. 470, No. 1-2, pp. 1-6

duplication. *Plant Physiol.,* Vol. 142, No. 3, pp. 820-830

genome duplications. *Nat. Rev. Genet.,* Vol. 10, No. 10, pp. 725-732

role of nucleosome positioning in the evolution of gene regulation. *PLoS Biol.,* 

approach for characterizing expression levels of genes duplicated by polyploidy.

explanation for cis-regulatory sequence divergence? *Genome Biol. Evol.,* Vol. 2, pp.

of the AGL6 MADS domain transcription factor lineage after a core eudicot duplication suggests functional diversification. *BMC Plant. Biol.,* Vol. 10,

tandem duplicates and retrogenes in human and mouse genomes. *Genet. Sel. Evol.,*

evolution of ATP-binding cassette transporters in the ciliate *Tetrahymena thermophila*: A case of functional divergence in a multigene family. *BMC Evol.Biol.,*

neofunctionalization of duplicate genes via originalization. *BMC Genet.,* Vol. 11,

poplar, *Arabidopsis*, and rice suggests multiple modes of gene evolution after

of two odorant-binding protein genes, Obp57d and Obp57e, in Drosophila. *Gene,*

of rice duplicate genes. *Mol. Genet. Genomics,* Vol. 281, No. 5, pp. 483-493,

pressure of gene conversion. *Genetics,* Vol. 178, No. 3, pp. 1385-1398

Vol. 8, No. 7

851-858

No. 148

Vol. 42, pp. 24

Vol. 10, No. 330

No. 1, pp. 69-78

Vol. 467, No. 1-2, pp. 25-34

No. 46

1617-4623

*Genetics,* Vol. 173, No. 3, pp. 1823-1827

in a comparative analysis of human serpin genes. *BioSystems,* Vol. 82, No. 3, pp. 226-234


Lockton, S. & Gaut, B. S. (2005). Plant conserved non-coding sequences and paralogue

MacCarthy, T. & Bergman, A. (2007). The limits of subfunctionalization. *BMC Evol.Biol.,* Vol.

Mikhaylova, L. M., Nguyen, K. & Nurminsky, D. I. (2008). Analysis of the *Drosophila* 

Nielsen, M. G., Gadagkar, S. R. & Gutzwiller, L. (2010). Tubulin evolution in insects: gene

Oakley, T. H., Gu, Z., Abouheif, E., Patel, N. H. & Li, W. H. (2005). Comparative methods for

Pagel, M. & Meade, A. (2006). Bayesian Analysis of Correlated Evolution of Discrete

Panchin, A. Y., Gelfand, M. S., Ramensky, V. E. & Artamonova, I. I. (2010). Asymmetric

Qian, W., Liao, B. Y., Chang, A. Y. & Zhang, J. (2010). Maintenance of duplicate genes and

Rajashekar, B., Samson, P., Johansson, T. & Tunlid, A. (2007). Evolution of nucleotide

Redon, R., Ishikawa, S., Fitch, K. R., Feuk, L., Perry, G. H., Andrews, T. D., et al. (2006).

Ren, X. Y., Fiers, M. W., Stiekema, W. J. & Nap, J. P. (2005). Local coexpression domains of

Semon, M. & Wolfe, K. H. (2008). Preferential subfunctionalization of slow-evolving genes

Shoja, V., Murali, T. M. & Zhang, L. (2007). Expression divergence of tandemly arrayed

Skamnioti, P., Furlong, R. F. & Gurr, S. J. (2008). The fate of gene duplicates in the genomes of fungal pathogens. *Commun. Integr. Biol.,* Vol. 1, No. 2, pp. 196-198

genes in human and mouse. *Comp. Funct. Genomics, 60964*

fungus *Paxillus involutus*. *New Phytol.,* Vol. 174, No. 2, pp. 399-411

Ohno, S. (1970). *Evolution by Gene Duplication*, Springer-Verlag, 0-04-575015-7, New York Osada, N. & Innan, H. (2008). Duplication and gene conversion in the *Drosophila melanogaster*

evolution. *Trends Genet.,* Vol. 21, No. 1, pp. 60-65

constrained gene family. *BMC Evol.Biol.,* Vol. 10, No. 113

genomic data. *Mol.Biol.Evol.,* Vol. 22, No. 1, pp. 40-50

*Genetics,* Vol. 179, No. 1, pp. 305-315

genome. *PLoS Genet.,* Vol. 4, No. 12

226-234

7, No. 213

6

Vol. 5

pp. 425-430

pp. 444-454

24, pp. 8333-8338

923-934

in a comparative analysis of human serpin genes. *BioSystems,* Vol. 82, No. 3, pp.

*melanogaster* testes transcriptome reveals coordinate regulation of paralogous genes.

duplication and subfunctionalization provide specialized isoforms in a functionally

the analysis of gene-expression evolution: an example using yeast functional

Characters by Reversible-Jump Markov Chain Monte Carlo. *Am. Nat.,* Vol. 167, No.

and non-uniform evolution of recently duplicated human genes. *Biol. Direct,* 

their functional redundancy by reduced expression. *Trends Genet.,* Vol. 26, No. 10,

sequences and expression patterns of hydrophobin genes in the ectomycorrhizal

Global variation in copy number in the human genome. *Nature,* Vol. 444, No. 7118,

two to four genes in the genome of *Arabidopsis*. *Plant Physiol.,* Vol. 138, No. 2, pp.

after allopolyploidization in *Xenopus laevis*. *Proc. Natl Acad. Sci. U.S.A.,* Vol. 105, No.


**4** 

*Germany* 

**Predicting Tandemly Arrayed Gene** 

*Abteilung NMR basierte Strukturbiologie, Max-Planck-Institut für* 

Since the first high-quality eukaryotic genome assemblies became available the large scale analysis of the origin of new genes came into the focus of many studies (Shoja & Zhang, 2006; Zhou et al., 2008). New genes can originate through multiple mechanisms including gene duplication, gene fusion/fission, exon shuffling, retroposition, horizontal gene transfer, and de novo from noncoding sequences (Long et al., 2003). Although initial models proposed that new copies of genes soon become nonfuntional (Nei & Roychoudhury, 1973; Ohno, 1970) it has since been shown for numerous genes that they retain function through creating redundancy, subfunctionalization, and neofunctionalization (Hahn, 2009; Li et al., 2005; Massingham et al., 2001). While de novo origination from noncoding sequence has been shown to play an unexpectedly important role (Zhou et al., 2008) most of the new genes are derived through duplications. Gene duplicates are normally classified into dispersed and tandem duplicates. Tandem duplications of clusters of genes, single genes, groups of exons, or single exons are thought to be formed by unequal crossing-over events, or misaligned homologous recombinational repair (Babushok et al., 2007; Zhang, 2003). A comparative analysis of the human, mouse, and rat genome has shown that about 15 % of all genes represent tandemly arrayed genes (Shoja & Zhang, 2006). A similar number of about 20 % has been found for the fruit fly *Drosophila melanogaster* (Quijano et al., 2008). All these analyses rely on the particular dataset of annotated genes used and the specific methods for defining genes as tandem genes. However, first annotations of genomes are in most cases done by automatic gene prediction programs, nowadays often supported by incorporating additional EST data, and therefore miss many genes, include artificially fused neighbouring genes, and contain mis-predicted exons and introns. Although these errors seem small, in the case of distinguishing tandem gene duplicates from genomic region duplication and *trans-*spliced genes they are essential. In addition, defining tandem genes by a certain number of nucleotides appearing in-between cannot separate tandem gene duplicates from duplications of small genomic regions. Tandemly arrayed gene duplicates are often conserved between species. Examples are the olfactory receptor genes that constitute a very large gene family of several hundred genes per species in vertebrates (Aloni et al., 2006) and the HOX genes (Garcia-Fernandez, 2005; Zhang & Nei, 1996). While algorithms have been developed to reconstruct the history and evolution of tandemly arrayed genes (Bertrand et al., 2008; Elemento et al., 2002) specific programs are not

available for the prediction and local reconstruction of these gene arrays.

**1. Introduction** 

**Duplicates with WebScipio** 

*Biophysikalische Chemie, Am Fassberg 11, Göttingen* 

Klas Hatje and Martin Kollmar

Zou, C., Lehti-Shiu, M. D., Thomashow, M. & Shiu, S. H. (2009). Evolution of stressregulated gene expression in duplicate genes of *Arabidopsis thaliana*. *PLoS Genet.,* Vol. 5, No. 7, e1000581
