The evolution and population diversity of human-specific segmental duplications

dc.contributor.authorDennis, Megan Y
dc.contributor.authorHarshman, Lana
dc.contributor.authorNelson, Bradley J
dc.contributor.authorPenn, Osnat
dc.contributor.authorCantsilieris, Stuart
dc.contributor.authorHuddleston, John
dc.contributor.authorAntonacci, Francesca
dc.contributor.authorPenewit, Kelsi
dc.contributor.authorDenman, Laura
dc.contributor.authorRaja, Archana
dc.contributor.authorBaker, Carl
dc.contributor.authorMark, Kenneth
dc.contributor.authorMalig, Maika
dc.contributor.authorJanke, Nicolette
dc.contributor.authorEspinoza, Claudia
dc.contributor.authorStessman, Holly A
dc.contributor.authorNuttle, Xander
dc.contributor.authorHoekzema, Kendra
dc.contributor.authorGraves, Tina A
dc.contributor.authorWilson, Richard K
dc.contributor.authorEichler, Evan E
dc.date.accessioned2017-05-22T19:55:47Z
dc.date.available2017-05-22T19:55:47Z
dc.date.issued2017-02-17
dc.description.abstractSegmental duplications contribute significantly to the evolution, adaptation and diseaseassociated instability of the human genome. The largest and most identical duplications suffer from the poorest characterization, often corresponding to genome gaps and misassembly. Here we focus on creating a framework to understand the evolution, copy number variation and coding potential of human-specific segmental duplications (HSDs). We identify 218 HSDs (>5 kbp in length) based on analysis of 322 deeply sequenced ape and human genomes. We target 268 large-insert human bacterial artificial chromosomes, 85 of which have been incorporated into the most recent human reference build (GRCh38) correcting 24 large euchromatic gaps, and 269 nonhuman primate clones for finished sequencing in order to resolve the structure and evolution of the largest, most complex regions with protein-coding potential (n=80 genes/33 gene families). Our analyses indicate that these HSDs (28 duplications ranging in length from 11–677 kbp) are non-randomly organized (P<1x10-6), cluster in association with core duplicons (P<1x10-7) and the majority represent intrachromosomal events arranged predominantly in an interspersed inverted orientation (18/26; P=0.014). Phylogenetic reconstruction suggests different waves of HSD with the latest burst occurring <1.3 million years ago. These 16 duplications and 28 genes would be specific to the genus Homo, including three gene families absent in ancient Neanderthal and Denisova genomes. Of particular interest are the TCAF1/TCAF2 family, which is the most stratified of the Homo sapiens-specific duplications and has been implicated in the somatosensation of cold. Overall, copy number variation analysis (n=2,379 genomes), RNA sequence mapping (GTEx) and targeted resequencing of the protein-coding regions (n=3,275 controls) identify ten gene families where copy number never returns to the ancestral state, there is evidence of mRNA splicing and expression, and no common gene-disruptive mutation events are observed in the general population. We propose that this subset of genes, including functional paralogs ARHGAP11B and SRGAP2C, represents excellent candidates for the evolution of human-specific adaptive traits.en_US
dc.description.sponsorshipThis work was supported, in part, by U.S. National Institutes of Health (NIH) grants from NINDS (R00NS083627, M.Y.D.) and NHGRI (R01HG002385 and P01HG004120, E.E.E. and U41HG007635 to R.K.W. and E.E.E.) as well as The Paul G. Allen Family Foundation (11631 to E.E.E.). S.C. is supported by a National Health and Medical Research Council (NHMRC) CJ Martin Biomedical Fellowship (#1073726). E.E.E. is an investigator of the Howard Hughes Medical Institute.en_US
dc.identifier.citationDennis MY, Harshman L, Nelson BJ, Penn O, Cantsilieris S, Huddleston J, Antonacci F, Penewit K, Denman L, Raja A, Baker C, Mark K, Malig M, Janke N, Espinoza C, Stessman HAF, Nuttle X, Hoekzema K, Lindsay-Graves TA, Wilson RK, Eichler EE. (2017). The evolution and population diversity of human-specific segmental duplications. Nat Ecol Evol Feb 17;1:69. doi:10.1038/s41559-016-0069.en_US
dc.identifier.otherdoi:10.1038/s41559-016-0069
dc.identifier.urihttp://hdl.handle.net/1773/38703
dc.language.isoen_USen_US
dc.publisherNature Ecology & Evolutionen_US
dc.rightsAttribution 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/us/*
dc.subjectsegmental duplicationsen_US
dc.titleThe evolution and population diversity of human-specific segmental duplicationsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 5 of 5
Loading...
Thumbnail Image
Name:
HSD_MDennis_NatEcoEvo_comb-InitialSubmission.pdf
Size:
5.82 MB
Format:
Adobe Portable Document Format
Description:
main article
Loading...
Thumbnail Image
Name:
Dennis_NatEcolEvol_2017-Supp.pdf
Size:
4.71 MB
Format:
Adobe Portable Document Format
Description:
Supplement
Loading...
Thumbnail Image
Name:
Dennis_NatEcolEvol_2017-SuppTables.xlsx
Size:
1.37 MB
Format:
Microsoft Excel XML
Description:
Supplemental tables
Loading...
Thumbnail Image
Name:
Dennis_NatEcolEvol_2017-SuppDataset1.zip
Size:
7.59 MB
Format:
Unknown data format
Description:
Supplement Dataset 1
Loading...
Thumbnail Image
Name:
Dennis_NatEcolEvol_2017-SuppDataset2.xlsx
Size:
2.68 MB
Format:
Microsoft Excel XML
Description:
Supplement Dataset 2

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.6 KB
Format:
Item-specific license agreed upon to submission
Description: