The evolution and population diversity of human-specific segmental duplications
| dc.contributor.author | Dennis, Megan Y | |
| dc.contributor.author | Harshman, Lana | |
| dc.contributor.author | Nelson, Bradley J | |
| dc.contributor.author | Penn, Osnat | |
| dc.contributor.author | Cantsilieris, Stuart | |
| dc.contributor.author | Huddleston, John | |
| dc.contributor.author | Antonacci, Francesca | |
| dc.contributor.author | Penewit, Kelsi | |
| dc.contributor.author | Denman, Laura | |
| dc.contributor.author | Raja, Archana | |
| dc.contributor.author | Baker, Carl | |
| dc.contributor.author | Mark, Kenneth | |
| dc.contributor.author | Malig, Maika | |
| dc.contributor.author | Janke, Nicolette | |
| dc.contributor.author | Espinoza, Claudia | |
| dc.contributor.author | Stessman, Holly A | |
| dc.contributor.author | Nuttle, Xander | |
| dc.contributor.author | Hoekzema, Kendra | |
| dc.contributor.author | Graves, Tina A | |
| dc.contributor.author | Wilson, Richard K | |
| dc.contributor.author | Eichler, Evan E | |
| dc.date.accessioned | 2017-05-22T19:55:47Z | |
| dc.date.available | 2017-05-22T19:55:47Z | |
| dc.date.issued | 2017-02-17 | |
| dc.description.abstract | Segmental duplications contribute significantly to the evolution, adaptation and diseaseassociated instability of the human genome. The largest and most identical duplications suffer from the poorest characterization, often corresponding to genome gaps and misassembly. Here we focus on creating a framework to understand the evolution, copy number variation and coding potential of human-specific segmental duplications (HSDs). We identify 218 HSDs (>5 kbp in length) based on analysis of 322 deeply sequenced ape and human genomes. We target 268 large-insert human bacterial artificial chromosomes, 85 of which have been incorporated into the most recent human reference build (GRCh38) correcting 24 large euchromatic gaps, and 269 nonhuman primate clones for finished sequencing in order to resolve the structure and evolution of the largest, most complex regions with protein-coding potential (n=80 genes/33 gene families). Our analyses indicate that these HSDs (28 duplications ranging in length from 11–677 kbp) are non-randomly organized (P<1x10-6), cluster in association with core duplicons (P<1x10-7) and the majority represent intrachromosomal events arranged predominantly in an interspersed inverted orientation (18/26; P=0.014). Phylogenetic reconstruction suggests different waves of HSD with the latest burst occurring <1.3 million years ago. These 16 duplications and 28 genes would be specific to the genus Homo, including three gene families absent in ancient Neanderthal and Denisova genomes. Of particular interest are the TCAF1/TCAF2 family, which is the most stratified of the Homo sapiens-specific duplications and has been implicated in the somatosensation of cold. Overall, copy number variation analysis (n=2,379 genomes), RNA sequence mapping (GTEx) and targeted resequencing of the protein-coding regions (n=3,275 controls) identify ten gene families where copy number never returns to the ancestral state, there is evidence of mRNA splicing and expression, and no common gene-disruptive mutation events are observed in the general population. We propose that this subset of genes, including functional paralogs ARHGAP11B and SRGAP2C, represents excellent candidates for the evolution of human-specific adaptive traits. | en_US |
| dc.description.sponsorship | This work was supported, in part, by U.S. National Institutes of Health (NIH) grants from NINDS (R00NS083627, M.Y.D.) and NHGRI (R01HG002385 and P01HG004120, E.E.E. and U41HG007635 to R.K.W. and E.E.E.) as well as The Paul G. Allen Family Foundation (11631 to E.E.E.). S.C. is supported by a National Health and Medical Research Council (NHMRC) CJ Martin Biomedical Fellowship (#1073726). E.E.E. is an investigator of the Howard Hughes Medical Institute. | en_US |
| dc.identifier.citation | Dennis MY, Harshman L, Nelson BJ, Penn O, Cantsilieris S, Huddleston J, Antonacci F, Penewit K, Denman L, Raja A, Baker C, Mark K, Malig M, Janke N, Espinoza C, Stessman HAF, Nuttle X, Hoekzema K, Lindsay-Graves TA, Wilson RK, Eichler EE. (2017). The evolution and population diversity of human-specific segmental duplications. Nat Ecol Evol Feb 17;1:69. doi:10.1038/s41559-016-0069. | en_US |
| dc.identifier.other | doi:10.1038/s41559-016-0069 | |
| dc.identifier.uri | http://hdl.handle.net/1773/38703 | |
| dc.language.iso | en_US | en_US |
| dc.publisher | Nature Ecology & Evolution | en_US |
| dc.rights | Attribution 3.0 United States | * |
| dc.rights.uri | http://creativecommons.org/licenses/by/3.0/us/ | * |
| dc.subject | segmental duplications | en_US |
| dc.title | The evolution and population diversity of human-specific segmental duplications | en_US |
| dc.type | Article | en_US |
Files
Original bundle
1 - 5 of 5
Loading...
- Name:
- HSD_MDennis_NatEcoEvo_comb-InitialSubmission.pdf
- Size:
- 5.82 MB
- Format:
- Adobe Portable Document Format
- Description:
- main article
Loading...
- Name:
- Dennis_NatEcolEvol_2017-Supp.pdf
- Size:
- 4.71 MB
- Format:
- Adobe Portable Document Format
- Description:
- Supplement
Loading...
- Name:
- Dennis_NatEcolEvol_2017-SuppTables.xlsx
- Size:
- 1.37 MB
- Format:
- Microsoft Excel XML
- Description:
- Supplemental tables
Loading...
- Name:
- Dennis_NatEcolEvol_2017-SuppDataset1.zip
- Size:
- 7.59 MB
- Format:
- Unknown data format
- Description:
- Supplement Dataset 1
Loading...
- Name:
- Dennis_NatEcolEvol_2017-SuppDataset2.xlsx
- Size:
- 2.68 MB
- Format:
- Microsoft Excel XML
- Description:
- Supplement Dataset 2
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.6 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
