The recent discovery of a plasmid-borne colistin resistance gene, mcr-1, heralds the emergence of truly pan-drug resistant bacteria (1).….
Obtaining high-quality samples from wild animals is a major obstacle for genomic studies of many taxa, particularly at the population level, as collection methods for such samples are typically invasive. DNA from feces is easy to obtain noninvasively, but is dominated by bacterial and other non-host DNA. The high proportion of non-host DNA drastically reduces the efficiency of high-throughput sequencing for host animal genomics. To address this issue, we developed an inexpensive capture method for enriching host DNA from noninvasive fecal samples. Our method exploits natural differences in CpG-methylation density between vertebrate and bacterial genomes to preferentially bind and isolate host DNA from majority-bacterial samples. We demonstrate that the enrichment is robust, efficient, and compatible with downstream library preparation methods useful for population studies (e.g., RADseq). Compared to other enrichment strategies, our method is quick and inexpensive, adding only a negligible cost to sample preparation. In combination with downstream methods such as RADseq, our approach allows for cost-effective and customizable genomic-scale genotyping that was previously feasible in practice only with invasive samples. Because feces are widely available and convenient to collect, our method empowers researchers to explore genomic-scale population-level questions in organisms for which invasive sampling is challenging or undesirable.
Neoplasms occur naturally in invertebrates but are not known to develop in tapeworms. We observed nests of monomorphic, undifferentiated cells in samples from lymph-node and lung biopsies in a man infected with the human immunodeficiency virus (HIV). The morphologic features and invasive behavior of the cells were characteristic of cancer, but their small size suggested a nonhuman origin. A polymerase-chain-reaction (PCR) assay targeting eukaryotes identified Hymenolepis nana DNA. Although the cells were unrecognizable as tapeworm tissue, immunohistochemical staining and probe hybridization labeled the cells in situ. Comparative deep sequencing identified H. nana structural genomic variants that are compatible with mutations described in cancer. Invasion of human tissue by abnormal, proliferating, genetically altered tapeworm cells is a novel disease mechanism that links infection and cancer.
The CRISPR-associated endonuclease Cas9 binds to a guide RNA and cleaves double-stranded DNA with a sequence complementary to the RNA guide. The Cas9-RNA system has been harnessed for numerous applications, such as genome editing. Here we use high-speed atomic force microscopy (HS-AFM) to visualize the real-space and real-time dynamics of CRISPR-Cas9 in action. HS-AFM movies indicate that, whereas apo-Cas9 adopts unexpected flexible conformations, Cas9-RNA forms a stable bilobed structure and interrogates target sites on the DNA by three-dimensional diffusion. These movies also provide real-time visualization of the Cas9-mediated DNA cleavage process. Notably, the Cas9 HNH nuclease domain fluctuates upon DNA binding, and subsequently adopts an active conformation, where the HNH active site is docked at the cleavage site in the target DNA. Collectively, our HS-AFM data extend our understanding of the action mechanism of CRISPR-Cas9.
- Proceedings of the National Academy of Sciences of the United States of America
- Published over 3 years ago
Epidemiological and genetic association studies show that genetics play an important role in the attainment of education. Here, we investigate the effect of this genetic component on the reproductive history of 109,120 Icelanders and the consequent impact on the gene pool over time. We show that an educational attainment polygenic score, POLYEDU, constructed from results of a recent study is associated with delayed reproduction (P < 10(-100)) and fewer children overall. The effect is stronger for women and remains highly significant after adjusting for educational attainment. Based on 129,808 Icelanders born between 1910 and 1990, we find that the average POLYEDU has been declining at a rate of ∼0.010 standard units per decade, which is substantial on an evolutionary timescale. Most importantly, because POLYEDU only captures a fraction of the overall underlying genetic component the latter could be declining at a rate that is two to three times faster.
Whereas domestication of livestock, pets, and crops is well documented, it is still unclear to what extent microbes associated with the production of food have also undergone human selection and where the plethora of industrial strains originates from. Here, we present the genomes and phenomes of 157 industrial Saccharomyces cerevisiae yeasts. Our analyses reveal that today’s industrial yeasts can be divided into five sublineages that are genetically and phenotypically separated from wild strains and originate from only a few ancestors through complex patterns of domestication and local divergence. Large-scale phenotyping and genome analysis further show strong industry-specific selection for stress tolerance, sugar utilization, and flavor production, while the sexual cycle and other phenotypes related to survival in nature show decay, particularly in beer yeasts. Together, these results shed light on the origins, evolutionary history, and phenotypic diversity of industrial yeasts and provide a resource for further selection of superior strains. PAPERCLIP.
- Proceedings of the National Academy of Sciences of the United States of America
- Published almost 4 years ago
We report on the sequencing of 10,545 human genomes at 30×-40× coverage with an emphasis on quality metrics and novel variant and sequence discovery. We find that 84% of an individual human genome can be sequenced confidently. This high-confidence region includes 91.5% of exon sequence and 95.2% of known pathogenic variant positions. We present the distribution of over 150 million single-nucleotide variants in the coding and noncoding genome. Each newly sequenced genome contributes an average of 8,579 novel variants. In addition, each genome carries on average 0.7 Mb of sequence that is not found in the main build of the hg38 reference genome. The density of this catalog of variation allowed us to construct high-resolution profiles that define genomic sites that are highly intolerant of genetic variation. These results indicate that the data generated by deep genome sequencing is of the quality necessary for clinical use.
Background Ebola virus has been detected in the semen of men after their recovery from Ebola virus disease (EVD), but little information is available about its prevalence or the duration of its persistence. We report the initial findings of a pilot study involving survivors of EVD in Sierra Leone. Methods We enrolled a convenience sample of 100 male survivors of EVD in Sierra Leone, at different times after their recovery from EVD, and recorded self-reported information about sociodemographic characteristics, the EVD episode, and health status. Semen specimens obtained at baseline were tested by means of a quantitative reverse-transcriptase-polymerase-chain-reaction (RT-PCR) assay with the use of the target-gene sequences of NP and VP40. Results A total of 93 participants provided an initial semen specimen for analysis, of whom 46 (49%) had positive results on quantitative RT-PCR. Ebola virus RNA was detected in the semen of all 9 men who had a specimen obtained 2 to 3 months after the onset of EVD, in the semen of 26 of 40 (65%) who had a specimen obtained 4 to 6 months after onset, and in the semen of 11 of 43 (26%) who had a specimen obtained 7 to 9 months after onset; the results for 1 participant who had a specimen obtained at 10 months were indeterminate. The median cycle-threshold values (for which higher values indicate lower RNA levels) were 32.0 with the NP gene target and 31.1 with the VP40 gene target for specimens obtained at 2 to 3 months, 34.5 and 32.3, respectively, for specimens obtained at 4 to 6 months, and 37.0 and 35.6, respectively, for specimens obtained at 7 to 9 months. Conclusions These data showed the persistence of Ebola virus RNA in semen and declining persistence with increasing months since the onset of EVD. We do not yet have data on the extent to which positivity on RT-PCR is associated with virus infectivity. Although cases of suspected sexual transmission of Ebola have been reported, they are rare; hence the risk of sexual transmission of the Ebola virus is being investigated. (Funded by the World Health Organization and others.).
In order to explore the diversity and selective signatures of duplication and deletion human copy number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single nucleotide variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load.
Background Despite evidence that genetic factors contribute to the duration of gestation and the risk of preterm birth, robust associations with genetic variants have not been identified. We used large data sets that included the gestational duration to determine possible genetic associations. Methods We performed a genomewide association study in a discovery set of samples obtained from 43,568 women of European ancestry using gestational duration as a continuous trait and term or preterm (<37 weeks) birth as a dichotomous outcome. We used samples from three Nordic data sets (involving a total of 8643 women) to test for replication of genomic loci that had significant genomewide association (P<5.0×10(-8)) or an association with suggestive significance (P<1.0×10(-6)) in the discovery set. Results In the discovery and replication data sets, four loci (EBF1, EEFSEC, AGTR2, and WNT4) were significantly associated with gestational duration. Functional analysis showed that an implicated variant in WNT4 alters the binding of the estrogen receptor. The association between variants in ADCY5 and RAP2C and gestational duration had suggestive significance in the discovery set and significant evidence of association in the replication sets; these variants also showed genomewide significance in a joint analysis. Common variants in EBF1, EEFSEC, and AGTR2 showed association with preterm birth with genomewide significance. An analysis of mother-infant dyads suggested that these variants act at the level of the maternal genome. Conclusions In this genomewide association study, we found that variants at the EBF1, EEFSEC, AGTR2, WNT4, ADCY5, and RAP2C loci were associated with gestational duration and variants at the EBF1, EEFSEC, and AGTR2 loci with preterm birth. Previously established roles of these genes in uterine development, maternal nutrition, and vascular control support their mechanistic involvement. (Funded by the March of Dimes and others.).