Genome_files_Draft assembly of Phytopthora capsici from long-read sequencing uncovers complexity (V2)
datasetposted on 2021-02-24, 02:55 authored by David Haak, Aureliano Bombarely, John McDowell, John Herlihy, Chenming Cui
Resolving complex plant pathogen genomes is important for identifying the genomic shifts associated with rapid adaptation to selective agents such as host and fungicide, yet assembling these genomes remains challenging and expensive. Phytophthora capsici is an important, globally distributed plant pathogen that exhibits wide-spread fungicide resistance and a broad host range. As with other pathogenic oomycetes, P. capsici has a complex life history and a complex genome. Here we leverage Oxford Nanopore Technology (ONT) and existing short read resources, to rapidly generate a low-cost, improved assembly. We generated 10Gbp from a single MinIon flow cell resulting in > 1.25 million reads with an N50 of 13kb. The resulting assembly is 124Mbp in 906 scaffolds with an N50 length of 232kb. This assembly is 92% bigger than the current draft genome of 64Mbp. We confirmed this larger genome size using flow cytometry, with an estimated size of 110Mbp. BUSCO analysis identified 96.5% complete orthologs (39.7% duplicated). Evolutionary analysis supports a recent whole genome duplication in this group. Our work provides a blueprint for rapidly integrating benchtop long-read sequencing with existing short-read data, to dramatically improve assembly quality and integrity of complex genomes and offer novel insights into pathogen genome function and evolution. The original dataset can be found at the following DOI, doi:10.7294/73hy-9s55.