Abstract
Continuous improvements in long-read sequencing allow us to tackle increasingly big and complex genomes. Here we present the principles of long-read genome assembly, taking Solanum pennellii nanopore sequencing as an example.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Provart NJ, Brady SM, Parry G et al (2021) Anno genominis XX: 20 years of Arabidopsis genomics. Plant Cell 33(4):832–845
Michael TP, Jupe F, Bemm F et al (2018) High contiguity Arabidopsis thaliana genome assembly with a single nanopore flow cell. Nat Commun 9(1):541
Michael TP, VanBuren R (2020) Building near-complete plant genomes. Curr Opin Plant Biol 54:26–33
Panda K, Slotkin RK (2020) Long-read cDNA sequencing enables a “gene-like” transcript annotation of transposable elements. Plant Cell 32(9):2687–2698
Shahid S, Slotkin RK (2020) The current revolution in transposable element biology enabled by long reads. Curr Opin Plant Biol 54:49–56
Schmidt MHW, Vogel A, Denton AK et al (2017) De novo assembly of a new Solanum pennellii accession using Nanopore sequencing. Plant Cell 29(10):2336–2348
Belser C, Istace B, Denis E et al (2018) Chromosome-scale assemblies of plant genomes using nanopore long reads and optical maps. Nat Plants 4(11):879–887
Della Coletta R, Qiu Y, Ou S et al (2021) How the pan-genome is changing crop genomics and improvement. Genome Biol 22(1):3
Lewin HA, Robinson GE, Kress WJ et al (2018) Earth BioGenome project: sequencing life for the future of life. Proc Natl Acad Sci U S A 115(17):4325–4333
Amarasinghe SL, Ritchie ME, Gouil Q (2021) long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data. GigaScience. 10(2):giab003
Li H. Seqtk: toolkit for processing sequences in FASTA/Q formats. Accessed 05 Mar 2021. https://github.com/lh3/seqtk
Fukasawa Y, Ermini L, Wang H et al (2020) LongQC: a quality control tool for third generation sequencing long read data. G3 (Bethesda) 10(4):1193–1196
Wick R. Porechop: adapter trimmer for Oxford Nanopore reads. Accessed: 2021-03-05. https://github.com/rrwick/Porechop
Wick R. Filtlong: quality filtering tool for long reads. Accessed: 2021-03-05. https://github.com/rrwick/Filtlong
Kolmogorov M, Yuan J, Lin Y et al (2019) Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37(5):540–546
Li H (2018) Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34(18):3094–3100
Oxford Nanopore technologies. Medaka: sequence correction. Accessed: 2021-03-05. https://github.com/nanoporetech/medaka
Gurevich A, Saveliev V, Vyahhi N et al (2013) QUAST: quality assessment tool for genome assemblies. Bioinformatics 29(8):1072–1075
Seppey M, Manni M, Zdobnov EM (2019) BUSCO: assessing genome assembly and annotation completeness. In: Kollmar M (ed) Gene prediction, Methods in Molecular Biology, vol 1962. Springer New York, New York, NY, pp 227–245
Wick RR, Judd LM, Holt KE (2019) Performance of neural network basecalling tools for Oxford Nanopore sequencing. Genome Biol 20(1):129
Murigneux V, Rai SK, Furtado A et al (2020) Comparison of long-read methods for sequencing and assembly of a plant genome. GigaScience 9(12):giaa146
Leger A, Leonardi T (2019) pycoQC, interactive quality control for Oxford Nanopore sequencing. J Open Source Soft 4(34):1236
Vaser R, Sović I, Nagarajan N et al (2017) Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res 27(5):737–746
Walker BJ, Abeel T, Shea T et al (2014) Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9(11):e112963
Bolger A, Scossa F, Bolger ME et al (2014) The genome of the stress tolerant wild tomato species Solanum pennellii. Nat Genet 46(9):1034–1038
Tørresen OK, Star B, Mier P et al (2019) Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases. Nucleic Acids Res 47(21):10994–11006
Lang D, Zhang S, Ren P et al (2020) Comparison of the two up-to-date sequencing technologies for genome assembly: HiFi reads of Pacific biosciences sequel II system and ultralong reads of Oxford Nanopore. GigaScience 9(12):giaa123
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
1 Electronic Supplementary Material
Data S1
(TXT 1 KB)
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Gouil, Q. (2022). Assembling Plant Genomes with Long-Read Sequencing. In: Lambing, C. (eds) Plant Gametogenesis. Methods in Molecular Biology, vol 2484. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-2253-7_22
Download citation
DOI: https://doi.org/10.1007/978-1-0716-2253-7_22
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-2252-0
Online ISBN: 978-1-0716-2253-7
eBook Packages: Springer Protocols