Assembly

DNA sequence data has become an indispensable tool for Molecular Biology & Evolutionary Biology. Study in these fields now require a genome sequence to work from. We call this a ‘Reference Sequence.’ We need to build a reference for each species. We do this by Genome Assembly. De novo Genome Assembly is the process of reconstructing the original DNA sequence from the fragment reads alone.

Requirements

Before diving into this topic, we recommend you to have a look at:

Material

You can view the tutorial materials in different languages by clicking the dropdown icon next to the slides (slides) and tutorial (tutorial) buttons below.
Lesson Slides Hands-on Recordings Input dataset Workflows
An Introduction to Genome Assembly
An introduction to get started in genome assembly and annotation
Assembly of metagenomic sequencing data
Assembly of the mitochondrial genome from PacBio HiFi reads
Chloroplast genome assembly
De Bruijn Graph Assembly
Decontamination of a genome assembly
Deeper look into Genome Assembly algorithms
ERGA post-assembly QC
Genome Assembly Quality Control
Genome Assembly of MRSA from Oxford Nanopore MinION data (and optionally Illumina data)
Genome Assembly of a bacterial genome (MRSA) sequenced using Illumina MiSeq Data
Genome assembly using PacBio data
Large genome assembly and polishing
Making sense of a newly assembled genome
Unicycler Assembly
Unicycler assembly of SARS-CoV-2 genome with preprocessing to remove human genome reads
Using the VGP workflows to assemble a vertebrate genome with HiFi and Hi-C data
Vertebrate genome assembly using HiFi, Bionano and Hi-C data - Step by Step

Frequently Asked Questions

Common questions regarding this topic have been collected on a dedicated FAQ page . Common questions related to specific tutorials can be accessed from the tutorials themselves.

Follow topic updates rss-feed with our RSS Feed

Community Resources

Community Home Maintainer Home

Editorial Board

This material is reviewed by our Editorial Board:

orcid logoSimon Gladman avatar Simon GladmanAnton Nekrutenko avatar Anton Nekrutenkoorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoCristóbal Gallardo avatar Cristóbal Gallardo

Contributors

This material was contributed to by:

orcid logoWolfgang Maier avatar Wolfgang MaierMarcella Sozzoni avatar Marcella Sozzoniorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoPolina Polunina avatar Polina Poluninaorcid logoTom Brown avatar Tom Brownorcid logoHelena Rasche avatar Helena RascheDeepti Varshney avatar Deepti Varshneyorcid logoLinelle Abueg avatar Linelle AbuegFabian Recktenwald avatar Fabian RecktenwaldBazante Sanders avatar Bazante Sandersorcid logoSimon Gladman avatar Simon Gladmanorcid logoErwan Corre avatar Erwan Correorcid logoNate Coraor avatar Nate Coraororcid logoSaskia Hiltemann avatar Saskia Hiltemannorcid logoAnna Syme avatar Anna SymeGiulio Formenti avatar Giulio FormentiTeresa Müller avatar Teresa MüllerBrandon Pickett avatar Brandon Pickettorcid logoMiaomiao Zhou avatar Miaomiao Zhouorcid logoCristóbal Gallardo avatar Cristóbal Gallardoorcid logoBjörn Grüning avatar Björn GrüningAnton Nekrutenko avatar Anton NekrutenkoMatúš Kalaš avatar Matúš Kalašorcid logoAnthony Bretaudeau avatar Anthony Bretaudeauorcid logoLaura Leroi avatar Laura Leroiorcid logoAlex Ostrovsky avatar Alex Ostrovskyorcid logoBérénice Batut avatar Bérénice Batutorcid logoAlexandre Cormier avatar Alexandre Cormierorcid logoStéphanie Robin avatar Stéphanie Robin

Funding

These individuals or organisations provided funding support for the development of this resource