Assembly

DNA sequence data has become an indispensable tool for Molecular Biology & Evolutionary Biology. Study in these fields now require a genome sequence to work from. We call this a ‘Reference Sequence.’ We need to build a reference for each species. We do this by Genome Assembly. De novo Genome Assembly is the process of reconstructing the original DNA sequence from the fragment reads alone.

You can view the tutorial materials in different languages by clicking the dropdown icon next to the slides (slides) and tutorial (tutorial) buttons below.

Requirements

Before diving into this topic, we recommend you to have a look at:

Material

Lesson Slides Hands-on Recordings Input dataset Workflows
An Introduction to Genome Assembly
An introduction to get started in genome assembly and annotation
Assembly of metagenomic sequencing data
Chloroplast genome assembly
De Bruijn Graph Assembly
Deeper look into Genome Assembly algorithms
ERGA post-assembly QC
Genome Assembly Quality Control
Genome Assembly of MRSA from Oxford Nanopore MinION data (and optionally Illumina data)
Genome Assembly of a bacterial genome (MRSA) sequenced using Illumina MiSeq Data
Genome assembly using PacBio data
Large genome assembly and polishing
Making sense of a newly assembled genome
Unicycler Assembly
Unicycler assembly of SARS-CoV-2 genome with preprocessing to remove human genome reads
VGP assembly pipeline - short version
VGP assembly pipeline: Step by Step

Galaxy instances

You can use a public Galaxy instance which has been tested for the availability of the used tools. They are listed along with the tutorials above.

You can also use the following Docker image for these tutorials:

docker run -p 8080:80 quay.io/galaxy/assembly-training

NOTE: Use the -d flag at the end of the command if you want to automatically download all the data-libraries into the container.

It will launch a flavored Galaxy instance available on http://localhost:8080. This instance will contain all the tools and workflows to follow the tutorials in this topic. Login as admin with password password to access everything.

Frequently Asked Questions

Common questions regarding this topic have been collected on a dedicated FAQ page . Common questions related to specific tutorials can be accessed from the tutorials themselves.

Editorial Board

This material is reviewed by our Editorial Board:

orcid logoSimon Gladman avatar Simon GladmanAnton Nekrutenko avatar Anton Nekrutenkoorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoCristóbal Gallardo avatar Cristóbal Gallardo

For any question related to this topic and the content, you can contact them or visit our Gitter channel.

Contributors

This material was contributed to by:

Bazante Sanders avatar Bazante Sandersorcid logoMiaomiao Zhou avatar Miaomiao Zhouorcid logoHelena Rasche avatar Helena Rascheorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoAnna Syme avatar Anna SymeFabian Recktenwald avatar Fabian Recktenwaldorcid logoAlex Ostrovsky avatar Alex OstrovskyBrandon Pickett avatar Brandon Pickettorcid logoBérénice Batut avatar Bérénice Batutorcid logoPolina Polunina avatar Polina Poluninaorcid logoSimon Gladman avatar Simon Gladmanorcid logoWolfgang Maier avatar Wolfgang MaierGiulio Formenti avatar Giulio Formentiorcid logoSaskia Hiltemann avatar Saskia Hiltemannorcid logoErwan Corre avatar Erwan CorreAvans Hogeschool avatar Avans Hogeschoolorcid logoAnthony Bretaudeau avatar Anthony Bretaudeauorcid logoCristóbal Gallardo avatar Cristóbal GallardoAnton Nekrutenko avatar Anton NekrutenkoMarcella Sozzoni avatar Marcella Sozzoniorcid logoLaura Leroi avatar Laura Leroiorcid logoStéphanie Robin avatar Stéphanie RobinLinelle Abueg avatar Linelle Abuegorcid logoAlexandre Cormier avatar Alexandre Cormier

Funders

This material was funded by:

ABRomics avatar ABRomicsGallantries: Bridging Training Communities in Life Science, Environment and Health avatar Gallantries