DNA sequence data has become an indispensable tool for Molecular Biology & Evolutionary Biology. Study in these fields now require a genome sequence to work from. We call this a 'Reference Sequence.' We need to build a reference for each species. We do this by Genome Assembly. De novo Genome Assembly is the process of reconstructing the original DNA sequence from the fragment reads alone.


Before diving into this topic, we recommend you to have a look at:

Galaxy instances

You can use a public Galaxy instance which has been tested for the availability of the used tools. They are listed along with the tutorials above.

You can also use the following Docker image for these tutorials:

docker run -p 8080:80

NOTE: Use the -d flag at the end of the command if you want to automatically download all the data-libraries into the container.

It will launch a flavored Galaxy instance available on http://localhost:8080. This instance will contain all the tools and workflows to follow the tutorials in this topic. Login as admin with password password to access everything.


This material is maintained by:

Simon GladmanSimon Gladman, Anton NekrutenkoAnton Nekrutenko, Delphine LariviereDelphine Lariviere

For any question related to this topic and the content, you can contact them or visit our Gitter channel.


This material was contributed to by:

Simon GladmanSimon Gladman, Helena RascheHelena Rasche, Saskia HiltemannSaskia Hiltemann, Anton NekrutenkoAnton Nekrutenko, Delphine LariviereDelphine Lariviere, Wolfgang MaierWolfgang Maier