Adding auto-generated video to your slides

Author(s)	Helena Rasche
Reviewers

Overview
Questions:

How can we add auto-generated video?

How does it work?

What do I need to do to make it optimal for viewers?

Objectives:

Adding a video to a set of slides

Time estimation: 20 minutes

Supporting Materials:

FAQs

Published: Oct 20, 2020

Last modification: Nov 9, 2023

License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License. The GTN Framework is licensed under MIT

purl PURL: https://gxy.io/GTN:T00071

version Revision: 12

Video Lectures

Based on the work by Delphine Larivière and James Taylor with their COVID-19 Lectures we have implemented a similar feature in the Galaxy Training Network.

Agenda

In this tutorial, we will:

Video Lectures

How it Works

Enabling Video

Writing Good Captions

Enable the Video

Voices

How it works: In Detail

Conclusion

How it Works

We wrote a short script which does the following:

Locally and in production:

Extracts a ‘script’ from the slides. We extract every presenter comment in the slidedeck, and turn this into a text file.
Every line of this text file is then narrated by Amazon Polly (if you have money) or MozillaTTS (free).
The slide deck is converted to a PDF, and then each slide is extracted as a PNG.
Captions are extracted from the audio components.
The narration is stitched together into an mp3
The images are stitched together into an mp4 file
The video, audio, and captions are muxed together into a final mp4 file

In production

We use Amazon Polly, paid for by the Galaxyproject
The result is uploaded to an S3 bucket

Enabling Video

We have attempted to simplify this process as much as possible, but making good slides which work well is up to you.

Writing Good Captions

Every slide must have some narration in the presenter notes. It does not make sense for students to see a slide without commentary. For each slide, you’ll need to write presenter notes in full, but short sentences.

Sentence Structure

Use simple and uncomplex sentences whenever possible. Break up ideas into easy to digest bits. Students will be listening to this spoken and possibly reading the captions.

2021-05-01 There used to be a limit of ~120 characters per sentence, but this is no longer an issue. We now break up sentences which are too long in the captions and show them over multiple timepoints. So if you need to write a really long sentence, you can, but we still advise to simplify sentences where possible.

Captions per Slide

Every slide must have some speaker notes in this system, NO exceptions.

Punctuation

Sentences should end with punctuation like . or ? or even ! if you’re feeling excited.

Abbreviations

These are generally fine as-is. (e.g. e.g./i.e. is fine as-is, RNA is fine, etc.) Make sure abbreviations are all caps though.

Good This role deploys CVMFS.

“Weird” Names

In the captions you will want to teach the GTN how to pronounce these words by editing bin/ari-map.yml to provide your definition.

E.g.

Word	Pronunciation
SQLAlchemy	SQL alchemy
FastQC	fast QC
nginx	engine X
gxadmin	GX admin
/etc	/ E T C

The same applies to the many terms we read differently from how they are written, e.g. ‘src’ vs ‘source’. Most of us would pronounce it like the latter, even though it isn’t spelt that way. Our speaking robot doesn’t know what we mean, so we need to spell it out properly.

So we write the definition in the bin/ari-map.yml file.

Other Considerations

(Written 2020-12-16, things may have changed since.)

Be sure to check the pronunciation of the slides. There are known issues with heteronyms, words spelt the same but having different pronunciation and meaning. Consider “read” for a classic example, or “analyses” for one that comes up often in the GTN. “She analyses data” and “Multiple analyses” are pronounced quite differently based on their usage in sentences. See the wiktionary page for more information, or the list of English heteronyms you might want to be aware of.

This becomes an issue for AWS Polly and Mozilla’s TTS which both don’t have sufficient context sometimes to choose between the two pronunciations. You’ll find that “many analyses” is pronounced correctly while “multiple analyses” isn’t.

Oftentimes the services don’t understand part of speech, so by adding adjectives to analyses, you confuse the engine in to thinking it should be the third person singular pronunciation. This is probably because it only has one or two words of context ahead of the word to be pronounced.

Enable the Video

Lastly, we need to tell the GTN framework we would like videos to be generated.

Hands On: Enable video

Edit the slides.html for your tutorial

Add video: true to the top

That’s it! With this, videos can be automatically generated.

Voices

There are multiple voices available, see the following list:

Name	Region	Neural
Amy	en-GB	True
Aria	en-NZ	True
Brian	en-GB	True
Emma	en-GB	True
Joanna	en-US	True
Joey	en-US	True
Kendra	en-US	True
Matthew	en-US	True
Nicole	en-AU	False
Olivia	en-AU	True
Raveena	en-IN	False
Salli	en-US	True
Ayanda	en-ZA	True
Geraint	en-GB-WLS	True

By default a random voice is chosen every time the video is rebuilt (only whenever a change is made to that slide deck.) We do this to ensure a good diversity of genders and nationalities in the audio samples.

However, if you have a preferred voice, you can set that permanently for that video, add the following metadata to the top of your slide deck:

voice:
  id: Lupe
  lang: es-US
  neural: true

The above voice example is specific to Spanish language content, hence not being represented in the first list.

How it works: In Detail

We take our markdown slides, e.g. topics/introduction/tutorials/galaxy-intro-short/slides.html
In order for them to be processed, slides must have an annotation saying video: true in the header metadata, and then ‘speaker notes’ (everything after the ??? before the —)
This is turned into our ‘plain text slides’ which just renders the markdown a bit more nicely (example)
Then we run ari.sh which does the following:
- make video is run which runs bin/ari-make.sh
- This builds PDFs for any slides which have changed
- And runs ./bin/ari.sh with the PDF, the original Slides, and where the mp4 should be saved.
  - In ./bin/ari.sh
  - It extracts metadata from the tutorial (title, authors, etc.)
  - It builds a ‘script’, a json document with blocks for every line of the speaker notes that were in the slides.
  - Those get converted into mp3 files by AWS Polly (or MozillaTTS), one per slide.
  - The PDFs get turned into a series of PNG images
  - We take the timings of the mp3 files together with the json ‘script’ to write out webvtt / srt subtitles which get embedded into the video, and supplied next to it.
  - editly is used to knit together the PNGs + mp3s with appropriate delay

All of this is run on cron by .github/workflows/video.yml which handles building all of these videos and then later uploading them to s3.

Many of the scripts internally are prefixed with ari, we named our internal version after github.com/jhudsl/ari/ which inspired it, but we wanted a version that would be more closely tied to the GTN and integrate with our infrastructure nicely, so we ended up writing our own.

Conclusion

You've Finished the Tutorial

Key points

Thanks to the GTN, videos are easy to add

Be mindful of your captions. Short sentences are good!

Frequently Asked Questions

Have questions about this tutorial? Have a look at the available FAQ pages and support channels

Feedback

Did you use this material as an instructor? Feel free to give us feedback on how it went.
Did you use this material as a learner or student? Click the form below to leave feedback.

Citing this Tutorial

Helena Rasche, Adding auto-generated video to your slides (Galaxy Training Materials). https://training.galaxyproject.org/training-material/topics/contributing/tutorials/slides-with-video/tutorial.html Online; accessed TODAY
Hiltemann, Saskia, Rasche, Helena et al., 2023 Galaxy Training: A Powerful Framework for Teaching! PLOS Computational Biology 10.1371/journal.pcbi.1010752
Batut et al., 2018 Community-Driven Data Analysis Training for Biology Cell Systems 10.1016/j.cels.2018.05.012

@misc{contributing-slides-with-video,
author = "Helena Rasche",
	title = "Adding auto-generated video to your slides (Galaxy Training Materials)",
	year = "",
	month = "",
	day = "",
	url = "\url{https://training.galaxyproject.org/training-material/topics/contributing/tutorials/slides-with-video/tutorial.html}",
	note = "[Online; accessed TODAY]"
}
@article{Hiltemann_2023,
	doi = {10.1371/journal.pcbi.1010752},
	url = {https://doi.org/10.1371%2Fjournal.pcbi.1010752},
	year = 2023,
	month = {jan},
	publisher = {Public Library of Science ({PLoS})},
	volume = {19},
	number = {1},
	pages = {e1010752},
	author = {Saskia Hiltemann and Helena Rasche and Simon Gladman and Hans-Rudolf Hotz and Delphine Larivi{\`{e}}re and Daniel Blankenberg and Pratik D. Jagtap and Thomas Wollmann and Anthony Bretaudeau and Nadia Gou{\'{e}} and Timothy J. Griffin and Coline Royaux and Yvan Le Bras and Subina Mehta and Anna Syme and Frederik Coppens and Bert Droesbeke and Nicola Soranzo and Wendi Bacon and Fotis Psomopoulos and Crist{\'{o}}bal Gallardo-Alba and John Davis and Melanie Christine Föll and Matthias Fahrner and Maria A. Doyle and Beatriz Serrano-Solano and Anne Claire Fouilloux and Peter van Heusden and Wolfgang Maier and Dave Clements and Florian Heyl and Björn Grüning and B{\'{e}}r{\'{e}}nice Batut and},
	editor = {Francis Ouellette},
	title = {Galaxy Training: A powerful framework for teaching!},
	journal = {PLoS Comput Biol}
}

                   

Congratulations on successfully completing this tutorial!

Developing GTN training material
This tutorial is part of a series to develop GTN training material, feel free to also look at:

Overview of the Galaxy Training Material

Contributing to the Galaxy Training Network with GitHub

Principles of learning and how they apply to training and teaching

Contributing with GitHub via its interface

Preview the GTN website as you edit your training material

Including a new topic

GTN Metadata

Adding auto-generated video to your slides

Design and plan session, course, materials

Updating diffs in admin training

Generating PDF artefacts of the website

Teaching Python

Tools, Data, and Workflows for tutorials

Adding Quizzes to your Tutorial

Creating Interactive Galaxy Tours

Creating a new tutorial

Single Cell Publication - Data Analysis

FAIR Galaxy Training Material

Single Cell Publication - Data Plotting

Creating content in Markdown

Creating Slides

Updating tool versions in a tutorial

You can use Ephemeris's shed-tools install command to install the tools used in this tutorial.
shed-tools install [-g GALAXY] [-a API_KEY] -t <(curl https://training.galaxyproject.org/training-material/api/topics/contributing/tutorials/slides-with-video/tutorial.json | jq .admin_install_yaml -r)
Alternatively you can copy and paste the following YAML
---
install_tool_dependencies: true
install_repository_dependencies: true
install_resolver_dependencies: true
tools: []

No feedback has been recieved yet for this training. Be the first one by filling in the feedback form.