Introduction to DH in Galaxy

digital-humanities-introduction_to_dh/introduction-to-digital-humanities

Author(s): Daniela Schneider
Version: 3
Last updated: May 4, 2026
License: CC-BY-4.0
Tags: Introduction, DH, DigitalHumanities, textcleaning

Features
Tutorial (hands-on): Introduction to Digital Humanities in Galaxy

Workflow Testing
Tests: ✅
Results: Not yet automated
FAIRness
PURL: https://gxy.io/GTN:W00361
flowchart TD
  0["ℹ️ Input Dataset\nInput Text one"];
  style 0 stroke:#2c3143,stroke-width:4px;
  1["ℹ️ Input Dataset\nInput Text two"];
  style 1 stroke:#2c3143,stroke-width:4px;
  2["Remove Hyperlink Poem One"];
  0 -->|output| 2;
  3["Remove Hyperlink Poem Two"];
  1 -->|output| 3;
  4["Remove Punctuation Poem One"];
  2 -->|out_file1| 4;
  5["Break Poem One into sentences"];
  2 -->|out_file1| 5;
  6["Remove Punctuation Poem Two"];
  3 -->|out_file1| 6;
  7["Break Poem Two into sentences"];
  3 -->|out_file1| 7;
  8["Generate the first WordCloud"];
  4 -->|outfile| 8;
  56f52608-c40a-468a-bfd2-e6718cc436db["Output\nWordCloud01"];
  8 --> 56f52608-c40a-468a-bfd2-e6718cc436db;
  style 56f52608-c40a-468a-bfd2-e6718cc436db stroke:#2c3143,stroke-width:4px;
  9["Replace Space with linebreak Poem One"];
  4 -->|outfile| 9;
  10["Extract Specific Sentences from Poem One"];
  5 -->|outfile| 10;
  0d463da6-de82-4130-a113-9cf527acdca1["Output\noutput01"];
  10 --> 0d463da6-de82-4130-a113-9cf527acdca1;
  style 0d463da6-de82-4130-a113-9cf527acdca1 stroke:#2c3143,stroke-width:4px;
  11["Replace Space with linebreak Poem Two"];
  6 -->|outfile| 11;
  12["Generate a second WordCloud"];
  6 -->|outfile| 12;
  70815cdd-a460-4d5e-8923-e9842b618393["Output\nWordCloud02"];
  12 --> 70815cdd-a460-4d5e-8923-e9842b618393;
  style 70815cdd-a460-4d5e-8923-e9842b618393 stroke:#2c3143,stroke-width:4px;
  13["Extract Specific Sentences from Poem Two"];
  7 -->|outfile| 13;
  8cad2dd7-d6b6-4320-860c-f9125cf77bef["Output\noutput02"];
  13 --> 8cad2dd7-d6b6-4320-860c-f9125cf77bef;
  style 8cad2dd7-d6b6-4320-860c-f9125cf77bef stroke:#2c3143,stroke-width:4px;
  14["Compare the texts"];
  9 -->|outfile| 14;
  11 -->|outfile| 14;
  22682b82-6f29-43e3-a649-b411e42d5cd2["Output\nhtml_file"];
  14 --> 22682b82-6f29-43e3-a649-b411e42d5cd2;
  style 22682b82-6f29-43e3-a649-b411e42d5cd2 stroke:#2c3143,stroke-width:4px;
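
The cleaning steps in the flowchart above can be approximated locally with standard command-line tools. The sketch below is only an illustration: the function names, the regular expression, and the sample sentence are assumptions made for this example, not the parameters configured in the Galaxy workflow.

```shell
# Hypothetical local approximation of the workflow's text-cleaning steps.
# The patterns are assumptions, not the workflow's actual settings.

# "Remove Hyperlink": delete anything that looks like an http(s) URL.
remove_hyperlinks() {
  sed -E 's#https?://[^[:space:]]+##g'
}

# "Remove Punctuation": delete all punctuation characters.
remove_punctuation() {
  tr -d '[:punct:]'
}

# "Replace Space with linebreak": put one word on each line, so that a
# line-based comparison works word by word.
one_word_per_line() {
  tr -s '[:space:]' '\n'
}

# Chain the three steps in the order the flowchart applies them.
clean_text() {
  remove_hyperlinks | remove_punctuation | one_word_per_line
}

# Example with made-up input:
printf 'Roses are red, see https://example.org now.\n' | clean_text
```

Running `clean_text` on both poems and passing the two results to `diff` would roughly correspond to the final "Compare the texts" step.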

Inputs

Input | Label
Input dataset | Input Text one
Input dataset | Input Text two

Outputs

From | Output | Label
toolshed.g2.bx.psu.edu/repos/bgruening/wordcloud/wordcloud/1.9.6+galaxy0 | Generate a word cloud | Generate the first WordCloud
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy3 | Search in textfiles | Extract Specific Sentences from Poem One
toolshed.g2.bx.psu.edu/repos/bgruening/wordcloud/wordcloud/1.9.6+galaxy0 | Generate a word cloud | Generate a second WordCloud
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy3 | Search in textfiles | Extract Specific Sentences from Poem Two
toolshed.g2.bx.psu.edu/repos/bgruening/diff/diff/3.10+galaxy1 | diff | Compare the texts

Tools

Tool | Links
Remove beginning1 |
toolshed.g2.bx.psu.edu/repos/bgruening/diff/diff/3.10+galaxy1 | View in ToolShed
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy3 | View in ToolShed
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy3 | View in ToolShed
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_line/9.5+galaxy3 | View in ToolShed
toolshed.g2.bx.psu.edu/repos/bgruening/wordcloud/wordcloud/1.9.6+galaxy0 | View in ToolShed

To use this workflow in Galaxy, either download the workflow file, or right-click the workflow link, copy its address, and paste it into Galaxy's workflow import form.

Importing into Galaxy

The steps below import this workflow directly into the Galaxy server of your choice.
Hands On: Importing a workflow
  1. Click on Workflows in the Galaxy activity bar (on the left side of the screen, or in the top menu bar of older Galaxy instances). You will see a list of all your workflows.
  2. Click on Import at the top-right of the screen
  3. Provide your workflow
    • Option 1: Paste the URL of the workflow into the box labelled “Archived Workflow URL”
    • Option 2: Upload the workflow file in the box labelled “Archived Workflow File”
  4. Click the Import workflow button
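
The same import can also be scripted. The sketch below rests on assumptions you should verify against your server's API documentation: that the server accepts a POST request to `/api/workflows` with an `archive_source` field, and that it authenticates through an `x-api-key` header. `GALAXY_URL` and `GALAXY_API_KEY` are placeholder environment variables, and both function names are hypothetical.

```shell
# Hypothetical helper: import a workflow into Galaxy from a URL.
# Assumptions: GALAXY_URL and GALAXY_API_KEY are exported by you; the endpoint
# and field name must be checked against your server's API documentation.

WORKFLOW_URL="https://training.galaxyproject.org/training-material/topics/digital-humanities/tutorials/introduction_to_dh/workflows/introduction-to-digital-humanities.ga"

# Build the JSON request body for a given workflow URL.
build_import_payload() {
  printf '{"archive_source": "%s"}' "$1"
}

# Send the import request. This function is defined but not called here.
import_workflow() {
  curl --fail --silent --show-error \
    -X POST "${GALAXY_URL}/api/workflows" \
    -H "x-api-key: ${GALAXY_API_KEY}" \
    -H "Content-Type: application/json" \
    -d "$(build_import_payload "$1")"
}

# Usage, once both variables are exported:
#   import_workflow "$WORKFLOW_URL"
```

Keeping the payload builder separate from the network call makes it possible to inspect the request body before anything is sent to the server.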

Below is a short video demonstrating how to import a workflow from GitHub using this procedure:

Video: Importing a workflow from URL

Version History

Version | Commit | Time | Comments
5 | ae13dd9bb | 2026-05-04 07:28:02 | Updated WF to show proper output labels
4 | 28e46ee00 | 2026-04-30 11:35:08 | Updated WF file and Tutorial answer history correspondingly
3 | b9f9a43db | 2026-04-30 10:51:59 | Added updated WF
2 | 88b5f825c | 2025-11-24 12:47:42 | Add files via upload
1 | 7363e4025 | 2025-09-09 12:06:43 | Add tests for intro to digital humanities tutorial

For Admins

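Installing the workflow tools

The commands below use the Ephemeris command-line tools. Replace GALAXY with the URL of your Galaxy server and API_KEY with the API key of an admin account.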

wget https://training.galaxyproject.org/training-material/topics/digital-humanities/tutorials/introduction_to_dh/workflows/introduction-to-digital-humanities.ga -O workflow.ga
workflow-to-tools -w workflow.ga -o tools.yaml
shed-tools install -g GALAXY -a API_KEY -t tools.yaml
workflow-install -g GALAXY -a API_KEY -w workflow.ga --publish-workflows