Nucleoli segmentation and feature extraction using CellProfiler

Author(s)	Beatriz Serrano-Solano Jean-Karim Hériché
Reviewers

Overview
Questions:

How do I run an image analysis pipeline on public data using CellProfiler?

How do I analyse the DNA channel of fluorescence siRNA screens?

How do I download public image data into my history?

How do I segment and label cell nuclei?

How do I segment nucleoli (as the absence of DNA)?

How do I combine nuclei and nucleoli into one segmentation mask?

How do I extract the background of an image?

How do I relate the nucleoli to their parent nucleus?

How do I measure the image and object features?

How do I measure the image quality?

Objectives:

How to download images from a public image repository.

How to segment cell nuclei using CellProfiler in Galaxy.

How to segment cell nucleoli using CellProfiler in Galaxy.

How to extract features for images, nuclei and nucleoli.

Requirements:

Introduction to Galaxy Analyses

tutorial Hands-on: FAIR Bioimage Metadata

tutorial Hands-on: REMBI - Recommended Metadata for Biological Images – metadata guidelines for bioimaging data

Time estimation: 4 hours

Supporting Materials:

Slides

Workflows

FAQs

instances Available on these Galaxies

Known Working

UseGalaxy.eu ✅ ⭐️

Possibly Working

Galaxy@AuBi

UseGalaxy.be

UseGalaxy.cz

UseGalaxy.fr

UseGalaxy.no

UseGalaxy.org.au

Published: Jul 1, 2020

Last modification: Nov 6, 2025

License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License. The GTN Framework is licensed under MIT

purl PURL: https://gxy.io/GTN:T00183

version Revision: 10

The nucleolus is a prominent structure of the nucleus of eukaryotic cells and is involved in ribosome biogenesis and cell cycle regulation. In DNA staining of cells, nucleoli can be identified as the absence of DNA in nuclei (Fig. 1). Phenotypes caused by reduced gene function are widely used to elucidate gene function and image-based RNA interference (RNAi) screens are routinely used to find and characterize genes involved in a particular biological process. While screens typically focus on one biological process of interest, the molecular markers used can also inform on other processes. Re-using published screens image data can then be a cost-effective alternative to performing new experiments. In particular, regardless of the targeted biological process, many screens include a DNA label and therefore can also reveal the effect of gene knock-downs on nucleoli.

In this tutorial, we will analyse DNA channel images of publicly available RNAi screens to extract numerical descriptors (i.e. features) of nucleoli. The images and associated metadata will be retrieved from the Image Data Resource (IDR), a repository that collects image datasets of tissues and cells.

To process and analyse the images, we will use CellProfiler (“CellProfiler 3.0: Next-generation image processing for biology” 2018), a popular image analysis software. CellProfiler normally comes as a desktop application in which users can compose image analysis workflows from a series of modules. Many of these modules are now also available as tools in Galaxy.

To fully emulate the behaviour of the standalone CellProfiler in Galaxy, each image analysis workflow needs to have three parts:

1) StartingModules tool to initialise the pipeline,

2) tools performing the analysis (Fig. 2): identification of the nuclei, nucleoli and background, together with the feature extraction,

3) CellProfiler tool to actually run the pipeline.

High-level view of the workflow. — **Figure 2**: High-level view of the workflow

In this tutorial, you will learn how to create a workflow that downloads a selection of images from the IDR, and uses CellProfiler to segment the nuclei and nucleoli. You will also learn how to extract and export features at three different levels: image, nucleus, nucleolus.

Agenda

In this tutorial, we will cover:

Get data

Start CellProfiler pipeline

Object segmentation

Segment nuclei

Segment nucleoli

Combine segmentation masks

Background extraction

Identify the foreground

Remove the foreground from the original image

Feature extraction

Measure the granularity, texture, intensity, size and shape

Relate nucleoli to their parent nucleus

Measure the image quality-related parameters

Export the features to a CSV

Run the CellProfiler pipeline

Conclusion

Get data

Hands On: Download images from the IDR

If you are logged in, create a new history for this tutorial.

To create a new history simply click the new-history icon at the top of the history panel:

IDR Download ( Galaxy version 0.45) with the following parameters:

“How would you like to specify the IDs of images to download?”: As text (comma-separated list of IDs or a valid IDR link)

“Image IDs to download”: http://idr.openmicroscopy.org/webclient/?show=image-295900|image-295905|image-295910|image-295918|image-295928|image-295934

“Name of the channel to download”: Cy3

“z-plane of images to download”: 0

“Image frame to download”: 0

“Limit the download to a selected region of the image?”: No, download the entire image plane

“Skip failed retrievals?”: Yes

“Download images in a tarball?”: Yes

To get the valid IDR link, go to the dataset of interest in the IDR and select in the preview of a plate a few images ((figure Fig. 3 - 1)). Once you see them at the bottom of the page (figure Fig. 3 - 2), select them again and click the link button in the top-right corner of the right panel (figure Fig. 3 - 3).

Open image in new tab

Figure 3: IDR interface

Comment

IMPORTANT: When the number of images to download is high, it is strongly recommended to enable the option “Download images in a tarball?” in order to improve the performance.

Question

Why are we taking the Cy3 channel in the example data?

How could we download 100,000 images in one go?

The Cy3 dye was used in the study to stain DNA. Since we want to segment the abscence of DNA, Cy3 is the only channel that we need to download from the IDR.

We could upload a text file with the image ids of interest.

Start CellProfiler pipeline

The tool Starting Modules tool comprises the first 4 modules of the standalone CellProfiler. It needs to be the first tool of a workflow because it sets the naming and metadata handling for the rest of tools.

Hands On: Specify metadata to CellProfiler

Starting Modules ( Galaxy version 3.1.9+galaxy2) with the following parameters:

Images

“Do you want to filter only the images? “: Select the images only

Metadata

“Do you want to extract the metadata?”: Yes, specify metadata

“Metadata extraction method”: Extract from file/folder names

“Metadata source”: File name

“Select the pattern to extract metadata from the file name”: field1__field2__field3__field4__field5__field6

“Extract metadata from”: All images

NamesAndTypes

“Process 3D”: No, do not process 3D data

“Assign a name to”: Give every image the same name

“Name to assign these images”: DNA

“Select the image type”: Grayscale image

“Set intensity range from”: Image metadata

Groups

“Do you want to group your images?”: Yes, group the images

“param”: field1

Comment

The images downloaded from the IDR are named following the pattern: plateName__imageID__cropX__cropY__cropWidth__cropHeight. These fields indicate to which plate the image belongs, what is the identifier of the image in the IDR, and the 4 cropping parameters selected. In our case, the upper-left corner (X, Y) and the width and height from there. We have a total of 6 metadata values encoded in the name of the file, separated by __. The pattern to extract our metadata from the file name properly is, therefore: field1__field2__field3__field4__field5__field6. It is important to keep in mind that, for CellProfiler, our plateName will be called field1, imageID will be field2, cropX will be field3, etc. These matches are relevant for a meaningful interpretation of the features.

Object segmentation

Segment nuclei

Since we are interested in segmenting the nucleoli, you may wonder why we need to segment nuclei first. There are several reasons for that:

Get the nuclei features. The intensity, size, shape, number of nucleoli per nucleus, etc. can be informative to study the nucleoli.
Avoid the detection of wrong spots. In a first segmentation, we detect the nuclei, while in the second pass we will segment the holes (nucleoli). The holes need to fall inside the nuclei, and hence the importance of having them segmented too.

In the first step, we will identify the nuclei that are complete, meaning that they are not touching the borders of the image.

Hands On: Segment nuclei that are complete within the boundaries of the image

IdentifyPrimaryObjects ( Galaxy version 3.1.9+galaxy2) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of Starting Modules tool)

“Use advanced settings?”: Yes, use advanced settings

“Enter the name of the input image (from NamesAndTypes)”: DNA

“Enter the name of the primary objects to be identified”: Nuclei

“Typical minimum diameter of objects, in pixel units (Min)”: 15

“Typical maximum diameter of objects, in pixel units (Max)”: 200

“Discard objects outside the diameter range?”: Yes

“Discard objects touching the border of the image?”: Yes

“Threshold strategy”: Global

“Thresholding method”: Otsu

“Two-class or three-class thresholding?”: Two classes

“Threshold correction factor”: 0.9

“Method to distinguish clumped objects”: Shape

“Method to draw dividing lines between clumped objects”: Shape

“Automatically calculate size of smoothing filter for declumping?”: Yes

“Automatically calculate minimum allowed distance between local maxima?”: Yes

“Handling of objects if excessive number of objects identified”: Continue

Comment

The names entered to the input image and objects have to be consistent (case sensitive) with the ones in NamesAndTypes and the tools to follow.

The min and max diameter of the objects (Typical minimum diameter of objects, in pixel units (Min) and Typical minimum diameter of objects, in pixel units (Max)) will have to be adjusted to the resolution of the images.

The Threshold correction factor is used to adjust the threshold in Otsu’s method. The value depends on the images and it might be useful to test several values and pick the one that works best for a particular kind of image.

Question

We are using here Otsu’s thresholding method for segmentation. What other segmentation options are available? What is the difference between them?

In the global methods we have Manual, Measurement, Minimum cross entropy, Otsu and Robust background. For the adaptive ones we only have Otsu. Check the parameters’ help to get more information on each one.

From the previous tool, we got a group of objects (nuclei). Now, we want to export the segmentation masks as a single image to check how well the segmentation algorithm is performing. We also want to label the nuclei with their identifiers for future visual inspection of the results. The output of this step will look like (you will only get this image after running the whole workflow):

**Figure 4**: Identified nuclei with labels.

Question

Why are some nuclei not labeled in the image above?

We have indicated to the tool IdentifyPrimaryObjects tool that the nuclei that are either outside the diameter range or touching the border should be discarded.

Hands On: Mask the nuclei detected

ConvertObjectsToImage ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of IdentifyPrimaryObjects tool)

“Enter the name of the input objects you want to convert to an image”: Nuclei

“Enter the name of the resulting image”: MaskNuclei

“Select the color format”: Binary (black & white)

DisplayDataOnImage ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ConvertObjectsToImage tool)

“Display object or image measurements?”: Object

“Enter the name of the input objects”: Nuclei

“Measurement category”: Number

“Enter the name of the image on which to display the measurements”: DNA

“Display mode”: Text

“Text color”: #ff0000

“Number of decimals”: 0

“Name the output image that has the measurements displayed”: ImageDisplay

SaveImages ( Galaxy version 3.1.9+galaxy2) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of DisplayDataOnImage tool)

“Select the type of image to save”: Image

“Saved the format to save the image(s)”: tiff

“Enter the name of the image to save”: ImageDisplay

“Select method for constructing file names”: From image filename

“Enter the image name (from NamesAndTypes) to be used as file prefix”: DNA

“Append a suffix to the image file name?”: Yes

“Text to append to the image name”: _nucleiNumbers

“Overwrite existing files without warning?”: Yes

Comment

The Text color parameter can be any of your choice, the hexa code is not really relevant. The only consideration is that it needs to be visible on top of the nuclei.

Segment nucleoli

The nucleoli are lacking intensity in the DNA staining and therefore, we need to enhance the black holes before masking.

Hands On: Detect and mask dark holes

EnhanceOrSuppressFeatures ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of SaveImages tool)

“Enter the name of the input image”: DNA

“Enter a name for the resulting image”: DNAdarkholes

“Select the operation”: Enhance

“Feature type”: Dark holes

“Maximum hole size”: 15

MaskImage ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of EnhanceOrSuppressFeatures tool)

“Enter the name of the input image”: DNAdarkholes

“Enter the name of the resulting image”: MaskDNAdarkholes

“Use objects or an image as a mask?”: Objects

“Enter the name objects to mask the input image”: Nuclei

“Invert the mask?”: No

Now that we have all the holes in one mask, we can segment the nucleoli as individual objects in the same way as we did with the nuclei. All the nucleoli can be then combined into one single image.

Hands On: Segment nucleoli as individual objects

IdentifyPrimaryObjects ( Galaxy version 3.1.9+galaxy2) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MaskImage tool)

“Use advanced settings?”: Yes, use advanced settings

“Enter the name of the input image (from NamesAndTypes)”: MaskDNAdarkholes

“Enter the name of the primary objects to be identified”: Nucleoli

“Typical minimum diameter of objects, in pixel units (Min)”: 2

“Typical maximum diameter of objects, in pixel units (Max)”: 15

“Discard objects touching the border of the image?”: Yes

“Threshold strategy”: Global

“Thresholding method”: Otsu

“Two-class or three-class thresholding?”: Two classes

“Threshold correction factor”: 0.9

“Method to distinguish clumped objects”: Shape

“Method to draw dividing lines between clumped objects”: Shape

“Automatically calculate size of smoothing filter for declumping?”: Yes

“Automatically calculate minimum allowed distance between local maxima?”: Yes

“Handling of objects if excessive number of objects identified”: Continue

ConvertObjectsToImage ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of IdentifyPrimaryObjects tool)

“Enter the name of the input objects you want to convert to an image”: Nucleoli

“Enter the name of the resulting image”: MaskNucleoli

“Select the color format”: Binary (black & white)

Combine segmentation masks

We have now one segmentation mask per image with all the nuclei detected, MaskNuclei, and another one for the nucleoli, MaskNucleoli. These are binary masks in which the background is black and the objects detected are white. We would like to check whether both segmentation steps went well. That could be achieved by combining both of them (using different colors) into one image. Here, we are converting the nucleus mask to blue and the nucleoli to magenta. The outcome that you will obtain after the execution of the workflow will look like:

Combined mask for nuclei and nucleoli. — **Figure 5**: Combined segmentation masks for nuclei (blue) and nucleoli (magenta).

Hands On: Convert and save the nuclei and nucleoli masks

GrayToColor ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ConvertObjectsToImage tool)

“Enter the name of the resulting image”: CombinedMask

“Select a color scheme”: RGB

“Enter the name of the image to be colored red”: MaskNucleoli

“Relative weight for the red image”: 0.8

“Enter the name of the image to be colored blue”: MaskNuclei

“Relative weight for the blue image”: 0.5

SaveImages ( Galaxy version 3.1.9+galaxy2) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of GrayToColor tool)

“Select the type of image to save”: Image

“Saved the format to save the image(s)”: tiff

“Enter the name of the image to save”: CombinedMask

“Select method for constructing file names”: From image filename

“Enter the image name (from NamesAndTypes) to be used as file prefix”: DNA

“Append a suffix to the image file name?”: Yes

“Text to append to the image name”: _combinedMask

“Overwrite existing files without warning?”: Yes

Comment

You can pick any other color of your choice, as long as the contrast is good enough to distinguish both objects.

We are saving here a tiff image but any other format of your choice would work too.

Background extraction

The background extraction is useful for quality control. For instance, in a high-exposed or low-contrast image, the nuclei will not be very different from the background and that may lead to the wrong segmentation.

To extract the background, we first need to get the foreground and subtract it from the original image. We already have the nuclei mask, however, we excluded incomplete nuclei, i.e., those touching the borders or those which sizes are outside the specified range. This means that the mask is not covering all the nuclei and we need to get rid of the constraints to get the complete foreground. At this stage, we want to detect everything with a certain intensity (foreground) and subtract it from the complete image to get the background.

Identify the foreground

Hands On: Segment all nuclei

IdentifyPrimaryObjects ( Galaxy version 3.1.9+galaxy2) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of SaveImages tool)

“Use advanced settings?”: Yes, use advanced settings

“Enter the name of the input image (from NamesAndTypes)”: DNA

“Enter the name of the primary objects to be identified”: NucleiIncludingTouchingBorders

“Typical minimum diameter of objects, in pixel units (Min)”: 15

“Typical maximum diameter of objects, in pixel units (Max)”: 200

“Discard objects outside the diameter range?”: No

“Discard objects touching the border of the image?”: No

“Threshold strategy”: Global

“Thresholding method”: Otsu

“Two-class or three-class thresholding?”: Two classes

“Threshold correction factor”: 0.9

“Method to distinguish clumped objects”: Shape

“Method to draw dividing lines between clumped objects”: Shape

“Automatically calculate size of smoothing filter for declumping?”: Yes

“Automatically calculate minimum allowed distance between local maxima?”: Yes

“Handling of objects if excessive number of objects identified”: Continue

ConvertObjectsToImage ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of IdentifyPrimaryObjects tool)

“Enter the name of the input objects you want to convert to an image”: NucleiIncludingTouchingBorders

“Enter the name of the resulting image”: Image_NucleiIncludingTouchingBorders

“Select the color format”: Binary (black & white)

Remove the foreground from the original image

Hands On: Subtract the foreground from the original image

ImageMath ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ConvertObjectsToImage tool)

“Enter a name for the resulting image”: BG

“Operation”: Subtract

In “First Image”:

“Image or measurement?”: Image

“Enter the name of the first image”: DNA

In “Second Image”:

“Image or measurement?”: Image

“Enter the name of the second image”: Image_NucleiIncludingTouchingBorders

“Ignore the image masks?”: No

Feature extraction

Now that we have the objects of interest segmented and the background extracted, we can start measuring parameters on them. In particular it is relevant to:

measure the granularity, texture and intensity of the nuclei and original image,
measure the intensity of the original image, the background and foreground,
measure the size and shape of nuclei and nucleoli,
assess the quality of the original image,
measure the area ocuppied by nuclei and nucleoli.

A step that requires special attention is the relationship nucleolus-nucleus. This is useful to compute statistics on the number of nucleoli by nucleus.

Comment

The order in which the tools in this section are chained is not relevant for the outcome.

Measure the granularity, texture, intensity, size and shape

Hands On: Measure the granularity, texture, intensity, size and shape

MeasureGranularity ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of ImageMath tool)

In “new image”:

param-repeat “Insert new image”

“Enter the name of a greyscale image to measure”: DNA

MeasureTexture ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureGranularity tool)

In “new image”:

param-repeat “Insert new image”

“Enter the name of an image to measure”: DNA

“Measure images or objects?”: Objects

In “new object”:

param-repeat “Insert new object”

“Enter the names of the objects to measure”: Nuclei

MeasureObjectIntensity ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureTexture tool)

In “new image”:

param-repeat “Insert new image”

“Enter the name of an image to measure”: DNA

In “new object”:

param-repeat “Insert new object”

“Enter the name of the objects to measure”: Nuclei

MeasureObjectSizeShape ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureObjectIntensity tool)

In “new object”:

param-repeat “Insert new object”

“Enter the name of the object to measure”: Nuclei

param-repeat “Insert new object”

“Enter the name of the object to measure”: Nucleoli

Question

Why are we measuring the granularity, texture and intensity of the original image and the nuclei only?

The nucleoli was not stained in the DNA channel and hence, the granularity, texture and intensity are constant values.

Relate nucleoli to their parent nucleus

It might be relevant to compute some statistics on the number of nucleoli inside each nucleus. CellProfiler has a very interesting module to relate both objects, in which each one of the nucleoli is assigned an identifier and linked to the identifier of its parent nucleus.

Hands On: Relate nucleoli to their parent nucleus

RelateObjects ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureObjectSizeShape tool)

“Parent objects”: Nuclei

“Child objects”: Nucleoli

“Calculate child-parent distances?”: Both

“Calculate distances to other parents?”: No

“Do you want to save the children with parents as a new object set?”: Yes

In this section, we will measure the image quality, the area occupied by the nuclei and nucleoli in the original image and the intensity of the foreground and background.

Hands On: Measure the image quality

MeasureImageQuality ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of RelateObjects tool)

“Calculate blur metrics?”: Yes

“Calculate thresholds?”: Yes

“Use all thresholding methods?”: No

In “new threshold method”:

param-repeat “Insert new threshold method”

“Select a thresholding method”: Otsu

“Two-class or three-class thresholding?”: Two classes

MeasureImageAreaOccupied ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureImageQuality tool)

In “new area”:

param-repeat “Insert new area”

“Measure the area occupied in a binary image, or in objects?”: Objects

“Enter the name of the objects to measure”: Nuclei

param-repeat “Insert new area”

“Measure the area occupied in a binary image, or in objects?”: Objects

“Enter the name of the objects to measure”: Nucleoli

MeasureImageIntensity ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureImageAreaOccupied tool)

In “new image”:

param-repeat “Insert new image”

“Enter the name of the image to measure”: DNA

“Measure the intensity only from areas enclosed by objects?”: No

param-repeat “Insert new image”

“Enter the name of the image to measure”: DNA

“Measure the intensity only from areas enclosed by objects?”: Yes

“Enter the name of the objects that the intensity will be aggregated within”: Nuclei

param-repeat “Insert new image”

“Enter the name of the image to measure”: BG

“Measure the intensity only from areas enclosed by objects?”: No

Question

In step 3, we are measuring the area occupied by objects for the DNA twice. Why is that?

We want to have the global intensity of the image, so the parameter “Measure the intensity only from areas enclosed by objects?” has to be set to No. However, we also want to obtain the intensity of the foreground, so only the parts of the images that are enclosed by nuclei.

Export the features to a CSV

All the parameters that we have measured related to the images and objects need to be exported to a file for each one of 6 example images analysed.

Hands On: Export features

ExportToSpreadsheet ( Galaxy version 3.1.9+galaxy2) with the following parameters:

param-file “Select the input CellProfiler pipeline”: output_pipeline (output of MeasureImageIntensity tool)

“Select the column delimiter”: Tab

“Add a prefix to file names?”: Do not add prefix to the file name

“Create a GenePattern GCT file?”: No

“Export all measurement types?”: Yes

Run the CellProfiler pipeline

All the steps in our workflow (except for the IDR download tool) have been passing through an output_pipeline as a parameter. This was the way to assemble all the modules from CellProfiler, now we can run all of them together!

Hands On: Run CellProfiler pipeline

Run CellProfiler pipeline ( Galaxy version 3.1.9+galaxy1) with the following parameters:

param-file “Pipeline file”: output_pipeline (output of ExportToSpreadsheet tool)

“Are the input images packed into a tar archive?”: Yes

param-file “A tarball of images”: output_tar (output of IDR Download tool)

“Detailed logging file?”: Yes

Comment

This is the only time-consuming step of the workflow, as it needs to perform the whole analysis in the input dataset.

Conclusion

In this tutorial, you have downloaded images from a public image repository into your Galaxy history. After that, you have built and run a typical image analysis pipeline, composed of segmentation of several objects and feature extraction. As an outcome, you got plenty of features to analyse! And some masks to check that the segmentation algorithms worked as expected. Now you are ready to perform your biological data analysis!

You've Finished the Tutorial

Key points

Galaxy workflows can download images from the IDR, selecting specific channels, time points, z-stack positions and crop the image in different ways.

CellProfiler in Galaxy can segment and extract features of any object of interest.

The features and masks can be exported for further analysis.

Frequently Asked Questions

Have questions about this tutorial? Have a look at the available FAQ pages and support channels

Useful literature

Further information, including links to documentation and original publications, regarding the tools, analysis techniques and the interpretation of results described in this tutorial can be found here.

References

Integration of biological data by kernels on graph nodes allows prediction of new genes involved in mitotic chromosome condensation, 2014 Mol Biol Cell. 2014;25(16):2522-2536: 10.1091/mbc.E13-04-0221
CellProfiler 3.0: Next-generation image processing for biology, 2018 PLoS Biol. 16(7):e2005970: 10.1371/journal.pbio.2005970 https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.2005970

Feedback

Did you use this material as an instructor? Feel free to give us feedback on how it went.
Did you use this material as a learner or student? Click the form below to leave feedback.

Citing this Tutorial

Beatriz Serrano-Solano, Jean-Karim Hériché, Nucleoli segmentation and feature extraction using CellProfiler (Galaxy Training Materials). https://training.galaxyproject.org/training-material/topics/imaging/tutorials/tutorial-CP/tutorial.html Online; accessed TODAY
Hiltemann, Saskia, Rasche, Helena et al., 2023 Galaxy Training: A Powerful Framework for Teaching! PLOS Computational Biology 10.1371/journal.pcbi.1010752
Batut et al., 2018 Community-Driven Data Analysis Training for Biology Cell Systems 10.1016/j.cels.2018.05.012

@misc{imaging-tutorial-CP,
author = "Beatriz Serrano-Solano and Jean-Karim Hériché",
	title = "Nucleoli segmentation and feature extraction using CellProfiler (Galaxy Training Materials)",
	year = "",
	month = "",
	day = "",
	url = "\url{https://training.galaxyproject.org/training-material/topics/imaging/tutorials/tutorial-CP/tutorial.html}",
	note = "[Online; accessed TODAY]"
}
@article{Hiltemann_2023,
	doi = {10.1371/journal.pcbi.1010752},
	url = {https://doi.org/10.1371%2Fjournal.pcbi.1010752},
	year = 2023,
	month = {jan},
	publisher = {Public Library of Science ({PLoS})},
	volume = {19},
	number = {1},
	pages = {e1010752},
	author = {Saskia Hiltemann and Helena Rasche and Simon Gladman and Hans-Rudolf Hotz and Delphine Larivi{\`{e}}re and Daniel Blankenberg and Pratik D. Jagtap and Thomas Wollmann and Anthony Bretaudeau and Nadia Gou{\'{e}} and Timothy J. Griffin and Coline Royaux and Yvan Le Bras and Subina Mehta and Anna Syme and Frederik Coppens and Bert Droesbeke and Nicola Soranzo and Wendi Bacon and Fotis Psomopoulos and Crist{\'{o}}bal Gallardo-Alba and John Davis and Melanie Christine Föll and Matthias Fahrner and Maria A. Doyle and Beatriz Serrano-Solano and Anne Claire Fouilloux and Peter van Heusden and Wolfgang Maier and Dave Clements and Florian Heyl and Björn Grüning and B{\'{e}}r{\'{e}}nice Batut and},
	editor = {Francis Ouellette},
	title = {Galaxy Training: A powerful framework for teaching!},
	journal = {PLoS Comput Biol}
}

                   

Congratulations on successfully completing this tutorial!

You can use Ephemeris's shed-tools install command to install the tools used in this tutorial.

shed-tools install [-g GALAXY] [-a API_KEY] -t <(curl https://training.galaxyproject.org/training-material/api/topics/imaging/tutorials/tutorial-CP/tutorial.json | jq .admin_install_yaml -r)

Alternatively you can copy and paste the following YAML

---
install_tool_dependencies: true
install_repository_dependencies: true
install_resolver_dependencies: true
tools:
- name: cp_cellprofiler
  owner: bgruening
  revisions: e917a2424690
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_cellprofiler
  owner: bgruening
  revisions: eb6c08714244
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_common
  owner: bgruening
  revisions: e7273daa5ae2
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_common
  owner: bgruening
  revisions: 6070780813d9
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_convert_objects_to_image
  owner: bgruening
  revisions: df3ad3ba5c6f
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_convert_objects_to_image
  owner: bgruening
  revisions: ef4aed1d34c4
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_display_data_on_image
  owner: bgruening
  revisions: 2484086dee41
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_display_data_on_image
  owner: bgruening
  revisions: 5d28ac336861
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_enhance_or_suppress_features
  owner: bgruening
  revisions: f2ed92b011a9
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_enhance_or_suppress_features
  owner: bgruening
  revisions: bace5a93a97c
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_export_to_spreadsheet
  owner: bgruening
  revisions: f8fa0f6718a3
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_export_to_spreadsheet
  owner: bgruening
  revisions: f24c370e5ea4
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_gray_to_color
  owner: bgruening
  revisions: 31d195bedf6d
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_gray_to_color
  owner: bgruening
  revisions: 3245681b2f6d
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_identify_primary_objects
  owner: bgruening
  revisions: 993faa34333b
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_identify_primary_objects
  owner: bgruening
  revisions: 87d4d92d7487
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_image_math
  owner: bgruening
  revisions: c09dcda1143f
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_image_math
  owner: bgruening
  revisions: ff1a86a87364
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_mask_image
  owner: bgruening
  revisions: 3a05117669cf
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_mask_image
  owner: bgruening
  revisions: 667a513e67e4
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_granularity
  owner: bgruening
  revisions: a1a961c46d4b
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_granularity
  owner: bgruening
  revisions: 2564f461de62
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_image_area_occupied
  owner: bgruening
  revisions: 20dbb1f7ec7b
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_image_area_occupied
  owner: bgruening
  revisions: a32c3f810ade
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_image_intensity
  owner: bgruening
  revisions: 5499dc6f8ac1
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_image_intensity
  owner: bgruening
  revisions: b64af54c5c09
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_image_quality
  owner: bgruening
  revisions: e972e07c7845
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_image_quality
  owner: bgruening
  revisions: d3890df07463
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_object_intensity
  owner: bgruening
  revisions: 980aed9e8fd5
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_object_intensity
  owner: bgruening
  revisions: e52c75a28c4b
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_object_size_shape
  owner: bgruening
  revisions: 79e10d12d90a
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_object_size_shape
  owner: bgruening
  revisions: d85f11570109
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_texture
  owner: bgruening
  revisions: 035a58c7f54a
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_measure_texture
  owner: bgruening
  revisions: b76841ee5560
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_relate_objects
  owner: bgruening
  revisions: 8083a87d2fdf
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_relate_objects
  owner: bgruening
  revisions: 48102a685037
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_save_images
  owner: bgruening
  revisions: 4ecf93628f97
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: cp_save_images
  owner: bgruening
  revisions: 27edbb90ab2b
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: idr_download_by_ids
  owner: iuc
  revisions: e08b1dc0480c
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/
- name: idr_download_by_ids
  owner: iuc
  revisions: f92941d1a85e
  tool_panel_section_label: Imaging
  tool_shed_url: https://toolshed.g2.bx.psu.edu/

No feedback has been recieved yet for this training. Be the first one by filling in the feedback form.