Workflows are really about:
I became familiar with Galaxy!
(and spent more time with researchers)
Installing a framework / tool / docker / plugin Just use your browser
Installing tools Just use your browser
YAML / Python / text editor faffing Just use (the GUI) in your browser
"Yeah you can use my workflow! Clone this repo, install this tool, configure this library, download this file" "Just go to [your URL] in your browser"
Well, maybe. But as a group we should pick one.
List of 361 362 workflow engines: https://github.com/common-workflow-language/common-workflow-language/wiki/Existing-Workflow-systems List of 260 workflow engines: https://github.com/pditommaso/awesome-pipeline
Let's look at a worked example
To mine data from a database of phrases
But... how do we start?
I thought this for a long time but now have an answer!
Clone this repo for an example self hosted Galaxy using Docker
<tool id="letter_count" name="Letter Count" version="0.1.0"> <description>Counting letters in a string</description> <requirements> <container type="docker">python:3.13-slim</container> </requirements> <command> <![CDATA[ python '$__tool_directory__/letter_count.py' '$input_file' '$output_file' ]]> </command> <inputs> <param type="data" name="input_file" label="Sentance File" help="Enter the string to count the letters" /> </inputs> <outputs> <data format="txt" name="output_file" label="Letter Count" help="Output file containing the letter count" /> </outputs> <help> This tool counts the number of letter in each word of a string. </help> </tool>
Well...
We trail a persistant, central Galaxy instance for RSEs
We put a workstation under a desk
Even if we don't start an instance:
Gutenberg moment? Opening Up Research