Integration between Pipeline Pilot and Python has improved considerably over recent versions of Pipeline Pilot, making it possible to incorporate a wide range of functionality through Python scripts. Python is well known for its data science capabilities. The attached example protocol includes several example components that demonstrate using the Jupyter Notebook component to run scikit-learn clustering algorithms (https://scikit-learn.org/stable/modules/clustering.html#clustering).
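As a rough illustration of the kind of scikit-learn clustering code a notebook could run, here is a minimal k-means sketch. It is not taken from the attached protocol: the helper name cluster_rows, the sample data, and the choice of k-means with two clusters are all assumptions made for illustration. How data is passed between Pipeline Pilot and the notebook is not shown.

```python
import numpy as np
from sklearn.cluster import KMeans


def cluster_rows(features, n_clusters=2, seed=0):
    """Return one cluster label per row of a numeric feature matrix.

    Hypothetical helper for illustration; not part of the example protocol.
    """
    data = np.asarray(features, dtype=float)
    # A fixed random_state makes the run reproducible; n_init is set
    # explicitly because its default differs between scikit-learn versions.
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    return model.fit_predict(data)


# Made-up data: two well-separated groups of 2-D points.
features = [
    [1.0, 1.1], [0.9, 1.0], [1.2, 0.8],
    [8.0, 8.2], [8.1, 7.9], [7.8, 8.0],
]

labels = cluster_rows(features)
print(labels)  # one integer label per row; which group gets 0 or 1 is arbitrary
```

Other estimators from the scikit-learn clustering module, such as DBSCAN or AgglomerativeClustering, also provide fit_predict, so the same pattern should carry over with different constructor parameters.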