diverse subset from larger screening set

I would like to generate a 'diverse' subset of structures from a much larger vendor library of structures. Any good ways of doing this within PP? I was thinking of a tanimoto metric for dissimilarity.