Preparing data for use with clustering techniques