Preparing Biological Datasets for ML