Papers on datasets for machine learning for protein function Background Collection of datasets for protein function prediction. Format Within each category, papers are listed in reverse chronological order (newest first). Where possible, a link should be provided. Categories Single Mutations Single mutations